Apache Hadoop (394)
MapReduce (354)
Apache Hive (345)
Apache Pig (225)
Apache Spark (991)
Apache HBase (164)
Apache Flume (95)
Apache Impala (72)
Apache Cassandra (392)
Apache Mahout (35)
Apache Sqoop (82)
Apache ZooKeeper (65)
Apache Ambari (93)
Apache HCatalog (34)
Apache HDFS (Hadoop Distributed File System) (214)
Apache Kafka (189)
Apache Avro (26)
Apache Presto (15)
Apache Tajo (26)
Hadoop General (407)
How is data or a file written into Hadoop HDFS?
Can you explain broadcast variables?
How to change the number of mappers running on a slave in MapReduce?
Explain how the JobTracker schedules a task.
Can you explain Spark RDD?
What are Impala data types?
What is the difference between HBase and Hive?
What does Impala do for fast access?
What does shuffling do?
Explain the foreach() operation in Apache Spark.
In which kinds of scenarios will MapReduce jobs be more useful than Pig in Hadoop?
Does Spark use ZooKeeper?
Explain ZooKeeper queues.
What is a slot in Hadoop v1? Why was it removed from Hadoop v2?
What are Apache Tajo SQL functions?