What are the different data formats supported by Apache Tajo?
Why would you use HBase?
Explain the write-ahead log (journaling) in Spark.
What is Flatten?
How can you send messages in Kafka?
What is the use of Flume in Hadoop?
What are the core components of a distributed Spark application?
How do you set up a local repository manually?
When are the reducers started in a MapReduce job?
What do the shell commands “Capture” and “Consistency” determine?
What are accumulators in Spark?
Differentiate between the various types of primary keys in Cassandra.
What is Rack Awareness? Why is it needed in Hadoop?
Can you explain rack awareness?
What is the sequence of execution of Mapper, Combiner, and Partitioner in MapReduce?