Apache Hadoop (394)
MapReduce (354)
Apache Hive (345)
Apache Pig (225)
Apache Spark (991)
Apache HBase (164)
Apache Flume (95)
Apache Impala (72)
Apache Cassandra (392)
Apache Mahout (35)
Apache Sqoop (82)
Apache ZooKeeper (65)
Apache Ambari (93)
Apache HCatalog (34)
Apache HDFS Hadoop Distributed File System (214)
Apache Kafka (189)
Apache Avro (26)
Apache Presto (15)
Apache Tajo (26)
Hadoop General (407) Explain the difference between an hdfs block and input split?
Specify the different types of tables accessible in hive?
Explain bagtotuple?
Mention what is the best way to copy files between hdfs clusters?
Explain what is zookeeper in kafka? Can we use kafka without zookeeper?
Can you explain how do ‘map’ and ‘reduce’ work?
What is a cell in hbase?
What is presto?
What is vectorized query execution?
What is the difference between map and flatmap?
What are benefits of DataFrame in Spark?
How are joins performed in impala?
Does cassandra support acid tractions?
What is the significance of cluster class in Cassandra?
Is hive similar to sql?