Apache Hadoop (394)
MapReduce (354)
Apache Hive (345)
Apache Pig (225)
Apache Spark (991)
Apache HBase (164)
Apache Flume (95)
Apache Impala (72)
Apache Cassandra (392)
Apache Mahout (35)
Apache Sqoop (82)
Apache ZooKeeper (65)
Apache Ambari (93)
Apache HCatalog (34)
Apache HDFS Hadoop Distributed File System (214)
Apache Kafka (189)
Apache Avro (26)
Apache Presto (15)
Apache Tajo (26)
Hadoop General (407)
What are benefits of DataFrame in Spark?
Define Nodetool Utility?
What is a bloom filter and how does it help in searching rows?
Define a task tracker?
What are the different types of tables available in Hive?
How will you update the rows that are already exported?
How is the splitting of file invoked in Hadoop ?
Define the purpose of the partition function in mapreduce framework
What happens to a namenode, when job tracker is down?
Difference between hbase and rdbms?
What is a tuple in pig?
Mention what does the shell commands “capture” and “consistency” determines?
Can hive run without hadoop?
Do I need to know hadoop to learn spark?
How data or file is read in Hadoop HDFS?