Apache Hadoop (394)
MapReduce (354)
Apache Hive (345)
Apache Pig (225)
Apache Spark (991)
Apache HBase (164)
Apache Flume (95)
Apache Impala (72)
Apache Cassandra (392)
Apache Mahout (35)
Apache Sqoop (82)
Apache ZooKeeper (65)
Apache Ambari (93)
Apache HCatalog (34)
Apache HDFS Hadoop Distributed File System (214)
Apache Kafka (189)
Apache Avro (26)
Apache Presto (15)
Apache Tajo (26)
Hadoop General (407)
Define a daemon?
Explain the lookup() operation in Spark?
Why is block size set to 128 MB in Hadoop HDFS?
On which port does ssh work?
Where are hadoop’s configuration files located and list them?
What is pagerank in graphx?
What do you mean by Schema Resolution?
What are brokers in kafka?
Explain how does hbase actually delete a row?
What are producers in kafka?
Explain what is storage and compute nodes?
What is shuffling and sorting in Hadoop MapReduce?
What is difference between spark and mapreduce?
What are the side effects of not running a secondary name node?
What are the most common OutputFormat in Hadoop?