Apache Hadoop (394)
MapReduce (354)
Apache Hive (345)
Apache Pig (225)
Apache Spark (991)
Apache HBase (164)
Apache Flume (95)
Apache Impala (72)
Apache Cassandra (392)
Apache Mahout (35)
Apache Sqoop (82)
Apache ZooKeeper (65)
Apache Ambari (93)
Apache HCatalog (34)
Apache HDFS Hadoop Distributed File System (214)
Apache Kafka (189)
Apache Avro (26)
Apache Presto (15)
Apache Tajo (26)
Hadoop General (407)
What is LazyOutputFormat in MapReduce?
Explain apache kafka?
Mention what is rack awareness?
How do I start a spark server?
What is pagerank in graphx?
Explain why the name ‘hadoop’?
What is fsck?
What is the difference between RDBMS with Hadoop MapReduce?
If data is present in HDFS and RF is defined, then how can we change Replication Factor?
How can client interact with Hive?
What are the different Complex Data Types available in Hive?
Does spark store data?
How to create hadoop archive?
What is the role zookeeper plays in a cluster of kafka?
What are the 2 modes used to run pig scripts?