Apache Hadoop (394)
MapReduce (354)
Apache Hive (345)
Apache Pig (225)
Apache Spark (991)
Apache HBase (164)
Apache Flume (95)
Apache Impala (72)
Apache Cassandra (392)
Apache Mahout (35)
Apache Sqoop (82)
Apache ZooKeeper (65)
Apache Ambari (93)
Apache HCatalog (34)
Apache HDFS Hadoop Distributed File System (214)
Apache Kafka (189)
Apache Avro (26)
Apache Presto (15)
Apache Tajo (26)
Hadoop General (407) Mention Hadoop core components?
What is the main difference between Kafka and Flume?
Define HDFS and talk about their respective components?
What is serialization in spark?
How can you add a new partition for the month December in the above partitioned table?
What are the Advantages of using Cassandra ?
What is the latest version of sqoop?
Mention what is rack awareness?
What is the difference between Apache Hadoop and RDBMS?
What is spark deploy mode?
Use of import-all-tables command in hadoop sqoop?
How can one increase replication factor to a desired value in Hadoop?
Why is spark good?
What is a Heartbeat in Hadoop?
Explain in brief what is the architecture of Spark?