Apache Hadoop (394)
MapReduce (354)
Apache Hive (345)
Apache Pig (225)
Apache Spark (991)
Apache HBase (164)
Apache Flume (95)
Apache Impala (72)
Apache Cassandra (392)
Apache Mahout (35)
Apache Sqoop (82)
Apache ZooKeeper (65)
Apache Ambari (93)
Apache HCatalog (34)
Apache HDFS Hadoop Distributed File System (214)
Apache Kafka (189)
Apache Avro (26)
Apache Presto (15)
Apache Tajo (26)
Hadoop General (407)
What is difference between secondary namenode, checkpoint namenode & backupnod secondary namenode, a poorly named component of hadoop?
Name some companies that are already using Spark Streaming?
What is streaming?
What do you understand by cluster in cassandra?
what are the steps involved in commissioning adding
Is kafka big data?
How can you import only a subset of rows from a table?
What is the difference between the external table and managed table?
Which data storage components are used by hadoop?
Can impala do user-defined functions (udfs)?
What are the applications of Apache ZooKeeper?
What is Safemode in Apache Hadoop?
Why Do We Need Apache Pig?
How do we write our own custom serde?
What is an Agent?