Apache Hadoop (394)
MapReduce (354)
Apache Hive (345)
Apache Pig (225)
Apache Spark (991)
Apache HBase (164)
Apache Flume (95)
Apache Impala (72)
Apache Cassandra (392)
Apache Mahout (35)
Apache Sqoop (82)
Apache ZooKeeper (65)
Apache Ambari (93)
Apache HCatalog (34)
Apache HDFS Hadoop Distributed File System (214)
Apache Kafka (189)
Apache Avro (26)
Apache Presto (15)
Apache Tajo (26)
Hadoop General (407)
Apache Flume support third-party plugins also?
What are the complicated steps in Flume configurations?
Describe Accumulator in detail in Apache Spark?
Can Flume can distribute data to multiple destinations?
Is there an api for implementing graphs in spark?
How Big is ‘Big Data’?
What is mapreduce algorithm?
After increasing the replication level, I still see that data is under replicated. What could be wrong?
What happens to rdd when one of the nodes on which it is distributed goes down?
What is the difference between namenode, backup node and checkpoint namenode?
What is the use of InputFormat in MapReduce process?
What is graph db? Explain with an example.
What is the core of the job in MapReduce framework?
Can sqoop use spark?
Can you define yarn?