Apache Hadoop (394)
MapReduce (354)
Apache Hive (345)
Apache Pig (225)
Apache Spark (991)
Apache HBase (164)
Apache Flume (95)
Apache Impala (72)
Apache Cassandra (392)
Apache Mahout (35)
Apache Sqoop (82)
Apache ZooKeeper (65)
Apache Ambari (93)
Apache HCatalog (34)
Apache HDFS Hadoop Distributed File System (214)
Apache Kafka (189)
Apache Avro (26)
Apache Presto (15)
Apache Tajo (26)
Hadoop General (407) On what basis data will be stored on a rack?
What is a bloom filter?
Difference between Sqoop and Cassandra?
How do I start flume agent?
What was the design goal of Cassandra?
What is the FlatMap Transformation in Apache Spark RDD?
What is the function of "MLlib"?
How tables are managed in apache tajo?
Is spark faster than hadoop?
Can we use Ambari Python Client to use of Ambari API’s?
Describe SPM?
What do you know about nlineoutputformat?
Why replication is required in Kafka?
Explain about tablespace?
Why is output file name in Hadoop MapReduce part-r-00000?