Apache Hadoop (394)
MapReduce (354)
Apache Hive (345)
Apache Pig (225)
Apache Spark (991)
Apache HBase (164)
Apache Flume (95)
Apache Impala (72)
Apache Cassandra (392)
Apache Mahout (35)
Apache Sqoop (82)
Apache ZooKeeper (65)
Apache Ambari (93)
Apache HCatalog (34)
Apache HDFS (Hadoop Distributed File System) (214)
Apache Kafka (189)
Apache Avro (26)
Apache Presto (15)
Apache Tajo (26)
Hadoop General (407)
What are the advantages of DataSets?
Do I need to install Hadoop for Spark?
What is the usage of the "cqlsh --version" command?
What is hinted handoff?
What is a TaskTracker in Hadoop?
Does 'ILLUSTRATE' run an MR job?
How does lazy evaluation work in Spark?
When is it suggested to use a combiner in a MapReduce job?
How can an application connect to Hive when it runs as a server?
How does a Bloom filter help in searching rows?
What is a NameNode in Hadoop?
What is a block in HDFS?
What is the role of coalesce() and repartition() in Map Reduce?
What is a NoSQL database?
How many types of Ambari repositories are available?