Apache Hadoop (394)
MapReduce (354)
Apache Hive (345)
Apache Pig (225)
Apache Spark (991)
Apache HBase (164)
Apache Flume (95)
Apache Impala (72)
Apache Cassandra (392)
Apache Mahout (35)
Apache Sqoop (82)
Apache ZooKeeper (65)
Apache Ambari (93)
Apache HCatalog (34)
Apache HDFS Hadoop Distributed File System (214)
Apache Kafka (189)
Apache Avro (26)
Apache Presto (15)
Apache Tajo (26)
Hadoop General (407) Explain first() operation in Spark?
What are Prerequisites to learn Avro?
For using hadoop list the network requirements?
What happens to existing data in my cluster when I add new nodes?
What are clusters in cassandra?
What is difference between spark and hadoop?
In Hive, how can you enable buckets?
Can impala be used for complex event processing?
How is Spark not quite the same as MapReduce? Is Spark quicker than MapReduce?
What do you understand by sstabl in cassandra?
what is JobTracker in Hadoop? What are the actions followed by Hadoop?
What are the types of transformation in RDD in Apache Spark?
What are the different components of a Hive query processor?
What is the Job interface in MapReduce framework?
What are nodes and ephemeral nodes?