Apache Hadoop (394)
MapReduce (354)
Apache Hive (345)
Apache Pig (225)
Apache Spark (991)
Apache HBase (164)
Apache Flume (95)
Apache Impala (72)
Apache Cassandra (392)
Apache Mahout (35)
Apache Sqoop (82)
Apache ZooKeeper (65)
Apache Ambari (93)
Apache HCatalog (34)
Apache HDFS (Hadoop Distributed File System) (214)
Apache Kafka (189)
Apache Avro (26)
Apache Presto (15)
Apache Tajo (26)
Hadoop General (407)
What are consumers or users?
What is Apache Spark for beginners?
What are the different tools used for Ambari monitoring purposes?
What are "map" and "reducer" in Hadoop?
What is the use of the cqlsh EXPAND command in Cassandra?
What is the JobTracker and what does it do in a Hadoop cluster?
How do I use Spark with big data?
Why do we need indexing?
What do you mean by logging in Cassandra?
Can you define SerDe in Hive?
What is coalesce in Spark SQL?
Why is Spark faster than Hive?
What is a Spark shuffle?
Who divides a file into blocks when it is stored in HDFS in Hadoop?
What happens if the number of reducers is 0 in Hadoop?