Apache Hadoop (394)
MapReduce (354)
Apache Hive (345)
Apache Pig (225)
Apache Spark (991)
Apache HBase (164)
Apache Flume (95)
Apache Impala (72)
Apache Cassandra (392)
Apache Mahout (35)
Apache Sqoop (82)
Apache ZooKeeper (65)
Apache Ambari (93)
Apache HCatalog (34)
Apache HDFS (Hadoop Distributed File System) (214)
Apache Kafka (189)
Apache Avro (26)
Apache Presto (15)
Apache Tajo (26)
Hadoop General (407)
How many compaction types are in HBase?
How does the JobTracker schedule a task?
What is the difference between Spark and Scala?
What are the different commands used to startup and shutdown Hadoop daemons?
When should you use Cassandra?
Compare Spark vs Hadoop MapReduce
What is the Spark client?
What is the latest version of Sqoop?
What are shared variables?
Define "Transformations" in Spark
How do you optimize a Hadoop MapReduce job?
Is Hadoop open source?
Is HQL case-sensitive?
If the source data gets updated every now and then, how will you synchronize the data in HDFS that is imported by Sqoop?
How is data partitioned before it is sent to the reducer if no custom partitioner is defined in Hadoop?