Apache Hadoop (394)
MapReduce (354)
Apache Hive (345)
Apache Pig (225)
Apache Spark (991)
Apache HBase (164)
Apache Flume (95)
Apache Impala (72)
Apache Cassandra (392)
Apache Mahout (35)
Apache Sqoop (82)
Apache ZooKeeper (65)
Apache Ambari (93)
Apache HCatalog (34)
Apache HDFS (Hadoop Distributed File System) (214)
Apache Kafka (189)
Apache Avro (26)
Apache Presto (15)
Apache Tajo (26)
Hadoop General (407)
Which port does SSH work on?
What are the operational commands of HBase?
How to come out of the insert mode?
Why is the output of map tasks stored (spilled) to local disk and not in HDFS?
What problem does Apache Flume solve?
What happens if you get a ‘connection refused java exception’ when you type hadoop fsck /?
What kinds of Impala queries or data are best suited for HBase?
What are the different execution modes available in Pig?
What will happen in case you have not issued the command?
Why is comparison of types important for MapReduce?
What happens to ZooKeeper sessions while the cluster is down?
How to load data into a table created in Hive?
How does Apache Spark work?
What is the difference between the NameNode, Backup Node and Checkpoint Node?
What is the difference between map and flatMap in Spark?