Apache Hadoop (394)
MapReduce (354)
Apache Hive (345)
Apache Pig (225)
Apache Spark (991)
Apache HBase (164)
Apache Flume (95)
Apache Impala (72)
Apache Cassandra (392)
Apache Mahout (35)
Apache Sqoop (82)
Apache ZooKeeper (65)
Apache Ambari (93)
Apache HCatalog (34)
Apache HDFS (Hadoop Distributed File System) (214)
Apache Kafka (189)
Apache Avro (26)
Apache Presto (15)
Apache Tajo (26)
Hadoop General (407)
Ideally, what should the replication factor be in a Hadoop cluster?
What is the difference between MapReduce and Pig?
How can I import large objects (BLOB and CLOB objects) in Apache Sqoop?
Why are transformations lazy in Spark?
What is speculative execution in Spark?
What is a heartbeat in Hadoop?
Explain HDFS.
Describe how HBase uses ZooKeeper.
What is the best method for storing objects in Cassandra?
What do you know about YARN?
What is the configuration of a typical slave node on a Hadoop cluster? How many JVMs run on a slave node?
On what basis is data stored on a rack?
How to resolve IOException: Cannot create directory
Is bigger than spark.driver.maxResultSize?
What is BloomMapFile?