Apache Hadoop (394)
MapReduce (354)
Apache Hive (345)
Apache Pig (225)
Apache Spark (991)
Apache HBase (164)
Apache Flume (95)
Apache Impala (72)
Apache Cassandra (392)
Apache Mahout (35)
Apache Sqoop (82)
Apache ZooKeeper (65)
Apache Ambari (93)
Apache HCatalog (34)
Apache HDFS Hadoop Distributed File System (214)
Apache Kafka (189)
Apache Avro (26)
Apache Presto (15)
Apache Tajo (26)
Hadoop General (407)
Is it necessary to write a mapreduce job in java?
Explain HCatLoader and HCatStorer APIs?
Mention what is the hadoop mapreduce apis contract for a key and value class?
What is the difference between apache mahout and spark mllib ?
What is the roadmap for apache driver version one.0?
Explain how does hadoop classpath plays a vital role in stopping or starting in hadoop daemons?
Compare Apache Hadoop and Apache Spark?
Explain avrostorage function?
What are the DDL commands used in hbase?
Which classes are used by the hive to read and write hdfs files?
What is the benifit of Distributed cache, why can we just have the file in HDFS and have the application read it?
Are Namenode and job tracker on the same host?
What is the heartbeat used for?
What is the key difference between textfile and wholetextfile method?
Explain the process that overwrites the replication factors in HDFS?