Apache Hadoop (394)
MapReduce (354)
Apache Hive (345)
Apache Pig (225)
Apache Spark (991)
Apache HBase (164)
Apache Flume (95)
Apache Impala (72)
Apache Cassandra (392)
Apache Mahout (35)
Apache Sqoop (82)
Apache ZooKeeper (65)
Apache Ambari (93)
Apache HCatalog (34)
Apache HDFS Hadoop Distributed File System (214)
Apache Kafka (189)
Apache Avro (26)
Apache Presto (15)
Apache Tajo (26)
Hadoop General (407)
How to set which framework would be used to run mapreduce program?
How is indexing done in HDFS?
What are the Hadoop features extended to its eco-system components ?
What types of costs are associated with creating the index on hive tables?
Explain the core components of hadoop?
Explain the features of pseudo mode?
Define HDFS and talk about their respective components?
Explain HCatInputFormat and HCatOutputFormat?
What are the design goals of zookeeper?
Explain fullOuterJoin() operation in Apache Spark?
What is the hadoop-core configuration?
What is the process to perform an incremental data load in Sqoop?
What are the three layers where the hadoop components are actually supported by ambari?
What does apache mahout do?
In hbase what is column families?