Apache Hadoop (394)
MapReduce (354)
Apache Hive (345)
Apache Pig (225)
Apache Spark (991)
Apache HBase (164)
Apache Flume (95)
Apache Impala (72)
Apache Cassandra (392)
Apache Mahout (35)
Apache Sqoop (82)
Apache ZooKeeper (65)
Apache Ambari (93)
Apache HCatalog (34)
Apache HDFS Hadoop Distributed File System (214)
Apache Kafka (189)
Apache Avro (26)
Apache Presto (15)
Apache Tajo (26)
Hadoop General (407) What kind of data warehouse application is suitable for Hive? What are the types of tables in Hive?
What are the components used in Hive query processor?
How will you make changes to the default configuration files?
What does consumer api in kafka?
In how many ways RDDs can be created? Explain.
How can Spark be connected to Apache Mesos?
What are mapreduce new and old apis while writing map reduce program?. Explain how it works
Is apache spark a programming language?
How does pipe operation writes the result to standard output in Apache Spark?
How would you use Map/Reduce to split a very large graph into smaller pieces and parallelize the computation of edges according to the fast/dynamic change of data?
Define Apache Pig?
What are common uses of Apache Spark?
How are sparks created?
State one best feature of Kafka?
Is it necessary to know java to learn hadoop?