Apache Hadoop (394)
MapReduce (354)
Apache Hive (345)
Apache Pig (225)
Apache Spark (991)
Apache HBase (164)
Apache Flume (95)
Apache Impala (72)
Apache Cassandra (392)
Apache Mahout (35)
Apache Sqoop (82)
Apache ZooKeeper (65)
Apache Ambari (93)
Apache HCatalog (34)
Apache HDFS (Hadoop Distributed File System) (214)
Apache Kafka (189)
Apache Avro (26)
Apache Presto (15)
Apache Tajo (26)
Hadoop General (407)
Which machine learning algorithms does Apache Mahout support?
What is shuffling in MapReduce?
Why is Pig used in Hadoop?
What is a worker node?
Which operating systems does Cassandra support?
Is map like a pointer?
How is an RDD distributed?
What is a ZooKeeper cluster?
What types of Ambari repositories are available?
What is the current version of Hive?
What are InputSplit and RecordReader, and what do they do?
Is it possible to create a Cartesian join between two tables using Hive?
In what ways does Apache Spark handle accumulated metadata?
What are the basics of the ZooKeeper API?
What is the difference between the LIKE and RLIKE operators in Hive?