Apache Hadoop (394)
MapReduce (354)
Apache Hive (345)
Apache Pig (225)
Apache Spark (991)
Apache HBase (164)
Apache Flume (95)
Apache Impala (72)
Apache Cassandra (392)
Apache Mahout (35)
Apache Sqoop (82)
Apache ZooKeeper (65)
Apache Ambari (93)
Apache HCatalog (34)
Apache HDFS (Hadoop Distributed File System) (214)
Apache Kafka (189)
Apache Avro (26)
Apache Presto (15)
Apache Tajo (26)
Hadoop General (407)
List the various options available with the Hive command.
State some highlights of Ambari.
What do you mean by replication strategy?
What is Cassandra?
How can we change the replication factor?
What are the different modes for running Pig?
What are combiners? When should I use a combiner in my MapReduce job?
What do you think about speculative execution?
What are the main benefits of using Cassandra?
Explain the functionality of ObjectInspector.
What are Pig's execution modes?
How many types of NoSQL databases are there?
Which machine learning algorithms does Apache Mahout support?
How can native libraries be included in YARN jobs?
What is Mapper in Hadoop?