Apache Hadoop (394)
MapReduce (354)
Apache Hive (345)
Apache Pig (225)
Apache Spark (991)
Apache HBase (164)
Apache Flume (95)
Apache Impala (72)
Apache Cassandra (392)
Apache Mahout (35)
Apache Sqoop (82)
Apache ZooKeeper (65)
Apache Ambari (93)
Apache HCatalog (34)
Apache HDFS Hadoop Distributed File System (214)
Apache Kafka (189)
Apache Avro (26)
Apache Presto (15)
Apache Tajo (26)
Hadoop General (407)
Mention the common features in Pig and Hive?
Explain the different logging levels in cassandra.
Can I do trforms or add new functionality?
How would you use Map/Reduce to split a very large graph into smaller pieces and parallelize the computation of edges according to the fast/dynamic change of data?
What do you understand by column family?
When to use secondary indexes?
What are the advantages of DataFrame?
What is partitioner and its usage?
What is the latest version of spark?
Explain what happens in textinformat ?
What is a shuffle block in spark?
What is Counter in MapReduce?
How to use combiner in hadoop ?
How can we create children / sub-znode?
Do I need to learn scala for spark?