What are the different ways of representing data in Spark?
Can you explain ZooKeeper leader election?
How does the Pig platform handle relational systems data?
What do you know about transformations in Spark?
What is the use of YCSB?
What is RDD lineage graph? How does it enable fault-tolerance in Spark?
What does FOREACH do?
What does DAG stand for?
When should you use --target-dir and when should you use --warehouse-dir while importing data?
What are Impala's built-in functions?
What is Fault Tolerance?
What is the process for starting a Kafka server?
What is a MapFile?
Can you explain TextInputFormat?
What is WebDAV in Hadoop?