Apache Hadoop (394)
MapReduce (354)
Apache Hive (345)
Apache Pig (225)
Apache Spark (991)
Apache HBase (164)
Apache Flume (95)
Apache Impala (72)
Apache Cassandra (392)
Apache Mahout (35)
Apache Sqoop (82)
Apache ZooKeeper (65)
Apache Ambari (93)
Apache HCatalog (34)
Apache HDFS Hadoop Distributed File System (214)
Apache Kafka (189)
Apache Avro (26)
Apache Presto (15)
Apache Tajo (26)
Hadoop General (407) What is the command to start and stop the Spark in an interactive shell?
Why Avro?
What is the purpose of textinputformat?
What is nlineoutputformat?
Why do we need Hadoop?
Should the region server be located on all DataNodes?
What are the limitations of Hive?
What is Hadoop Custom partitioner ?
What is yarn in hadoop?
Can multiple clients write into an HDFS file concurrently?
Who should learn Apache Ambari?
How data or file is written into Hadoop HDFS?
What is wal and hlog in hbase?
Explain how you can reduce churn in isr? When does broker leave the isr?
What will be the output of cast ('XYZ' as INT)?