Apache Hadoop (394)
MapReduce (354)
Apache Hive (345)
Apache Pig (225)
Apache Spark (991)
Apache HBase (164)
Apache Flume (95)
Apache Impala (72)
Apache Cassandra (392)
Apache Mahout (35)
Apache Sqoop (82)
Apache ZooKeeper (65)
Apache Ambari (93)
Apache HCatalog (34)
Apache HDFS Hadoop Distributed File System (214)
Apache Kafka (189)
Apache Avro (26)
Apache Presto (15)
Apache Tajo (26)
Hadoop General (407) What are the independent extensions that contributed to the ambari codebase?
Explain about tajo worker configuration?
What does rdd stand for in logistics?
Why is space not freed up when I issue drop table?
What is configuration of a typical slave node on Hadoop cluster? How many JVMs run on a slave node?
We have already sql then why nosql?
What is lazy evaluation in Spark?
Define the level of parallelism and its need in spark streaming?
Is Hive supports Temporary Tables?
How do you define "block" in HDFS?
What are the abstractions of Apache Spark?
How do I use spark with big data?
What is spark tool?
How does Hadoop Classpath plays a vital role in stopping or starting in Hadoop daemons?
Explain the difference between gen1 and gen2 hadoop with regards to the namenode?