Apache Hadoop (394)
MapReduce (354)
Apache Hive (345)
Apache Pig (225)
Apache Spark (991)
Apache HBase (164)
Apache Flume (95)
Apache Impala (72)
Apache Cassandra (392)
Apache Mahout (35)
Apache Sqoop (82)
Apache ZooKeeper (65)
Apache Ambari (93)
Apache HCatalog (34)
Apache HDFS Hadoop Distributed File System (214)
Apache Kafka (189)
Apache Avro (26)
Apache Presto (15)
Apache Tajo (26)
Hadoop General (407) What is rack awareness in hadoop?
Why apache spark is faster than hadoop?
What problems have you faced when you are working on Hadoop code?
If map reduce is inferior to spark then is there any benefit of learning it?
Explain the Job OutputFormat?
Is there any benefit of learning mapreduce if spark is better than mapreduce?
What is Apache Spark Machine learning library?
What is Ambari shell?
Does Apache Flume provide support for third party plug-ins?
Name different types of NoSQL database?
Give some points of hive for hadoop ?
What is the difference between Pig and SQL?
Explain the core components of hadoop?
What is spark client?
Does impala use caching?