What is CQL?
What is the HDFS block size, and what did you choose in your project?
Explain transformations and actions in the context of RDDs.
Name the most common input formats defined in Hadoop.
What is a Sqoop job?
What is the primary purpose of Flume in the Hadoop architecture?
Explain caching in Spark Streaming.
Why is Apache Spark faster than Hadoop MapReduce?
Explain the lookup() operation in Spark.
How does an RDD persist data?
What is map in Apache Spark?
Describe how HBase uses ZooKeeper.
What is rack awareness, and why is it needed in Hadoop?
Name the job control options specified by MapReduce.
Differentiate between DROP and TRUNCATE in cqlsh.