Apache Hadoop (394)
MapReduce (354)
Apache Hive (345)
Apache Pig (225)
Apache Spark (991)
Apache HBase (164)
Apache Flume (95)
Apache Impala (72)
Apache Cassandra (392)
Apache Mahout (35)
Apache Sqoop (82)
Apache ZooKeeper (65)
Apache Ambari (93)
Apache HCatalog (34)
Apache HDFS Hadoop Distributed File System (214)
Apache Kafka (189)
Apache Avro (26)
Apache Presto (15)
Apache Tajo (26)
Hadoop General (407) What is dataframe in spark?
What is the difference between apache mahout and apache spark’s mllib?
Explain the features of pseudo mode?
Can a spark cause a fire?
Name different types of NoSQL database?
What is in memory in spark?
What is the spark driver?
Which interface needs to be implemented to create Mapper and Reducer for the Hadoop?
How many distinct layers are of storm’s codebase?
What is check pointing in hadoop?
Explain the operation reduce() in Spark?
what is a sequence file in Hadoop?
What are the use cases of Apache Pig?
Mention what is the best way to copy files between hdfs clusters?
Explain the composite key?