Apache Hadoop (394)
MapReduce (354)
Apache Hive (345)
Apache Pig (225)
Apache Spark (991)
Apache HBase (164)
Apache Flume (95)
Apache Impala (72)
Apache Cassandra (392)
Apache Mahout (35)
Apache Sqoop (82)
Apache ZooKeeper (65)
Apache Ambari (93)
Apache HCatalog (34)
Apache HDFS Hadoop Distributed File System (214)
Apache Kafka (189)
Apache Avro (26)
Apache Presto (15)
Apache Tajo (26)
Hadoop General (407)
What are the most commonly defined input formats in Hadoop?
Does spark load all data in memory?
What are the abstractions of Apache Spark?
What is key-value store db? Explain with an example.
What is map side join?
List out the different stream grouping in apache storm?
What are the main components in Hadoop Eco-System and what are their functions ?
Explain the core benefits for hadoop users by using the apache ambari?
Explain pigdump function?
Explain what happens in textinformat ?
If there is certain data that we want to use again and again in different transformations, what should improve the performance?
How can we create rdds in apache spark?
What is the difference between hadoop and spark?
Explain leftOuterJoin() and rightOuterJoin() operation in Apache Spark?
how will you implement SQL in Spark?