What are decorating filters?
What do you know about KeyValueTextInputFormat?
What is the function of the JobTracker in Hadoop? How many JobTracker instances run on a Hadoop cluster?
What is the use of Apache Mahout?
State the limitations of Apache Pig.
Briefly explain what an action is in Apache Spark. How is the final result generated using an action?
Do we need to install Scala for Spark?
Is Hadoop a memory?
How can a multi-hop agent be set up in Flume?
What is the difference between Sqoop and Hive?
Why Avro?
Why HDFS?
In what ways is SparkSession different from SparkContext?
What is HDFS in big data?
What are the disadvantages of Hadoop serialization?