Apache Hadoop (394)
MapReduce (354)
Apache Hive (345)
Apache Pig (225)
Apache Spark (991)
Apache HBase (164)
Apache Flume (95)
Apache Impala (72)
Apache Cassandra (392)
Apache Mahout (35)
Apache Sqoop (82)
Apache ZooKeeper (65)
Apache Ambari (93)
Apache HCatalog (34)
Apache HDFS Hadoop Distributed File System (214)
Apache Kafka (189)
Apache Avro (26)
Apache Presto (15)
Apache Tajo (26)
Hadoop General (407)
Define the difference between hive and hbase?
Give examples of some companies that are using Hadoop structure?
What is the use of context object?
Mention what happens if the preferred replica is not in the ISR?
What database does spark use?
List some commonly used Machine Learning Algorithm Apache Spark?
Is it possible to provide multiple input to Hadoop? If yes then how can you give multiple directories as input to the Hadoop job?
What is the difference between like and rlike operators in hive?
What are the most common InputFormats in Hadoop?
What is apache spark and what is it used for?
What are the optimization techniques in spark?
What is the difference between apache mahout and spark mllib ?
What are apache tajo sql functions?
What is Apache Hadoop YARN?
What do you understand by cassandra?