Apache Hadoop (394)
MapReduce (354)
Apache Hive (345)
Apache Pig (225)
Apache Spark (991)
Apache HBase (164)
Apache Flume (95)
Apache Impala (72)
Apache Cassandra (392)
Apache Mahout (35)
Apache Sqoop (82)
Apache ZooKeeper (65)
Apache Ambari (93)
Apache HCatalog (34)
Apache HDFS Hadoop Distributed File System (214)
Apache Kafka (189)
Apache Avro (26)
Apache Presto (15)
Apache Tajo (26)
Hadoop General (407)
What is the use of MasterServer?
Mention what is the difference between Hbase and Hive?
What do you understand by the term snitch in cassandra? Give some example.
How many Daemon processes run on a Hadoop system?
Define “speculative execution” in hadoop?
Hadoop Libraries and Utilities and Miscellaneous Hadoop Applications?
How did you debug your Hadoop code ?
Is it possible to change the default location of Managed Tables in Hive, if so how?
What exactly is apache spark?
What is the difference between Cassandra and Hadoop ?
What is the communication channel between client and namenode/datanode?
Define SSTable?
How mahout used with python ?
How does data transfer happen from hdfs to hive?
State some advantages of impala?