Apache Hadoop (394)
MapReduce (354)
Apache Hive (345)
Apache Pig (225)
Apache Spark (991)
Apache HBase (164)
Apache Flume (95)
Apache Impala (72)
Apache Cassandra (392)
Apache Mahout (35)
Apache Sqoop (82)
Apache ZooKeeper (65)
Apache Ambari (93)
Apache HCatalog (34)
Apache HDFS Hadoop Distributed File System (214)
Apache Kafka (189)
Apache Avro (26)
Apache Presto (15)
Apache Tajo (26)
Hadoop General (407)
How would you tackle counting words in several text documents?
What are components of Cassandra Data Model?
How is Pig Useful For?
What is full form of rdd?
What do you mean by meta data in hdfs? List the files associated with metadata.
Mention key components of Hive Architecture?
What is the difference betwaeen mapreduce engine and hdfs cluster?
What is the difference between cassandra's schema and rdbms schema?
How is hadoop different from spark?
How hive can improve performance with orc format tables?
How can you avoid importing tables one-by-one when importing a large number of tables from a database?
What is the key- value pair in MapReduce?
What are the fundamental configurations parameters specified in map reduce?
What do you mean by commit log in Cassandra?
What are clusters in cassandra?