Hadoop General (407)
What is the full form of MSLAB?
Explain the common input formats in Hadoop.
What is the difference between an RDBMS and Hadoop?
What does a Hadoop application look like, and what are its basic components?
What are shared variables in Apache Spark?
What is an SSTable? How is it different from a relational table?
Can we change a file cached by the DistributedCache?
What is the use of BinStorage?
What are the collection data types provided by CQL?
What is DistributedCache and its purpose?
Why is Apache Spark so fast?
What is Apache Hadoop YARN?
What is the process for changing files at arbitrary locations in HDFS?
What are the different types of primary keys in Cassandra?
Replication causes data redundancy, so why is it used in HDFS?