Apache Hadoop (394)
MapReduce (354)
Apache Hive (345)
Apache Pig (225)
Apache Spark (991)
Apache HBase (164)
Apache Flume (95)
Apache Impala (72)
Apache Cassandra (392)
Apache Mahout (35)
Apache Sqoop (82)
Apache ZooKeeper (65)
Apache Ambari (93)
Apache HCatalog (34)
Apache HDFS (Hadoop Distributed File System) (214)
Apache Kafka (189)
Apache Avro (26)
Apache Presto (15)
Apache Tajo (26)
Hadoop General (407)
What are the different types of data models?
Why Ambari?
What are the configuration files in Hadoop?
What is a session in Cassandra?
What is meant by a column family in Cassandra?
How would you check whether your NameNode is working or not?
MapReduce Types and Formats and Setting up a Hadoop Cluster?
What are the various programming languages supported by Spark?
Can we do real-time processing using Spark SQL?
How do I download and install Spark?
What is a block and block scanner in HDFS?
What is the difference between Spark and Python?
Can you change the block size of HDFS files?
How does InputSplit in MapReduce determine the record boundaries correctly?
What is the difference between the Secondary NameNode, the Checkpoint Node, and the Backup Node? (The Secondary NameNode is a poorly named component of Hadoop.)