Apache Hadoop (394)
MapReduce (354)
Apache Hive (345)
Apache Pig (225)
Apache Spark (991)
Apache HBase (164)
Apache Flume (95)
Apache Impala (72)
Apache Cassandra (392)
Apache Mahout (35)
Apache Sqoop (82)
Apache ZooKeeper (65)
Apache Ambari (93)
Apache HCatalog (34)
Apache HDFS Hadoop Distributed File System (214)
Apache Kafka (189)
Apache Avro (26)
Apache Presto (15)
Apache Tajo (26)
Hadoop General (407) Which object can be used to get the progress of a particular job
What are the limitations of Hive?
Explain the tajo architecture?
How many distinct layers are of storm’s codebase?
In which location name node sores its metadata and why?
Explain about the core components of Flume?
Name three data source available in SparkSQL
What is the key difference between NameNode and DataNode in Hadoop?
Can we run Apache Spark without Hadoop?
Explain the role of offset in kafka?
What is sink processors?
Explain Creating an Index?
Do you know the comparative differences between apache spark and hadoop?
What are the befefits of nosql over relational database?
Can you give some examples of Big Data?