Apache Hadoop (394)
MapReduce (354)
Apache Hive (345)
Apache Pig (225)
Apache Spark (991)
Apache HBase (164)
Apache Flume (95)
Apache Impala (72)
Apache Cassandra (392)
Apache Mahout (35)
Apache Sqoop (82)
Apache ZooKeeper (65)
Apache Ambari (93)
Apache HCatalog (34)
Apache HDFS Hadoop Distributed File System (214)
Apache Kafka (189)
Apache Avro (26)
Apache Presto (15)
Apache Tajo (26)
Hadoop General (407)
Explain why the name ‘hadoop’?
What is CAP Theorem? What aspects does Hadoop support from this theorem?
Explain what you understand by speculative execution
How do I check my spark status?
What are apache tajo sql functions?
How to Delete directory from HDFS?
What are active and passive "NameNodes"?
What is the benefit of kafka?
How analysis of Big Data is useful for organizations?
What is an accumulator in spark?
What is the difference between python and spark?
What is the default replication factor and how will you change it?
What is the benifit of Distributed cache, why can we just have the file in HDFS and have the application read it?
On what basis Namenode will decide which datanode to write on?
What are the main key structures of hbase?