Apache Hadoop (394)
MapReduce (354)
Apache Hive (345)
Apache Pig (225)
Apache Spark (991)
Apache HBase (164)
Apache Flume (95)
Apache Impala (72)
Apache Cassandra (392)
Apache Mahout (35)
Apache Sqoop (82)
Apache ZooKeeper (65)
Apache Ambari (93)
Apache HCatalog (34)
Apache HDFS (Hadoop Distributed File System) (214)
Apache Kafka (189)
Apache Avro (26)
Apache Presto (15)
Apache Tajo (26)
Hadoop General (407)
Explain HBase Architecture in brief?
How does the NameNode handle DataNode failures?
Can we deploy the JobTracker on a node other than the NameNode?
What are the different types of filters used in HBase?
What is the problem with small files in Hadoop?
Is Spark difficult to learn?
Does it offer scaling?
What are the advantages of Kafka?
What do you mean by Persistence?
Explain the AvroStorage function?
Define streaming access?
What happens if a DataNode loses its network connection for a few minutes?
How is HDFS fault tolerant?
Can you define a checkpoint?
What is a keyspace in Cassandra?