Apache Hadoop (394)
MapReduce (354)
Apache Hive (345)
Apache Pig (225)
Apache Spark (991)
Apache HBase (164)
Apache Flume (95)
Apache Impala (72)
Apache Cassandra (392)
Apache Mahout (35)
Apache Sqoop (82)
Apache ZooKeeper (65)
Apache Ambari (93)
Apache HCatalog (34)
Apache HDFS Hadoop Distributed File System (214)
Apache Kafka (189)
Apache Avro (26)
Apache Presto (15)
Apache Tajo (26)
Hadoop General (407) What is difference between secondary namenode, checkpoint namenode & backupnod secondary namenode, a poorly named component of hadoop?
727What mechanism does hadoop framework provides to synchronize changes made in distribution cache during runtime of the application?
673Did you ever ran into a lop sided job that resulted in out of memory error, if yes then how did you handled it ?
442
Is it possible to rename the output file?
What is difference between secondary namenode, checkpoint namenode & backupnod secondary namenode, a poorly named component of hadoop?
Where the mapper's intermediate data will be stored?
Can we do real-time processing using spark sql?
What are the limitations of importing RDBMS tables into Hcatalog directly?
What is a tuple?
What is Spark Streaming?
State some DDL Command with brief Description?
How many JVMs run on a slave node?
What are the restriction to the key and value class ?
What is a block and block scanner in HDFS?
What are consumers in kafka?
How is rdd fault?
What can you do with Kafka?
What are use cases of Apache Flume?