Apache Hadoop (394)
MapReduce (354)
Apache Hive (345)
Apache Pig (225)
Apache Spark (991)
Apache HBase (164)
Apache Flume (95)
Apache Impala (72)
Apache Cassandra (392)
Apache Mahout (35)
Apache Sqoop (82)
Apache ZooKeeper (65)
Apache Ambari (93)
Apache HCatalog (34)
Apache HDFS Hadoop Distributed File System (214)
Apache Kafka (189)
Apache Avro (26)
Apache Presto (15)
Apache Tajo (26)
Hadoop General (407)
What is lineage graph?
Can we change the file cached by distributed cache
List some difference between flume and kafka?
what is the default replication factor in HDFS?
UPPER or UCASE function in Hive with example?
Do I need to learn scala for spark?
Is it possible to add or delete column families in a working group?
What is RDD in Apache Spark? How are they computed in Spark? what are the various ways in which it can create?
How hdfs is different from traditional file systems?
Differentiate between piglatin and hiveql?
did you maintain the hadoop cluster in-house or used hadoop in the cloud?
What is Sqoop Job?
What is document store db? Explain with an example.
What is JPS? Why is it used in Hadoop?
What does the high availability of a name-node means?