Apache Hadoop (394)
MapReduce (354)
Apache Hive (345)
Apache Pig (225)
Apache Spark (991)
Apache HBase (164)
Apache Flume (95)
Apache Impala (72)
Apache Cassandra (392)
Apache Mahout (35)
Apache Sqoop (82)
Apache ZooKeeper (65)
Apache Ambari (93)
Apache HCatalog (34)
Apache HDFS Hadoop Distributed File System (214)
Apache Kafka (189)
Apache Avro (26)
Apache Presto (15)
Apache Tajo (26)
Hadoop General (407) how is a file of the size 1 GB uncompressed
What is graph db? Explain with an example.
Which classes are used by the hive to read and write hdfs files?
Explain why the name ‘hadoop’?
What are the most common OutputFormat in Hadoop?
How Sqoop word came? Sqoop is which type of tool and the main use of sqoop?
State some applications of HBase?
List down the segments of a hive question processor?
What is the role of Consumer API?
What are the primary phases of the reducer?
What are the design goals of zookeeper?
Does the HDFS go wrong? If so, how?
Is spark faster than hadoop?
Explain the various Transformation on Apache Spark RDD like distinct(), union(), intersection(), and subtract()?
What are the independent extensions that contributed to the ambari codebase?