Apache Hadoop (394)
MapReduce (354)
Apache Hive (345)
Apache Pig (225)
Apache Spark (991)
Apache HBase (164)
Apache Flume (95)
Apache Impala (72)
Apache Cassandra (392)
Apache Mahout (35)
Apache Sqoop (82)
Apache ZooKeeper (65)
Apache Ambari (93)
Apache HCatalog (34)
Apache HDFS Hadoop Distributed File System (214)
Apache Kafka (189)
Apache Avro (26)
Apache Presto (15)
Apache Tajo (26)
Hadoop General (407)
What does ‘jps’ command do?
What are the transformations in spark?
What are the components used in Hive query processor?
Whether Pig Latin language is case-sensitive or not?
How Hive organize the data?
Which command is used to run hbase shell?
Explain Machine Learning library in Spark?
What is InputFormat in Hadoop MapReduce?
What is the role of a zookeeper in a kafka cluster?
List down the segments of a hive question processor?
List out some common problems faced by data analyst?
What is Resilient Distributed Dataset (RDD) in Apache Spark? How does it make spark operator rich?
What is write ahead log(journaling) in Spark?
How can apache spark be used alongside hadoop?
What are ‘reduces’?