Apache Hadoop (394)
MapReduce (354)
Apache Hive (345)
Apache Pig (225)
Apache Spark (991)
Apache HBase (164)
Apache Flume (95)
Apache Impala (72)
Apache Cassandra (392)
Apache Mahout (35)
Apache Sqoop (82)
Apache ZooKeeper (65)
Apache Ambari (93)
Apache HCatalog (34)
Apache HDFS Hadoop Distributed File System (214)
Apache Kafka (189)
Apache Avro (26)
Apache Presto (15)
Apache Tajo (26)
Hadoop General (407)
Is it mandatory to set input and output type/format in MapReduce?
What do you understand from Node redundancy and is it exist in hadoop cluster?
What is a flume agent?
What are Features of Hive?
What are the Applications of Apache Pig?
How would you diagnose or do exception handling in the pig?
Which serialization libraries are supported in spark?
What is RDD Lineage?
What is a spark rdd?
State one best feature of Kafka?
Explain about the different types of trformations on dstreams?
How to change the replication factor of data which is already stored in HDFS?
Explain what is the role of the zookeeper?
What is the difference between map and flatmap?
What are the types of Transformation in Spark RDD Operations?