Apache Hadoop (394)
MapReduce (354)
Apache Hive (345)
Apache Pig (225)
Apache Spark (991)
Apache HBase (164)
Apache Flume (95)
Apache Impala (72)
Apache Cassandra (392)
Apache Mahout (35)
Apache Sqoop (82)
Apache ZooKeeper (65)
Apache Ambari (93)
Apache HCatalog (34)
Apache HDFS (Hadoop Distributed File System) (214)
Apache Kafka (189)
Apache Avro (26)
Apache Presto (15)
Apache Tajo (26)
Hadoop General (407)
What is a DAG in Spark?
State some disadvantages of Impala.
What is the default input format in MapReduce?
What happens when the NameNode enters safe mode in Hadoop?
What is the small file problem in Hadoop?
What is the Spark architecture?
When running Spark applications, is it necessary to install Spark on all the nodes of a YARN cluster?
What is the metadata storage service in BookKeeper?
Which language is better for Spark?
How can we see all the hosts that are available in Ambari?
Explain the Spark leftOuterJoin() and rightOuterJoin() operations.
What is the history of the Cassandra database?
Explain the major libraries that constitute the Spark ecosystem.
In a given Spark program, how will you identify whether a given operation is a transformation or an action?
What is BookKeeper?