Apache Hadoop (394)
MapReduce (354)
Apache Hive (345)
Apache Pig (225)
Apache Spark (991)
Apache HBase (164)
Apache Flume (95)
Apache Impala (72)
Apache Cassandra (392)
Apache Mahout (35)
Apache Sqoop (82)
Apache ZooKeeper (65)
Apache Ambari (93)
Apache HCatalog (34)
Apache HDFS (Hadoop Distributed File System) (214)
Apache Kafka (189)
Apache Avro (26)
Apache Presto (15)
Apache Tajo (26)
Hadoop General (407)
What are the various programming languages supported by Spark?
How often do you need to reformat the NameNode?
What is Streaming / Log Data?
What is a block in HDFS?
Define column families.
What are the functions of the UNION and SPLIT operators? Give examples.
Can a region server be located on all DataNodes?
What is the difference between Apache Mahout and PredictionIO?
How can you launch Spark jobs inside Hadoop MapReduce?
How does the NameNode handle DataNode failures?
Should I install Spark on all nodes of a YARN cluster?
State the difference between Spark SQL and HQL.
How do you enable the recycle bin (trash) in Hadoop?
What is spark.executor.memory in a Spark application?
What are the four modules that make up the Apache Hadoop framework?