Hadoop General (407)
What is the problem with small files in Hadoop?
Is fs.mapr.working.dir a single directory?
What is OutputCommitter?
Can Hadoop replace a relational database?
Can you explain Spark Streaming?
How do you format HDFS?
What is an SSTable?
Name various types of Cluster Managers in Spark.
Explain what happens in TextInputFormat.
What is a combiner, and when should you use one in a MapReduce job?
What does high availability of a NameNode mean?
Can you join multiple fields in Apache
What is an executor in Spark?
What is the difference between local and remote metastore?
Can you explain Apache Spark?