Apache Hadoop (394)
MapReduce (354)
Apache Hive (345)
Apache Pig (225)
Apache Spark (991)
Apache HBase (164)
Apache Flume (95)
Apache Impala (72)
Apache Cassandra (392)
Apache Mahout (35)
Apache Sqoop (82)
Apache ZooKeeper (65)
Apache Ambari (93)
Apache HCatalog (34)
Apache HDFS Hadoop Distributed File System (214)
Apache Kafka (189)
Apache Avro (26)
Apache Presto (15)
Apache Tajo (26)
Hadoop General (407)
What is spark executor cores?
How to handle record boundaries in Text files or Sequence files in MapReduce InputSplits?
Does spark work with python 3?
Compare Pig vs Hive vs Hadoop MapReduce?
What is the difference between apache mahout and apache spark’s mllib?
What is SSTable?
What are the different methods to set up local repositories?
Explain pipe() operation in Apache Spark?
What are the different types of nosql databases?
What is Spark Core?
What is cluster in Cassandra?
What is Replication Factor in Cassandra?
What does hadoop-metrics.properties file do?
How will you write a custom partitioner for a Hadoop job?
What is Identity reducer?