Apache Hadoop (394)
MapReduce (354)
Apache Hive (345)
Apache Pig (225)
Apache Spark (991)
Apache HBase (164)
Apache Flume (95)
Apache Impala (72)
Apache Cassandra (392)
Apache Mahout (35)
Apache Sqoop (82)
Apache ZooKeeper (65)
Apache Ambari (93)
Apache HCatalog (34)
Apache HDFS Hadoop Distributed File System (214)
Apache Kafka (189)
Apache Avro (26)
Apache Presto (15)
Apache Tajo (26)
Hadoop General (407) What are ‘reduces’?
Why do we need Pig?
Is a job split into maps?
What are the main classes of Data Transfer API?
How to copy file from HDFS to local?
Explain HDFS “Write once Read many” pattern?
List some benefits of apache kafka?
How does apache spark engine work?
How is spark different from hadoop?
Can aluminum cause a spark?
Can you explain how to minimize data transfers while working with Spark?
What are the types of Apache Spark transformation?
What is application master in spark?
what is difference between int and intwritable?
If data is present in HDFS and RF is defined, then how can we change Replication Factor?