Apache Hadoop (394)
MapReduce (354)
Apache Hive (345)
Apache Pig (225)
Apache Spark (991)
Apache HBase (164)
Apache Flume (95)
Apache Impala (72)
Apache Cassandra (392)
Apache Mahout (35)
Apache Sqoop (82)
Apache ZooKeeper (65)
Apache Ambari (93)
Apache HCatalog (34)
Apache HDFS Hadoop Distributed File System (214)
Apache Kafka (189)
Apache Avro (26)
Apache Presto (15)
Apache Tajo (26)
Hadoop General (407) If map reduce is inferior to spark then is there any benefit of learning it?
can you explain about configuration files?
How do ‘map’ and ‘reduce’ work?
What happens when we submit a spark job?
Can you explain spark streaming?
What is Replication Factor in Cassandra?
Name different types of the data model?
Mention what are the most common input formats defined in hadoop?
What is sink in flume?
Explain some of the basic commands used for Apache Ambari server?
What is the purpose of Hive Driver?
How does broadcast join work in spark?
Explain the different logging levels in cassandra.
Explain what are the basic parameters of a mapper?
What happens if the preferred replica is not in the isr?