Apache Hadoop (394)
MapReduce (354)
Apache Hive (345)
Apache Pig (225)
Apache Spark (991)
Apache HBase (164)
Apache Flume (95)
Apache Impala (72)
Apache Cassandra (392)
Apache Mahout (35)
Apache Sqoop (82)
Apache ZooKeeper (65)
Apache Ambari (93)
Apache HCatalog (34)
Apache HDFS Hadoop Distributed File System (214)
Apache Kafka (189)
Apache Avro (26)
Apache Presto (15)
Apache Tajo (26)
Hadoop General (407)
How does apache flume work?
Mention if we can name view same as the name of a Hive table?
What is configured in /etc/hosts and what is its role in setting Hadoop cluster?
Explain what is the function of mapreduce partitioner?
When to use –target-dir and when to use –warehouse-dir while importing data?
How to create a user in Hadoop?
What is ColumnFamily?
Which command is used for the retrieval of the status of daemons running the hadoop cluster?
How can you stop a partition form being queried?
Explain transformation in rdd. How is lazy evaluation helpful in reducing the complexity of the system?
What is tungsten engine in spark?
What exactly is apache spark?
Explain slot in Hadoop Map-Reduce v1?
Explain what is sequencefileinputformat?
Compare Apache Hadoop and Apache Spark?