How is data partitioned before it is sent to the reducer if no custom partitioner is defined in Hadoop?
Why should we use the ‘order by’ keyword in Pig scripts?
What are the different primitive data types available in Hive?
What does conf.setMapperClass do in MapReduce?
What is Distributed Cache?
How can native libraries be included in YARN jobs?
What are the key components of the Hive architecture?
What is a MapReduce Combiner?
Explain the word count implementation in the Hadoop framework.
How do you override the replication factor?
What are the commands to start and stop Spark in an interactive shell?
What is the default file format to import data using Apache Sqoop?
When should you use SequenceFileInputFormat?
What are the features of Hadoop?
Which language is best for Spark?