How to specify more than one directory as input in the Hadoop MapReduce Program?
No Answer is Posted For this Question
Be the First to Post Answer
How would you tackle calculating the number of unique visitors for each hour by mining a huge apache log? You can use post processing on the output of the mapreduce job.
Define the Use of MapReduce?
What is the Reducer used for?
How can you add the arbitrary key-value pairs in your mapper?
What is the difference between an RDBMS and Hadoop?
Clarify what is shuffling in map reduce?
What are the various configuration parameters required to run a mapreduce job?
What do you understand by the term Straggler ?
How to get the single file as the output from MapReduce Job?
what are the main configuration parameters that user need to specify to run Mapreduce Job ?
Explain what are the basic parameters of a mapper?
Is it legal to set the number of reducer task to zero? Where the output will be stored in this case?
Apache Hadoop (394)
MapReduce (354)
Apache Hive (345)
Apache Pig (225)
Apache Spark (991)
Apache HBase (164)
Apache Flume (95)
Apache Impala (72)
Apache Cassandra (392)
Apache Mahout (35)
Apache Sqoop (82)
Apache ZooKeeper (65)
Apache Ambari (93)
Apache HCatalog (34)
Apache HDFS Hadoop Distributed File System (214)
Apache Kafka (189)
Apache Avro (26)
Apache Presto (15)
Apache Tajo (26)
Hadoop General (407)