How would you calculate the number of unique visitors for each hour by mining a huge Apache log? You can post-process the output of the MapReduce job.
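One possible approach, sketched here in plain Python rather than as real Hadoop code: the map step emits an (hour, visitor) key for every log line, the shuffle and reduce steps collapse duplicates so each distinct key is output once, and a small post-processing pass counts the surviving records per hour. The sketch assumes the log is in Apache Common Log Format, that a "visitor" is identified by client IP address, and that hours are bucketed in the log's own time zone. The names `map_line`, `run_job`, and `post_process` are hypothetical, and the month abbreviation is parsed under an English/C locale.

```python
import re
from collections import Counter
from datetime import datetime

# Captures the client address and the bracketed timestamp of a
# Common Log Format line (assumed format; adjust for custom LogFormat).
LOG_RE = re.compile(r'^(\S+) \S+ \S+ \[([^\]]+)\]')


def map_line(line):
    """Map step: yield one (hour_bucket, visitor_ip) key per valid line."""
    match = LOG_RE.match(line)
    if not match:
        return  # skip malformed lines instead of failing the job
    ip, stamp = match.groups()
    ts = datetime.strptime(stamp, "%d/%b/%Y:%H:%M:%S %z")
    yield ts.strftime("%Y-%m-%d %H"), ip


def run_job(lines):
    """Simulate shuffle + reduce: emit each distinct (hour, ip) key once.

    In a real job the framework groups identical keys and the reducer
    writes the key a single time; a set stands in for that here.
    """
    distinct_keys = set()
    for line in lines:
        distinct_keys.update(map_line(line))
    return sorted(distinct_keys)


def post_process(job_output):
    """Post-processing: count distinct visitors per hour from job output."""
    counts = Counter(hour for hour, _ip in job_output)
    return sorted(counts.items())


if __name__ == "__main__":
    sample = [
        '10.0.0.1 - - [10/Oct/2000:13:55:36 -0700] "GET / HTTP/1.0" 200 2326',
        '10.0.0.1 - - [10/Oct/2000:13:58:01 -0700] "GET /a HTTP/1.0" 200 10',
        '10.0.0.2 - - [10/Oct/2000:13:59:59 -0700] "GET / HTTP/1.0" 200 2326',
        '10.0.0.1 - - [10/Oct/2000:14:00:00 -0700] "GET / HTTP/1.0" 200 2326',
    ]
    for hour, visitors in post_process(run_job(sample)):
        print(hour, visitors)
```

Using the composite (hour, visitor) key means no single reducer has to hold every visitor for an hour in memory; deduplication happens through key grouping, and the per-hour count is left to the cheap post-processing pass (or to a second, small MapReduce job).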
List the configuration parameters that have to be specified when running a MapReduce job.
What are the issues associated with the map and reduce slot-based mechanism in MapReduce?
How does the Hadoop classpath play a vital role in starting or stopping Hadoop daemons?
What is the default input type in MapReduce?
What are the main components of a MapReduce job?
Explain the working of MapReduce.
What is new in Hadoop 2.0 and MapReduce v2 (YARN)?
What happens when a DataNode fails during the write process?
What are the steps involved in the MapReduce framework?
What is "map" and what is "reducer" in Hadoop?
In MapReduce, ideally how many mappers should be configured on a slave?
What is the need of MapReduce in Hadoop?