How to overwrite an existing output file/dir during execution of Hadoop MapReduce jobs?
Answer / Shantanu Sharma
To overwrite an existing output file/dir during the execution of a Hadoop MapReduce job, you can set the 'mapred.job.reuse.jobid' configuration property to false before calling Job.waitForCompletion(). Setting this property to false allows the new job to overwrite any previous job with the same ID.
| Is This Answer Correct ? | 0 Yes | 0 No |
What do sorting and shuffling do?
What is sqoop in Hadoop ?
What is identity mapper and chain mapper?
Where the mapper's intermediate data will be stored?
What is the difference between a MapReduce InputSplit and HDFS block?
What do you mean by inputformat?
How to optimize MapReduce Job?
Whether the output of mapper or output of partitioner written on local disk?
How hadoop mapreduce works?
What is the difference between an RDBMS and Hadoop?
Which one will you decide for an undertaking – Hadoop MapReduce or Apache Spark?
Explain about the partitioning, shuffle and sort phase in MapReduce?
Apache Hadoop (394)
MapReduce (354)
Apache Hive (345)
Apache Pig (225)
Apache Spark (991)
Apache HBase (164)
Apache Flume (95)
Apache Impala (72)
Apache Cassandra (392)
Apache Mahout (35)
Apache Sqoop (82)
Apache ZooKeeper (65)
Apache Ambari (93)
Apache HCatalog (34)
Apache HDFS Hadoop Distributed File System (214)
Apache Kafka (189)
Apache Avro (26)
Apache Presto (15)
Apache Tajo (26)
Hadoop General (407)