How to overwrite an existing output file/dir during execution of Hadoop MapReduce jobs?



How to overwrite an existing output file/dir during execution of Hadoop MapReduce jobs?..

Answer / Shantanu Sharma

To overwrite an existing output file/dir during the execution of a Hadoop MapReduce job, you can set the 'mapred.job.reuse.jobid' configuration property to false before calling Job.waitForCompletion(). Setting this property to false allows the new job to overwrite any previous job with the same ID.

Is This Answer Correct ?    0 Yes 0 No

Post New Answer

More MapReduce Interview Questions

What do sorting and shuffling do?

1 Answers  


What is sqoop in Hadoop ?

1 Answers  


What is identity mapper and chain mapper?

1 Answers  


Where the mapper's intermediate data will be stored?

1 Answers  


What is the difference between a MapReduce InputSplit and HDFS block?

1 Answers  


What do you mean by inputformat?

1 Answers  


How to optimize MapReduce Job?

1 Answers  


Whether the output of mapper or output of partitioner written on local disk?

1 Answers  


How hadoop mapreduce works?

1 Answers  


What is the difference between an RDBMS and Hadoop?

1 Answers  


Which one will you decide for an undertaking – Hadoop MapReduce or Apache Spark?

1 Answers  


Explain about the partitioning, shuffle and sort phase in MapReduce?

1 Answers  


Categories