Is it possible to have hadoop job output in multiple directories? If yes, how?
Answer / Sachin Batra
Yes, it is possible to have Hadoop job output in multiple directories. This can be achieved by calling the 'setOutputPath' method multiple times with different Path objects in the Reducer class. Alternatively, you can create a custom OutputFormat that allows writing to multiple locations.
| Is This Answer Correct ? | 0 Yes | 0 No |
What is the job tracker role in hadoop?
Can you give some examples of Big Data?
What does secondary name-node means?
What is the procedure for namenode recovery?
Can you explain indexing?
What is the difference between TextinputFormat and KeyValueTextInputFormat class?
Mention what daemons run on a master node and slave nodes?
Explain what is a sequence file in hadoop?
Compare Apache Hadoop and Apache Spark?
How can you native libraries be included in yarn jobs?
What is single node cluster in Hadoop? for what all purposes Hadoop run on a single node cluster?
What are the main features and Characteristics of Hadoop which makes it the most popular and powerful Big Data tool?
Apache Hadoop (394)
MapReduce (354)
Apache Hive (345)
Apache Pig (225)
Apache Spark (991)
Apache HBase (164)
Apache Flume (95)
Apache Impala (72)
Apache Cassandra (392)
Apache Mahout (35)
Apache Sqoop (82)
Apache ZooKeeper (65)
Apache Ambari (93)
Apache HCatalog (34)
Apache HDFS Hadoop Distributed File System (214)
Apache Kafka (189)
Apache Avro (26)
Apache Presto (15)
Apache Tajo (26)
Hadoop General (407)