Is it possible to provide multiple input to Hadoop? If yes then how?



Is it possible to provide multiple input to Hadoop? If yes then how?..

Answer / Chetan Warkari

Yes, it is possible to provide multiple inputs to Hadoop. This can be achieved by using multiple InputSplit objects in the custom InputFormat class, or by specifying multiple input paths when submitting a job using the 'hadoop jar' command. The InputFormat class should implement the 'getSplits()' method to return multiple InputSplit objects.

Is This Answer Correct ?    0 Yes 0 No

Post New Answer

More Hadoop General Interview Questions

What is the default replication factor and how will you change it?

1 Answers  


Did you ever ran into a lop sided job that resulted in out of memory error, if yes then how did you handled it ?

1 Answers  


Can we deploy job tracker other than name node?

1 Answers  


What is a Heartbeat in Hadoop?

1 Answers  


Explain the difference between an inputsplit and a block?

1 Answers  


What are the different commands used to startup and shutdown Hadoop daemons?

1 Answers  


What are the side data distribution techniques?

1 Answers  


Suppose hadoop spawned 100 tasks for a job and one of the tasks failed. What will hadoop do?

1 Answers  


How can you set an arbitrary number of mappers to be created for a job in Hadoop?

1 Answers  


What happens if one hadoop client renames a file or a directory containing this file while another client is still writing into it?

1 Answers  


Hadoop achieves parallelism by dividing the tasks across many nodes, it is possible for a few slow nodes to rate-limit the rest of the program and slow down the program. What mechanism Hadoop provides to combat this?

1 Answers  


Explain edge nodes in hadoop?

1 Answers  


Categories