Is it possible to provide multiple input to Hadoop? If yes then how?
Answer / Chetan Warkari
Yes, it is possible to provide multiple inputs to Hadoop. This can be achieved by using multiple InputSplit objects in the custom InputFormat class, or by specifying multiple input paths when submitting a job using the 'hadoop jar' command. The InputFormat class should implement the 'getSplits()' method to return multiple InputSplit objects.
| Is This Answer Correct ? | 0 Yes | 0 No |
What is the default replication factor and how will you change it?
Did you ever ran into a lop sided job that resulted in out of memory error, if yes then how did you handled it ?
Can we deploy job tracker other than name node?
What is a Heartbeat in Hadoop?
Explain the difference between an inputsplit and a block?
What are the different commands used to startup and shutdown Hadoop daemons?
What are the side data distribution techniques?
Suppose hadoop spawned 100 tasks for a job and one of the tasks failed. What will hadoop do?
How can you set an arbitrary number of mappers to be created for a job in Hadoop?
What happens if one hadoop client renames a file or a directory containing this file while another client is still writing into it?
Hadoop achieves parallelism by dividing the tasks across many nodes, it is possible for a few slow nodes to rate-limit the rest of the program and slow down the program. What mechanism Hadoop provides to combat this?
Explain edge nodes in hadoop?
Apache Hadoop (394)
MapReduce (354)
Apache Hive (345)
Apache Pig (225)
Apache Spark (991)
Apache HBase (164)
Apache Flume (95)
Apache Impala (72)
Apache Cassandra (392)
Apache Mahout (35)
Apache Sqoop (82)
Apache ZooKeeper (65)
Apache Ambari (93)
Apache HCatalog (34)
Apache HDFS Hadoop Distributed File System (214)
Apache Kafka (189)
Apache Avro (26)
Apache Presto (15)
Apache Tajo (26)
Hadoop General (407)