What is Hadoop streaming?
Answer / Shiva Chaudhary
Apache Hadoop Streaming is a method for using Unix-like operating systems as data sinks and sources with MapReduce jobs. It allows developers to write user-defined mapper and reducer programs in any language that can read standard input (stdin) and write standard output (stdout).
| Is This Answer Correct ? | 0 Yes | 0 No |
What problems have you faced when you are working on Hadoop code?
What alternate way does HDFS provides to recover data in case a Namenode
What do shuffling do?
Why do we use HDFS for applications having large data sets and not when there are lot of small files?
what should be the ideal replication factor in hadoop?
What is partioner in hadoop? Where does it run,mapper or reducer?
Why are the number of splits equal to the number of maps?
Explain the difference between gen1 and gen2 hadoop with regards to the namenode?
What is a checkpoint?
Explain the basic architecture of Hadoop?
What is Partioner in hadoop? Where does it run
What is structured data?
Apache Hadoop (394)
MapReduce (354)
Apache Hive (345)
Apache Pig (225)
Apache Spark (991)
Apache HBase (164)
Apache Flume (95)
Apache Impala (72)
Apache Cassandra (392)
Apache Mahout (35)
Apache Sqoop (82)
Apache ZooKeeper (65)
Apache Ambari (93)
Apache HCatalog (34)
Apache HDFS Hadoop Distributed File System (214)
Apache Kafka (189)
Apache Avro (26)
Apache Presto (15)
Apache Tajo (26)
Hadoop General (407)