Define streaming?
Answer / Priyanshu Kaushik
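Hadoop Streaming is a utility that ships with Apache Hadoop. It lets you create and run MapReduce jobs with any executable or script as the mapper and/or the reducer, rather than writing them in Java against the native MapReduce API. Despite the name, it is not a real-time or live-feed processing feature: jobs still run in batch over input stored in HDFS (or another supported file system).
The "streaming" refers to how data moves between Hadoop and your program. The mapper and reducer read records line by line from standard input and write their results to standard output, by default as key/value pairs separated by a tab. This allows mappers and reducers to be written in languages such as Python, Perl, Ruby, or shell script.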
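As a rough illustration of the mapper and reducer scripts mentioned above, here is a minimal word-count sketch in Python. The function names, the single-file layout, and the convention of passing `map` or `reduce` as the first argument are assumptions made for this example, not anything Hadoop prescribes. It only relies on the stdin/stdout contract and on the reducer receiving its input sorted by key.

```python
import sys
from itertools import groupby


def map_lines(lines):
    """Mapper: emit one 'word<TAB>1' record for every word in the input."""
    for line in lines:
        for word in line.split():
            yield f"{word}\t1"


def reduce_lines(lines):
    """Reducer: sum the counts for each word.

    Assumes the input is already sorted by key, which is what the
    shuffle/sort phase provides between the map and reduce steps.
    """
    pairs = (line.rstrip("\n").split("\t", 1) for line in lines)
    for word, group in groupby(pairs, key=lambda kv: kv[0]):
        total = sum(int(count) for _, count in group)
        yield f"{word}\t{total}"


if __name__ == "__main__" and len(sys.argv) == 2:
    # Hypothetical convention for this sketch: the role is the first argument.
    step = map_lines if sys.argv[1] == "map" else reduce_lines
    for record in step(sys.stdin):
        print(record)
```

Saved under a hypothetical name such as `wordcount_streaming.py`, the script could be passed to the Hadoop Streaming jar with the `-mapper` and `-reducer` options, alongside `-input` and `-output` for the HDFS paths. The location of the streaming jar depends on the installation. The same logic can be tried locally by piping a text file through the map step, `sort`, and then the reduce step.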
Define a job tracker?
Define metadata?
What if a namenode has no data?
Can Hadoop be compared to NOSQL database like Cassandra?
What are active and passive "NameNodes"?
What is the default replication factor in HDFS?
What is HDFS Federation?
What are the Basics of Hadoop?
Explain what happens in TextInputFormat?
Is client the end user in HDFS?
What happens to a NameNode that has no data?
Explain InputFormat?