Can you explain hadoop streaming?
No Answer is Posted For this Question
Be the First to Post Answer
Clarify what jobtracker is in hadoop? What are the activities followed by hadoop?
Explain how do you overwrite replication factor?
What is the number of default partitioner in hadoop?
Explain what is a sequence file in hadoop?
What is the characteristic of streaming API that makes it flexible run MapReduce jobs in languages like Perl, Ruby, Awk etc.?
What are the side effects of not running a secondary name node?
What are the port numbers of task tracker?
Is it possible to rename the output file, and if so, how?
What is difference between secondary namenode, checkpoint namenode & backupnode?
How NameNode tackle Datanode failures in Hadoop?
Can you explain logistic regression?
Name the most common Input Formats defined in Hadoop? Which one is default?
Apache Hadoop (394)
MapReduce (354)
Apache Hive (345)
Apache Pig (225)
Apache Spark (991)
Apache HBase (164)
Apache Flume (95)
Apache Impala (72)
Apache Cassandra (392)
Apache Mahout (35)
Apache Sqoop (82)
Apache ZooKeeper (65)
Apache Ambari (93)
Apache HCatalog (34)
Apache HDFS Hadoop Distributed File System (214)
Apache Kafka (189)
Apache Avro (26)
Apache Presto (15)
Apache Tajo (26)
Hadoop General (407)