What is a block and block scanner in HDFS?
What is a Partitioner in Hadoop? Where does it run?
How can one increase replication factor to a desired value in Hadoop?
What is a Partitioner in Hadoop? Where does it run, in the mapper or the reducer?
What is speculative execution in Apache Hadoop MapReduce?
What are sink processors?
What are the modules that constitute the Apache Hadoop 2.0 framework?
What is 'Key value pair' in HDFS?
What is SequenceFileInputFormat in Hadoop?
What is crontab? Explain with a suitable example.
What's the best way to copy files between HDFS clusters?
Data Engineer: Given a list of followers in the format: 123, 345; 234, 678; 345, 123; … where column one is the ID of the follower and column two is the ID of the followee, find all mutual following pairs (the pair 123, 345 in the example above). How would you use Map/Reduce to solve the problem when the list does not fit in memory?
Explain the basic difference between a traditional RDBMS and Hadoop.