Explain the shuffle?
No Answer is Posted For this Question
Be the First to Post Answer
How the Client communicates with HDFS?
What is the port number for NameNode
What are the problems with Hadoop 1.0?
What is a heartbeat in HDFS?
What is nlineoutputformat?
What is the difference between a Hadoop and Relational Database and Nosql?
Why we cannot do aggregation (addition) in a mapper? Why we require reducer for that?
how to share the metastore within multiple users?
If a data Node is full how it's identified?
How to change from su to cloudera?
Is a job split into maps?
Data Engineer Given a list of followers in the format:123, 345234, 678345, 123…Where column one is the ID of the follower and column two is the ID of the followee. Find all mutual following pairs (the pair 123, 345 in the example above). How would you use Map/Reduce to solve the problem when the list does not fit in memory?
Apache Hadoop (394)
MapReduce (354)
Apache Hive (345)
Apache Pig (225)
Apache Spark (991)
Apache HBase (164)
Apache Flume (95)
Apache Impala (72)
Apache Cassandra (392)
Apache Mahout (35)
Apache Sqoop (82)
Apache ZooKeeper (65)
Apache Ambari (93)
Apache HCatalog (34)
Apache HDFS Hadoop Distributed File System (214)
Apache Kafka (189)
Apache Avro (26)
Apache Presto (15)
Apache Tajo (26)
Hadoop General (407)