Data Engineer: Given a list of followers in the format:
123, 345
234, 678
345, 123
…
where column one is the ID of the follower and column two is the ID of the followee. Find all mutual following pairs (the pair 123, 345 in the example above). How would you use Map/Reduce to solve the problem when the list does not fit in memory?
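One possible approach, sketched here as a small local simulation rather than a real Hadoop job: the map step emits each edge under a normalized key (smaller ID first) together with a flag for the direction of the follow, and the reduce step reports a key only when both directions were seen. The function names (map_edge, reduce_pair, mutual_pairs) are hypothetical, and the in-memory dictionary only stands in for the shuffle phase; in an actual Map/Reduce framework the grouping by key is done by the framework's sort and shuffle, so no single machine has to hold the whole list.

```python
from collections import defaultdict
from typing import Iterable, Iterator, List, Tuple

Pair = Tuple[int, int]


def map_edge(follower: int, followee: int) -> Tuple[Pair, int]:
    """Map step: key is the unordered pair, value is the follow direction."""
    key = (min(follower, followee), max(follower, followee))
    # 0 means "smaller ID follows larger ID", 1 means the reverse.
    direction = 0 if follower < followee else 1
    return key, direction


def reduce_pair(key: Pair, directions: Iterable[int]) -> Iterator[Pair]:
    """Reduce step: a pair is mutual only if both directions were emitted."""
    if {0, 1} <= set(directions):
        yield key


def mutual_pairs(edges: Iterable[Pair]) -> List[Pair]:
    """Local stand-in for map -> shuffle -> reduce (assumption: toy input)."""
    groups = defaultdict(list)  # simulates the framework's group-by-key
    for follower, followee in edges:
        if follower == followee:
            continue  # ignore self-follows; they are not a pair
        key, direction = map_edge(follower, followee)
        groups[key].append(direction)

    result: List[Pair] = []
    for key in sorted(groups):
        result.extend(reduce_pair(key, groups[key]))
    return result


if __name__ == "__main__":
    sample = [(123, 345), (234, 678), (345, 123)]
    print(mutual_pairs(sample))
```

Because the reducer checks for both direction flags instead of counting records, a duplicated input line such as a repeated "123, 345" is not mistaken for a mutual follow.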
How would you modify that solution to count only the number of unique words in all the documents?
What is InputSplit and RecordReader?
Knox and Hadoop Development Tools?
What are the features of fully distributed mode?
What is the problem with HDFS and streaming data such as logs?
What if a namenode has no data?
Why is a password needed for ssh localhost?
How does HDFS differ from NFS?
What is the use of Combiner?
What is the difference between int and IntWritable?
Can the NameNode and DataNode run on commodity hardware?
What is a secondary namenode?