Data Engineer Given a list of followers in the format:123, 345234, 678345, 123…Where column one is the ID of the follower and column two is the ID of the followee. Find all mutual following pairs (the pair 123, 345 in the example above). How would you use Map/Reduce to solve the problem when the list does not fit in memory?
780How would you use Map/Reduce to split a very large graph into smaller pieces and parallelize the computation of edges according to the fast/dynamic change of data?
726Post New Apache Hadoop Questions
Have you ever used Counters in Hadoop. Give us an example scenario?
What is Apache Hadoop? Why is Hadoop essential for every Big Data application?
What is a Combiner?
How to enable recycle bin in hadoop?
What is the InputFormat ?
What is inputformat in hadoop?
What is unstructured data?
Can we have multiple entries in the master files?
How does an hadoop application look like or their basic components?
What is Hadoop serialization?
What are channel selectors?
What is DistributedCache and its purpose?
What is the problem with small files in Apache Hadoop?
What are the characteristics of hadoop framework?
How to exit the vi editor?