Apache Hadoop Interview Questions, Answers for Freshers and Experienced asked in Job Interviews

Un-Answered Questions { Apache Hadoop }

Data Engineer Given a list of followers in the format:123, 345234, 678345, 123â€¦Where column one is the ID of the follower and column two is the ID of the followee. Find all mutual following pairs (the pair 123, 345 in the example above). How would you use Map/Reduce to solve the problem when the list does not fit in memory?

748

How would you use Map/Reduce to split a very large graph into smaller pieces and parallelize the computation of edges according to the fast/dynamic change of data?

698

How to exit the vi editor?

738

Does the hdfs client decide the input split or namenode?

680

Which files are used by the startup and shutdown commands?

720

What is cloudera and why it is used?

800

What is a spill factor with respect to the ram?

874

Can we have multiple entries in the master files?

696

On which port does ssh work?

745

Do we need to give a password, even if the key is added in ssh?

727

What are the port numbers of namenode, job tracker and task tracker?

714

Is fs.mapr.working.dir a single directory?

689

How can we look for the namenode in the browser?

719

What do slaves consist of?

673

How can we check whether namenode is working or not?

661