What are the main components in Hadoop Eco-System and what are their functions ?
Name the most common Input Formats defined in Hadoop? Which one is default?
What is the block size in Hadoop?
Can you give some examples of Big Data?
After the Map phase finishes, the Hadoop framework does 'Partitioning, Shuffle and sort'. Explain what happens in this phase?
What is streaming in Hadoop?
What is a commodity hardware? Does commodity hardware include RAM?
What is streaming access?
How can you set an arbitrary number of Reducers to be created for a job in Hadoop?
Give examples of some companies that are using Hadoop structure?
Which modes can Hadoop be run in? List a few features for each mode?
How analysis of Big Data is useful for organizations?
What does the file hadoop-metrics.properties do?
Hadoop achieves parallelism by dividing the tasks across many nodes, it is possible for a few slow nodes to rate-limit the rest of the program and slow down the program. What mechanism Hadoop provides to combat this?
How will you write a custom partitioner for a Hadoop job?