Define “speculative execution” in hadoop?
Are job tracker and task trackers present in separate machines?
Whats the default port that jobtrackers listens ?
What is TextInputFormat in Hadoop?
What is distributed copy (distcp)?
How many maps are there in a particular job?
Is it possible to have hadoop job output in multiple directories?
What is the logistic regression?
What is pseudo-distributed mode?
Mention what is rack awareness?
What is JPS? Why is it used in Hadoop?
Where are hadoop’s configuration files located and list them?
How can we create a hadoop cluster from scratch?
Did you ever ran into a lop sided job that resulted in out of memory error, if yes then how did you handled it ?
Clarify how ordering in hdfs is finished?