What type of data we should put in distributed cache? When to put the data in dc? How much volume we should put in?
471Post New Hadoop General Questions
Mention what is distributed cache in hadoop?
While starting hadoop services, datanode service is not running?
What is the NameNode port number?
What is TaskTracker?
Why slaves limited to 4000 in hadoop version 1?
Why is checkpointing important in hadoop?
What is the difference between an inputsplit and a block?
How is the splitting of file invoked in Hadoop ?
How often DataNode send heartbeat to NameNode in Hadoop?
How job tracker schedules an assignment?
Explain InputSplit in Hadoop?
How would an hadoop administrator deploy various components of hadoop in production?
What is the characteristic of streaming API that makes it flexible run MapReduce jobs in languages like Perl, Ruby, Awk etc.?
What is Federation?
How can you set an arbitrary number of Reducers to be created for a job in Hadoop?