Hadoop (4218)
Big Data General (104)
Big Data AllOther (3)
Explain how jobtracker schedules a task?
What is the difference between map and flatmap?
What are broadcast variables in Apache Spark? Why do we need them?
How many types of rdd are there in spark?
What do you mean by commit log in Cassandra?
Who should learn Apache Ambari?
How many datanodes can run on a single Hadoop cluster?
In which location Name Node stores its Metadata and why?
What is TaskTracker?
What is application master in spark?
What is a Speculative Execution in Hadoop MapReduce?
What are the main configuration parameters in a MapReduce program?
Explain about the core components of Flume?
List of some best tools that can be useful for data-analysis?
Explain some Disadvantages of Avro?