Hadoop (4218)
Big Data General (104)
Big Data AllOther (3)
Explain Apache Ambari?
Why Ambari?
Mention what needs to be taken care while adding a column?
Is spark faster than hadoop?
Where is kafka used?
How to save RDD?
Mention what is the maximum size of the message does kafka server can receive?
What is the role of the offset.
How to set mappers and reducers for Hadoop jobs?
What is the process for starting a Kafka server?
Name a few commonly used spark ecosystems?
What is a checkpoint?
What is a bloom filter and how does it help in searching rows?
How do ‘map’ and ‘reduce’ work?
Explain the common input formats in hadoop?