Hadoop (4218)
Big Data General (104)
Big Data AllOther (3)
Explain the general mapreduce algorithm
How to use Avro?
What are the different execution mode available in Pig?
What are the languages in which Apache Spark create API?
What are the four basic parameters of a mapper?
What is the default partition in spark?
In which location Name Node stores its Metadata and why?
Explain the various Transformation on Apache Spark RDD like distinct(), union(), intersection(), and subtract()?
How to submit extra files(jars, static files) for MapReduce job during runtime?
Explain how do you overwrite replication factor?
What is the role of the zookeeper?
How is security achieved in Hadoop?
Why do we need a new framework for handling big data?
What is high availability in hadoop?
What are the majorly used commands in sqoop?