Explain the level of parallelism in Spark Streaming.
Explain the major libraries that constitute the Spark ecosystem.
Is it important for Hadoop MapReduce jobs to be written in Java?
What is a Bloom filter used for in Cassandra?
How does Apache Spark work?
Where does Cassandra store its data?
Differentiate between Pig Latin and the Pig Engine.
What is the Secondary NameNode? Is it a substitute or backup node for the NameNode?
In MapReduce, why does the map task write its output to local disk instead of HDFS?
When should you use caching in Spark?
What is a Directed Acyclic Graph (DAG) in Apache Spark?
What is the Spark driver?
What are the core components of Hadoop?
What is a UDF?
What does rack awareness mean?