What is a document store database?
Is it possible to split 100 lines of input as a single split in MapReduce?
Explain job scheduling through JobTracker
What are the complex data types in Pig?
What is an RDD lineage graph? How is it useful in achieving fault tolerance?
Have you ever run into a lopsided job that resulted in an out-of-memory error? If yes, how did you handle it?
What is Safemode in Apache Hadoop?
How can you debug Hadoop code?
What is Apache Spark SQL?
Does Spark need YARN?
What is the functionality of ObjectInspector?
Does Spark run on Hadoop?
Explain the difference between Spark SQL and Hive.
Why is reading done in parallel in HDFS while writing is not?
How does Apache Spark work?