What is the relationship between Jobs and Tasks in Hadoop?
Explain the execution plans of a Pig script, or differentiate between the logical and physical plans of an Apache Pig script.
How are large objects handled in Sqoop?
What is SparkConf in Spark?
What is the HBaseFsck class?
What is the role of velocity in big data?
How to use Avro?
What does an SSTable consist of?
Explain the difference between the COUNT_STAR and COUNT functions in Apache Pig.
What is streaming / log data?
Who invented Spark?
What is a Kafka topic?
How do I get better performance with Spark?
Why is MapReduce needed?
What are the advantages of using a map-side join in MapReduce?