Hadoop (4218)
Big Data General (104)
Big Data AllOther (3)
Which object can be used to get the progress of a particular job
What is spark database?
Explain how can you minimize data transfers when working with spark?
Name the two types of shared variable available in Apache Spark?
How does hbase actually delete a row?
What is the difference between an hdfs block and input split?
What are the various diagnostic operators available in Apache Pig?
Why use hadoop?
What is the difference between Pig and SQL?
Describe the distnct(),union(),intersection() and substract() transformation in Apache Spark RDD?
What is High Availability feature in Hadoop2?
What is the core of the job in MapReduce framework?
Is hadoop required for data science?
What is Hadoop Map Reduce ?
What does the Spark Engine do?