Hadoop (4218)
Big Data General (104)
Big Data AllOther (3)
Mention what is ObjectInspector functionality in Hive?
What is the difference between DSM and RDD?
What do sorting do?
What is the difference between spark ml and spark mllib?
What combiners is and when you should use a combiner in a MapReduce Job?
Why apache spark is faster than hadoop?
How to create database statement in apache tajo?
What are the different input sources for Spark Streaming?
Define Writable data types in Hadoop MapReduce?
How does cassandra perform read operation?
Which modes can Hadoop be run in? List a few features for each mode?
What is Hive Database?
List few components that are using big data?
Can you explain commodity hardware?
Explain cap theorem?