What is Spark reduceByKey?
Explain task granularity
What are the file formats supported by Spark?
What is the difference between Hive CLI and Beeline?
State the difference between persist() and cache() functions.
Is Kafka open source?
Who is a 'user' in HDFS?
Explain the PostgreSQL storage handler.
Explain the functionality of ObjectInspector.
What is Flume Instagram?
Is it possible to search for files using wildcards?
Does Spark use Java?
What is an accumulator in Spark?
Can the number of combiners be changed in MapReduce?
List some use cases where classification machine learning algorithms can be used.