Hadoop (4218)
Big Data General (104)
Big Data AllOther (3) Explain the features of Apache Spark because of which it is superior to Apache MapReduce?
What is data skew in spark?
Name some features of Apache Cassandra?
Can I set the number of reducers to zero?
Explain HCatOutputFormat?
Explain is it possible to search for files using wildcards?
What are 5 vs of big data ?
Is impala production ready?
What combiners are and when you should utilize a combiner in a map reduce job?
Why do we need a password-less ssh in fully distributed environment?
What does serdes mean in apache kafka?
Explain cogroup() operation in Spark?
What is aws spark?
name few other popular column oriented databases like hbase.
How analysis of Big Data is useful for organizations?