Why do we need indexing?
Is Hadoop a database?
What are the different compaction types in HBase?
Do you need to install Spark on all nodes of a YARN cluster?
What is Hadoop streaming?
How do you process data using transformation operations in Spark?
What are the options and the process for upgrading ZooKeeper?
What does Apache Spark do?
What are the features of Apache Cassandra?
How can you set an arbitrary number of Reducers to be created for a job in Hadoop?
What is the difference between coalesce and repartition?
Is there any benefit to learning MapReduce if Spark is better than MapReduce?
Explain the transformation and action operations on an Apache Spark RDD.
Can you explain the difference between Apache Mahout and Apache Spark's MLlib?
How do you read a file in HDFS?