What are the features of Fully-Distributed mode?
What is the role of a MapReduce partitioner?
Can Hive run without Hadoop?
What are the advantages of DataFrame?
How do I start Flume in Hadoop?
What are the four basic parameters of a reducer?
Who is the founder of Spark?
Can we change the file cached by the distributed cache?
Is it possible to search for files using wildcards?
What is the difference between coalesce and repartition in Spark?
What is CQL?
Why is the rack awareness algorithm used in Hadoop?
How is data transferred from HDFS to Hive?
What is reduceByKey in Spark?
Which interfaces need to be implemented to create a Mapper and a Reducer in Hadoop?