Is there any benefit to learning MapReduce if Spark is better than MapReduce?
What additional benefits does YARN bring to Hadoop?
How can you trigger automatic clean-ups in Spark to handle accumulated metadata?
What is MapFile?
Where is the Mapper's intermediate key-value output stored?
What is the difference between a node, a cluster, and a data centre?
What are some examples of unstructured data?
What is the use of the export command in Sqoop?
How do I clear my Spark cache?
What is shuffle spill in Spark?
What are the various levels of persistence in Apache Spark?
What is the function of the JobTracker in Hadoop? How many JobTracker instances run on a Hadoop cluster?
Explain how Cassandra writes data.
What ports does Spark use?
What happens if an RDD partition is lost due to a worker node failure?