Hadoop (4218)
Big Data General (104)
Big Data AllOther (3)
How does cassandra perform read operation?
What is shuffleing in mapreduce?
Explain about the common workflow of a Spark program?
Please enumerate the various components of the Spark Ecosystem.
Is it possible to provide multiple input to Hadoop? If yes then how can you give multiple directories as input to the Hadoop job?
How do I clear my spark cache?
What is the use of truncate command?
What file systems Spark support?
Can we change Replication Factor on a live cluster?
What are the relational databases supported in sqoop?
Give the data storage units in Cassandra?
Have you ever used counters in hadoop?
when hadoop enter in safe mode?
Is big data unstructured?
What is data skew and how do you fix it?