Hadoop (4218)
Big Data General (104)
Big Data AllOther (3)
What is the advantage of a Parquet file?
What is the use of shutdown command?
What is the role of Driver program in Spark Application?
How hdfa differs with nfs?
What is data skew in spark?
Explain what is heartbeat in hdfs?
Do I need to know scala to learn spark?
Is it possible to provide multiple input to Hadoop? If yes then how can you give multiple directories as input to the Hadoop job?
State some impala hadoop benefits?
What is the use of cassandra and why to use cassandra?
We have already sql then why nosql?
What is the full form of fsck?
What is the purpose of sqoop-merge?
What are the features of Fully-Distributed mode?
Explain Spark streaming?