What is the difference between Hadoop and Spark?
How does the HDFS client divide a file into blocks when storing it in HDFS?
What is an accumulator in Spark?
Can you define a data lake?
What is sc.textFile?
What is the difference between Apache Pig and Hive?
What are the differences between the caching and persistence methods in Apache Spark?
When are you dealing with static data instead of dynamic data?
What is the communication channel between the client and the NameNode/DataNode?
Explain the different logging levels in Cassandra.
How does the NameNode handle DataNode failures?
Explain what happens in TextInputFormat?
How does big data analysis help businesses increase their revenue?
Clarify what combiners are and when you should use a combiner in a MapReduce job?
Are Spark and Hadoop the same?