Hadoop (4218)
Big Data General (104)
Big Data AllOther (3)
How do you stop a spark?
Why is Reading done in parallel and writing is not in HDFS?
What is the use of get() method?
Why Apache Spark?
What is the command to change the replication factor ?
What is an identity mapper and identity reducer?
What is Resilient Distributed Dataset (RDD) in Apache Spark? How does it make spark operator rich?
What are the components of spark?
What are the different types of nosql databases?
Explain bloom?
What is an "RDD Lineage"?
Specify Cassandra’s importance on Facebook?
Explain what is sqoop in Hadoop ?
What are the benefits of Spark lazy evaluation?
Give the use of the bootstrap panel.