Apache Spark Interview Questions
Questions Answers Views Company eMail

What is Spark SQL?

212

Can you explain how to minimize data transfers while working with Spark?

405

What are the ways to launch Apache Spark over YARN?

194

is it necessary to install Spark on all nodes while running Spark application on Yarn?

196

What is a worker node in Apache Spark?

204

What is worker node in Apache Spark cluster?

207

What is action, how it process data in apache spark

230

What is sparkContext?

203

Name various types of Cluster Managers in Spark.

205

How much faster is Apache spark than Hadoop?

205

Difference between groupByKey vs reduceByKey in Apache Spark?

239

How you can use Akka with Spark?

212

Name three features of using Apache Spark

193

How to start and stop spark in interactive shell?

204

To use Spark on an existing Hadoop Cluster, do we need to install Spark on all nodes of Hadoop?

224


Post New Apache Spark Questions

Un-Answered Questions { Apache Spark }

How many ways we can create rdd in spark?

184


What are Paired RDD?

225


Explain Spark Core?

208


Explain about trformations and actions in the context of rdds?

211


What are the various modes in which Spark runs on YARN? (Local vs Client vs Cluster Mode)

168






What is map in spark?

192


What is a DStream?

238


What is meant by rdd in spark?

176


How to create a Sparse vector from a dense vector?

218


Which storage level does the cache () function use?

209


What are the features and characteristics of Apache Spark?

204


What is worker node in Apache Spark cluster?

207


What is apache spark written in?

196


Describe Accumulator in detail in Apache Spark?

204


What is Resilient Distributed Dataset (RDD) in Apache Spark? How does it make spark operator rich?

187