Apache Spark Interview Questions

What is off-heap memory in Spark?

What is a tuple in Spark?

Is Spark an ETL tool?

How is an RDD distributed?

What are the common transformations in Apache Spark?

What is the difference between a Dataset and a DataFrame in Spark?

What is the distributed cache in Spark?

What is the Catalyst framework in Spark?

How is the DAG created in Spark?

What does Spark do during speculative execution?

What is heap memory in Spark?

What is the external shuffle service in Spark?

What is a Spark client?

What are the various data sources available in Spark SQL?

Can you run Spark without Hadoop?


Unanswered Apache Spark Questions

What is the default level of parallelism in Apache Spark?


What is the level of parallelism in Spark Streaming, and why is it needed?


How do Hadoop and Spark compare?


What is a Spark standalone cluster?


How do you identify whether a given operation in your program is a transformation or an action?


Explain transformations and actions in the context of RDDs.


Explain the flatMap operation on an Apache Spark RDD.


How does Spark use Hadoop?


How do I download Spark?


What is a cluster in Apache Spark?


How does reduceByKey work in Spark?


Which language is best for Spark?


Do you need to install Spark on all nodes of a YARN cluster?


Explain accumulators in Apache Spark.


How do I optimize my Spark code?