Un-Answered Questions { Big Data }

List out the various advantages of dataframe over rdd in apache spark?

192


What is map in apache spark?

184


Write the command to start and stop the spark in an interactive shell?

187


Define various running modes of apache spark?

189


What are the ways to run spark over hadoop?

181


What is catalyst query optimizer in apache spark?

195


What are the various types of shared variable in apache spark?

185


Define the common faults of the developer while using apache spark?

199


What is the use of spark driver, where it gets executed on the cluster?

213


What is speculative execution in spark?

235


Explain write ahead log(journaling) in spark?

186


Explain values() operation in apache spark?

270


Define the level of parallelism and its need in spark streaming?

232


Define sparksession in apache spark? Why is it needed?

198


Describe different transformations in dstream in apache spark streaming?

202