Topic :: Apache Spark





Apache Spark Interview Questions
Questions Answers Views Company eMail

What are numpy, scipy, and spark essential datatypes?

105

Can you provide examples for other computations in spark?

78

How is spark sql different from hql and sql?

185

Is there a module to implement sql in spark? How does it work?

206

How is streaming implemented in spark? Explain with examples.

206

Can you use spark to access and analyze data stored in cassandra databases?

218

What are the languages supported by apache spark?

190

Name the components of spark ecosystem.

174

Is there a module to implement sql in spark?

183

How is machine learning implemented in spark?

195

What is executor memory in a spark application?

226

Explain caching in spark streaming.

193

Do you need to install spark on all nodes of yarn cluster?

1870

Define actions in spark.

2106

How do we create rdds in spark?

424




Un-Answered Questions { Apache Spark }

What are Apache Spark, Flume, Lucene, Hama, HCatalog, Mahout, Drill, Crunch and Thrift?

328


Hadoop uses replication to achieve fault tolerance. How is this achieved in Apache Spark?

304


Does Apache Spark provide check pointing?

315


Explain about the popular use cases of Apache Spark

340


Why is Apache Spark faster than Apache Hadoop?

436






Compare Apache Hadoop and Apache Spark?

229


What is Apache Spark?

201


explain the key features of Apache Spark?

218


How is Apache Spark better than Hadoop?

208


Explain the term paired RDD in Apache Spark?

266


Which all languages Apache Spark supports?

240


explain the concept of RDD (Resilient Distributed Dataset). Also, state how you can create RDDs in Apache Spark.

296


What are the types of Apache Spark transformation?

198


Why Apache Spark?

223


Explain transformation and action in RDD in Apache Spark?

205