Big Data Interview Questions
Questions Answers Views Company eMail

What is the flatMap transformation in Apache Spark RDD?

203

Can you run Apache Spark on Apache Mesos?

215

Describe Partition and Partitioner in Apache Spark.

219

Describe accumulators in Apache Spark in detail.

204

List the languages supported by Apache Spark.

191

Discuss the various running modes of Apache Spark.

200

Describe Spark SQL.

222

Explain SparkContext in Apache Spark.

216

What are the types of transformations in Spark RDD operations?

196

Explain the first() operation in Apache Spark RDD.

241

What are the ways in which Apache Spark handles accumulated metadata?

256

Explain the various transformations on an Apache Spark RDD, such as distinct(), union(), intersection(), and subtract().

199

Is it possible to run Apache Spark without Hadoop?

192

What is Apache Spark Streaming?

204

How can you implement machine learning in Spark?

181


Un-Answered Questions { Big Data }

What is a partitioner, and how can the user control which key goes to which reducer?

637


List the various types of "Cluster Managers" in Spark.

197


What are some of the different modes used in Hadoop?

203


What is the replication factor?

56


What is the use of “ResultSet execute(Statement statement)” method?

53


Which is better for Spark: Scala or Python?

201


What is the best practice for deciding the number of column families for an HBase table?

127


Where does the word "Sqoop" in Hadoop come from?

5


What does the Consumer API do in Kafka?

308


Is Apache Spark a tool?

182


How can we create RDDs in Apache Spark?

192


Define Partition and Partitioner in Apache Spark.

218


What is aggregateByKey in Spark?

167


How do you come out of insert mode?

409


How does the JobTracker schedule an assignment?

223