Big Data Interview Questions
Questions Answers Views Company eMail

Can you list down the limitations of using Apache Spark?

199

Can you use Spark for ETL process?

187

What are the disadvantages of using Spark?

200

Where does Spark Driver run on Yarn?

201

Explain the terms Spark Partitions and Partitioners?

222

What do we mean by Partitions or slices?

196

How can you store the data in spark?

209

What are the advantages of DataFrame?

203

What are the components of Spark Ecosystem?

208

How is data represented in Spark?

214

What is the difference between Spark Transform in DStream and map ?

201

Explain various level of persistence in Apache Spark?

199

What are benefits of DataFrame in Spark?

229

What are the various programming languages supported by Spark?

234

What is accumulator?

226


Un-Answered Questions { Big Data }

Connection between hadoop and big data?

255


What is Mapper in Hadoop MapReduce?

394


Define the management tools in Cassandra?

63


What is Flume Client?

63


Which are the methods to create rdd in spark?

197






What is apache spark written in?

196


How to set which framework would be used to run mapreduce program?

410


Is it possible to rename the output file, and if so, how?

237


What does job conf class do?

381


What do you mean by Schema Resolution?

57


Explain is it possible to search for files using wildcards?

225


How does a namenode handle the failure of the data nodes?

420


Which language is best for spark?

190


What is an input reader in reference to mapreduce?

360


What is output format in hadoop?

407