Big Data Interview Questions and Answers for Freshers and Experienced Candidates, as Asked in Job Interviews

Unanswered Questions (Big Data)

Why is there a need for broadcast variables when working with Apache Spark?

What is the basic abstraction in Spark Streaming?

What are shared variables?

Does Spark provide its own storage layer?

What are the advantages of Datasets in Spark?

How do you save an RDD?

What common mistakes do developers make when using Apache Spark?

What happens internally when an RDD is created?

What is Spark MLlib?

What is meant by a transformation? Give some examples.

On which platforms can Apache Spark run?

What do we mean by Parquet?

Explain the various cluster managers in Apache Spark.

What is the difference between a DAG and lineage?

What file formats does Spark support?
