Big Data Interview Questions
Questions Answers Views Company eMail

What are the challenges of distributed applications?

5

Explain the types of znodes.

5

Explain the CLI in ZooKeeper.

5

What is ZooKeeper quorum?

5

What is the model of a ZooKeeper cluster?

5

What must we know to work well with ZooKeeper?

5

What do you mean by a znode?

5

Explain the different types of transformations on DStreams.

227

What are the various levels of persistence in Apache Spark?

218

How can you trigger automatic clean-ups in Spark to handle accumulated metadata?

317

What are the disadvantages of using Apache Spark over Hadoop MapReduce?

346

Is it necessary to install Spark on all the nodes of a YARN cluster when running Apache Spark on YARN?

230

Explain the major libraries that constitute the Spark ecosystem.

255

What do you understand by Executor Memory in a Spark application?

260

Is Apache Spark a good fit for reinforcement learning?

208


Unanswered Questions { Big Data }

What is the procedure for creating users in HDFS, and how do you allocate quotas to them?

23


How do you change the number of mappers running on a slave node in MapReduce?

453


Can an RDD be shared between SparkContexts?

194


What is the procedure for NameNode recovery?

235


Can multiple clients write into an HDFS file concurrently?

36






Is it possible to have Hadoop job output in multiple directories?

252


What additional benefits does YARN bring to Hadoop?

638


How is data or a file written into HDFS?

32


Explain the different execution modes available in Pig.

528


Which is the best Hadoop certification?

391


Can you explain the benefits of big data?

231


Name the different types of NoSQL databases.

70


Explain the process of spilling in MapReduce.

343


What are Spark stages?

202


What do you understand by partitions in Spark?

197