Big Data Interview Questions
Questions Answers Views Company eMail

Explain different transformation on DStream?

196

What are Paired RDD?

225

Name some sources from where Spark streaming component can process real-time data?

194

What is meant by in-memory processing in Spark?

213

Explain what are the various types of Transformation on DStream?

196

Define Partition in Apache Spark?

222

How many types of Transformation are there?

228

How you can remove the element with a critical present in any other Rdd is Apache spark?

210

What is Sparse Vector?

251

Is it possible to run Spark and Mesos along with Hadoop?

191

What is DataFrames?

222

Discuss writeahead logging in Apache Spark Streaming?

215

How can data transfer be minimized when working with Apache Spark?

205

What do you mean by Speculative execution in Apache Spark?

199

Explain about the different cluster managers in Apache Spark

208


Un-Answered Questions { Big Data }

What do you mean by Stream Processing in Kafka?

320


What are the languages supported by apache spark?

190


What will you do when NameNode is down?

671


What is difference between flume and kafka?

58


Does apache flume support third-party plugins?

50






What is the fundamental difference between a MapReduce InputSplit and HDFS block?

341


Is hadoop a memory?

446


Is big data unstructured?

203


What is HDFS Federation?

655


What is the use of cloudera?

228


Explain the role of the Kafka Producer API?

319


What happens to a namenode, when job tracker is down?

423


what happens when Hadoop spawned 50 tasks for a job and one of the task failed?

416


What happens to existing data in my cluster when I add new nodes?

121


Why are we using Flume?

51