Big Data Interview Questions
Questions Answers Views Company eMail

Why is Kafka technology significant to use?

308

Explain the role of the offset?

322

What is the process for starting a Kafka server?

322

What can you do with Kafka?

324

What roles do Replicas and the ISR play?

328

What is the role of the ZooKeeper in Kafka?

317

What are the types of traditional method of message transfer?

337

Explain the role of the Kafka Producer API?

315

In the Producer, when does QueueFullException occur?

322

Why are Replications critical in Kafka?

311

What does ISR stand in Kafka environment?

426

What are consumers or users?

356

What is the main difference between Kafka and Flume?

387

Is it possible to use Kafka without ZooKeeper?

421

What is the purpose of retention period in Kafka cluster?

425


Un-Answered Questions { Big Data }

How Hive organize the data?

446


Mention what happens if the preferred replica is not in the ISR?

337


Which technique can you use in hbase to access hfile directly without the help of hbase?

123


What is Spark Core?

203


What is spark mapvalues?

204






State some applications of HBase?

126


What are the various modes in which Spark runs on YARN? (Local vs Client vs Cluster Mode)

168


What is skew data?

202


What is Hive Database?

430


What is the importance of dfs.namenode.name.dir in HDFS?

33


What is the difference between HDFS block and input split?

463


What is difference between client and cluster mode in spark?

207


Explain the use of .mecia class?

495


What is the use of BloomMapFile?

333


What's rdd?

193