Big Data Interview Questions
Questions Answers Views Company eMail

how you can get exactly once messaging from Kafka during data production?

294

What are the types of System tools?

326

Why Should we use Apache Kafka Cluster?

309

Mention what happens if the preferred replica is not in the ISR?

330

Explain the term 'Topic Replication Factor'?

313

Explain the term 'Log Anatomy'?

309

what is the traditional method of message transfer?

370

Explain Sort Order in brief?

162

When to use Avro, explain?

49

Name some AVRO Reference APIs?

38

What is the way of creating Avro Schemas?

41

Who developed Apache Avro?

40

What is the required action you need to perform if you opt for scheduled maintenance on the cluster nodes?

45

What are the purposes of using Ambari shell?

41

What is the role of “ambari-qa” user?

51


Un-Answered Questions { Big Data }

What is the difference between hbase and hadoop/hdfs?

114


Explain what are the various types of Transformation on DStream?

188


Which one would you recommend for hbase table design approach – tall-narrow or flat wide?

175


What is spark parallelize?

203


Are multiline comments supported in Hive?

2825






What is Spark Streaming?

188


Does cloudera offer a vm for demonstrating impala?

38


Mention what are the values stored in the Cassandra Column?

43


Do I need to know scala to learn spark?

205


Explain why the name ‘hadoop’?

374


Explain what is webdav in hadoop?

263


On what basis data will be stored on a rack?

821


What do you understand by unit and ()in scala?

280


Why scala is used in spark?

196


Can You Use Apache Spark To Analyze and Access Data Stored In Cassandra Databases?

190