Big Data Interview Questions
Questions Answers Views Company eMail

What do you mean by logging in cassandra?

43

What do you mean by replication strategy?

48

How data or a file is written into hdfs?

32

What is a namenode in hadoop?

33

What is secondary namenode?

28

What do you mean by meta data in hdfs? List the files associated with metadata.

20

Explain the hdfs architecture and list the various hdfs daemons in hdfs cluster?

30

Can you modify the file present in hdfs?

61

Define data integrity?

20

What do you mean by the high availability of a namenode? How is it achieved?

16

Hdfs stores data using commodity hardware which has higher chances of failures. So, how hdfs ensures the fault tolerance capability of the system?

14

Define hadoop archives?

16

What is secondary namenode? Is it a substitute or back up node for the namenode?

26

What is the command for archiving a group of files in hdfs.

26

Explain the hdfs architecture?

39


Un-Answered Questions { Big Data }

What do you understand by schemardd in apache spark rdd?

2180


What does the "USE" command in hive do?

420


What are Guarantees provided by Kafka?

297


Is spark a mapreduce?

201


What is the use of Bloom Filter in Cassandra?

52






How to invoke Command Line Interface?

5


What is the benefit of kafka?

274


What is client mode in spark?

195


What is ttl (time to live) in hbase?

125


How you can remove the element with a critical present in any other Rdd is Apache spark?

208


What are the key features of Apache Spark that you like?

256


What is meant by in-memory processing in Spark?

209


What are the functionalities of jobtracer?

231


Explain the various types of partitioners in cassandra?

48


Why is BlinkDB used?

218