Big Data Interview Questions
Questions Answers Views Company eMail

What is a 'block' in HDFS?

176

What is Derby database?

162

How many Daemon processes run on a Hadoop system?

375




What are the limitations of importing RDBMS tables into Hcatalog directly?

177

How the Client communicates with HDFS?

236

What is NoSQL?

165

What is Row Key?

166

What is a Task instance in Hadoop? Where does it run?1

149

What does the overwrite keyword denote in Hive load statement?

1 641

What is IdentityMapper?

152

Why do we use Hadoop?

158

what is the typical block size of an HDFS block?

168

What is Hadoop Custom partitioner ?

232

What is the default block size in Hadoop 1 and in Hadoop 2? Can it be changed?

294

What stored in HDFS?

133







Un-Answered Questions { Big Data }

What is compute and Storage nodes?

170


Does Apache Flume provide support for third party plug-ins?

5


What are the additional benefits YARN brings in to Hadoop?

182


What is a partitioner and how the user can control which key will go to which reducer?

156


How Pig differs from MapReduce?

179






Do you need to install Spark on all nodes of Yarn cluster while running Spark on Yarn?

85


what factors the block size takes before creation?

129


What is the purpose of DataNode block scanner?

193


What is a checkpoint?

164


What is HBase?

115


What is the input type/format in MapReduce by default?

315


what are the nodes in the Hadoop cluster?

164


mapper or reducer?

154


Who is a 'user' in HDFS?

172


What is safe mode in Hadoop?

191