Big Data Interview Questions
Questions Answers Views Company eMail

How Hadoop’s CLASSPATH plays a vital role in starting or stopping in Hadoop daemons?

219

What are the different commands used to startup and shutdown Hadoop daemons?

245

What is configured in /etc/hosts and what is its role in setting Hadoop cluster?

264

How is the splitting of file invoked in Hadoop framework?

261

Is it possible to provide multiple input to Hadoop? If yes then how?

264

Is it possible to have hadoop job output in multiple directories? If yes, how?

244

What is the default replication factor and how will you change it?

254

Explain Hadoop Archives?

249

Explain the Single point of Failure in Hadoop?

243

Explain Erasure Coding in Hadoop?

238

What is Disk Balancer in Hadoop?

229

How would you check whether your NameNode is working or not?

231

Is Namenode machine same as DataNode machine as in terms of hardware?

254

If DataNode increases, then do we need to upgrade NameNode?

279

What happens if the number of reducers is 0 in Hadoop?

255


Un-Answered Questions { Big Data }

Mention what is HiveServer2 (HS2)?

492


Define Cluster?

78


what does the conf.setMapper Class do ?

543


How is a keyspace created in cassandra?

65


Define big data analytics?

199






How does hdfs ensure information integrity of data blocks squares kept in hdfs?

17


How many operational commands in hbase?

119


Is kafka an etl tool?

265


Mention what are the main configuration parameters that user need to specify to run mapreduce job?

442


What is Clustring in Hive?

439


What is the best practice to deploy the secondary name node?

260


How can you implement machine learning in Spark?

181


How does hadoop achieve fault tolerance?

220


What is the full form of fsck?

410


What is speculative execution in spark?

235