Big Data Interview Questions
Questions Answers Views Company eMail

What happen on the namenode when a client tries to read a data file?

244

What is replication factor?

243

If the hadoop administrator needs to make a change, which configuration file does he need to change?

247

What is distributed copy (distcp)?

252

What is the role of the secondary namenode?

235

Are there any special requirements for namenode?

236

is there a standard procedure to deploy hadoop?

233

What happen if a datanode loses network connection for a few minutes?

326

What is the procedure for namenode recovery?

235

How would an hadoop administrator deploy various components of hadoop in production?

233

What happen if one of the datanodes has much slower cpu?

227

After increasing the replication level, I still see that data is under replicated. What could be wrong?

243

Web-ui shows that half of the datanodes are in decommissioning mode. What does that mean? Is it safe to remove those nodes from the network?

241

What is the difference between a hadoop database and relational database?

232

Can we deploye job tracker other than name node?

402


Un-Answered Questions { Big Data }

Why the name ‘hadoop’?

393


What if rack 2 and datanode fails?

790


Can aluminum cause a spark?

212


What is sink processors?

73


What are the key features of Apache Spark that you like?

259






what does the shell commands “Capture” and “Consistency” determines?

80


Write a Pig UDF Example ?

516


What is the default port of presto?

5


How are large objects handled in Sqoop?

5


How can we control particular key should go in a specific reducer?

638


State some DDL Command with brief Description?

5


What is the difference between a hadoop database and relational database?

232


Define the use of Source Command in Cassandra?

67


What is an rdd?

201


What roles do Replicas and the ISR play?

328