Big Data Interview Questions
Questions Answers Views Company eMail

How does HDFS differ from NFS?

387

How is a task scheduled by a jobtracker?

390

What is the maximum number of JVMs that can run on a slave node?

410

What are the two main parts of the hadoop framework?

352

What is the use of combiners in the hadoop framework?

360

What is the jobtracker?

362

What is the jobtracker and what does it do in a hadoop cluster?

366

How many instances of tasktracker run on a hadoop cluster?

404

What is the use of the tasktracker in a hadoop cluster?

368

How does a namenode handle the failure of the data nodes?

414

How is spark sql different from hql and sql?

183

Is there a module to implement sql in spark? How does it work?

200

What is pagerank in graphx?

188

How is streaming implemented in spark? Explain with examples.

204

Can you use spark to access and analyze data stored in cassandra databases?

210


Un-Answered Questions { Big Data }

Explain the difference between an inputsplit and a block?

216


Is apache spark a tool?

182


Can you explain the benefits of big data?

225


Did you ever run into a lopsided job that resulted in an out-of-memory error?

926


Can aluminum cause a spark?

210






Explain the common input formats in hadoop?

234


What exactly is apache spark?

199


What is the reason for creating a new metastore_db whenever Hive query is run from a different directory?

702


What is a spark rdd?

219


What are the different modes in which we can configure/install Hadoop?

241


What is the use of combiners in the hadoop framework?

360


How is spark different from hadoop?

193


What are the different tools used for monitoring in ambari?

45


How can you start a consumer in kafka?

286


What is spark etl?

200