Big Data Interview Questions
Questions Answers Views Company eMail

What is a IdentityMapper and IdentityReducer in MapReduce ?

629

Explain Working of MapReduce?

655

Write a Mapreduce Program for Character Count ?

701

how to proceed to write your first mapreducer program?

1021

How to set the number of reducers?

1028

Developing a MapReduce Application?

648

Different ways of debugging a job in MapReduce?

795

Explain the Reducer's reduce phase?

699

Why do we use HDFS for applications having large data sets and not when there are lot of small files?

1 2103

What are the functions of NameNode?

1 1390

How to configure hadoop to reuse JVM for mappers?

783

mapper or reducer?

620

How to resolve IOException: Cannot create directory

667

Does Pig support multi-line commands?

562

How to change replication factor of files already stored in HDFS?

697


Un-Answered Questions { Big Data }

How can you native libraries be included in yarn jobs?

245


Explain coalesce operation in Apache Spark?

232


What are advantages of Spark over MapReduce?

343


what is the meaning of broker in Kafka?

295


Why is flume used?

50






What are the main components of MapReduce Job?

414


Clarify what a task tracker is in hadoop?

229


What is a flume agent?

40


What does the "USE" command in hive do?

422


Explain apache kafka?

301


What is Your Cluster size ?

1131


What is the difference between cassandra, hadoop big data, mongodb, couchdb?

242


What is the relationship between Job and Task in Hadoop?

379


What is the problem with the small file in Hadoop?

369


Define the roles of the file system in any framework?

221