Hadoop Interview Questions, Answers for Freshers and Experienced asked in various Company Job Interviews

Apache Hadoop (394)
MapReduce (354)
Apache Hive (345)
Apache Pig (225)

Apache Ambari (93)
Apache HCatalog (34)
Apache HDFS Hadoop Distributed File System (214)
Apache Kafka (189)

Hadoop Interview Questions

Questions Answers Views Company eMail

Why the output of map tasks are stored (spilled ) into local disc and not in hdfs?

718

What is the role of recordreader in hadoop mapreduce?

790

What happens when the node running the map task fails before the map output has been sent to the reducer?

697

Define speculative execution?

767

Is it legal to set the number of reducer task to zero? Where the output will be stored in this case?

708

What are the advantages of using map side join in mapreduce?

696

What is a map side join?

731

What is a combiner and where you should use it?

721

When should you use sequencefileinputformat?

713

What is the purpose of textinputformat?

745

What is reduce side join in mapreduce?

657

What do you mean by inputformat?

657

What are the various configuration parameters required to run a mapreduce job?

719

What is a distributed cache in mapreduce framework?

663

What do you mean by data locality?

724

Un-Answered Questions { Hadoop }

How to configure the number of the Combiner in MapReduce?

667

What is the heartbeat used for?

1085

When should we use SORT BY instead of ORDER BY?

746

If a Replica stays out of the ISR for a long time, what does it signify?

668

Can a partition be archived? What are the advantages and Disadvantages?

804

How to load data into table created in hive ?

743

Can we run spark on windows?

293

Suppose there is file of size 514 mb stored in hdfs (hadoop 2.x) using default block size configuration and default replication factor. Then, how many blocks will be created in total and what will be the size of each block?

129

what are views in Hive?

805

What is Apache Flume?

108

Can you explain how it is different from doing machine learning in r or sas?

What is number of executors in spark?

306

What are the components of Pig Execution Environment?

618

Explain the commit log?

What is row rdd in spark?

308

For More Un-Answered { Hadoop } Questions Click Here