Hadoop Interview Questions, Answers for Freshers and Experienced asked in various Company Job Interviews

Apache Hadoop (394)
MapReduce (354)
Apache Hive (345)
Apache Pig (225)

Apache Ambari (93)
Apache HCatalog (34)
Apache HDFS Hadoop Distributed File System (214)
Apache Kafka (189)

Hadoop Interview Questions

Questions Answers Views Company eMail

What is the difference between Input Split and an HDFS Block?

What is NameNode and DataNode in HDFS?

Why HDFS stores data using commodity hardware despite the higher chance of failures?

What are file permissions in HDFS and how HDFS check permissions for files or directory?

How data or file is written into HDFS?

What is throughput? How does HDFS get a good throughput?

What is the benifit of Distributed cache, why can we just have the file in HDFS and have the application read it?

List the various HDFS daemons in HDFS cluster?

What is the difference between NAS and HDFS?

If data is present in HDFS and RF is defined, then how can we change Replication Factor?

What do you mean by metadata in HDFS? Where is it stored in Hadoop?

What is the optimal block size in HDFS?

What do you mean by metadata in HDFS?

Data node block size in HDFS, why 64MB?

What happens if the block in HDFS is corrupted?

Un-Answered Questions { Hadoop }

Explain how is hadoop different from other data processing tools?

769

Assume that an HBase table Student is disabled. So, how to access the student table once it is disabled, by using Scan command?

171

Explain Reliability and Failure Handling in Apache Flume?

108

Is there an easy way to expire a session for testing?

What is a Combiner?

798

Clarify what is sequence file input format?

570

What is difference between dataset and dataframe?

406

What is mlib in apache spark?

319

Explain the general mapreduce algorithm

698

What is DistributedCache and its purpose?

944

When to use hadoop, hbase, hive and pig?

571

How many partitions are created by default in Apache Spark RDD?

319

Can you give a detailed overview about the Big Data being generated by Facebook?

479

Define functions of SparkCore?

514

In MapReduce how to change the name of the output file from part-r-00000?

667

For More Un-Answered { Hadoop } Questions Click Here