Hadoop Interview Questions
Questions Answers Views Company eMail

What is the difference between Gen1 and Gen2 Hadoop with regards to the Namenode?

895

What is the Job interface in MapReduce framework?

629

What is the problem with HDFS and streaming data like logs

665

What is the Use of SSH in Hadoop ?

635

What infrastructure do we need to process 100 TB data using Hadoop?

732

What is the difference between a Hadoop and Relational Database and Nosql?

702

What is Schema on Read and Schema on Write?

633

How to overwrite an existing output file during execution of mapreduce jobs?

MRO,

831

Write a Pig UDF Example ?

506

What is the difference between traditional RDBMS and Hadoop?

789

What is the main purpose of HDFS fsck command?

1394

What is safe mode in Hadoop?

674

Detail description of the Reducer phases?

572

What is MapFile?

640

What do you understand from Node redundancy and is it exist in hadoop cluster?

893


Un-Answered Questions { Hadoop }

What is a Block Scanner in HDFS?

66


What do you understand by mem-table in cassandra?

45


Does Apache Flume provide support for third party plug-ins?

66


Explain the Use of Hive?

403


What is job tracker in Hadoop?

249






Is a log flume a roller coaster?

58


Can you define what is Event Serializer in Flume?

60


Where are hadoop’s configuration files located and list them?

214


How can you set an arbitrary number of mappers to be created for a job in Hadoop?

263


What do you mean by column family?

41


What are snapshots and how do you create one in cassandra?

38


What are the major differences between Hadoop 2 and Hadoop 3?

228


Explain why the name ‘hadoop’?

374


How many types of tunable consistency are supported in Cassandra?

56


What is Spark MLlib?

233