Apache Hadoop Interview Questions
Questions Answers Views Company eMail

Why do we use HDFS for applications having large data sets and not when there are lot of small files?

1 2111

What are the functions of NameNode?

1 1403

How to configure hadoop to reuse JVM for mappers?

789

How to resolve IOException: Cannot create directory

673

How to change replication factor of files already stored in HDFS?

703

Which one is default InputFormat in Hadoop ?

1 1657

shouldn't DFS be able to handle large volumes of data already?

764

what is a datanode?

636

How does NameNode tackle DataNode failures?

873

What is InputSplit and RecordReader?

635

What is the purpose of dfsadmin tool?

907

How can you connect an application

714

how is a file of the size 1 GB uncompressed

587

Is map like a pointer?

669

What is the default replication factor?

683


Post New Apache Hadoop Questions

Un-Answered Questions { Apache Hadoop }

What are the modules that constitute the Apache Hadoop 2.0 framework?

692


How can I restart namenode?

428


Suppose Hadoop spawned 100 tasks for a job and one of the task failed. What will Hadoop do?

484


Does this lead to security issues?

430


Explain the master class and the output class do?

367






Explain what if rack 2 and datanode fails?

347


On what basis name node distribute blocks across the data nodes?

761


Is hadoop a memory?

442


How blocks are distributed among all data nodes for a particular chunk of data?

746


What does ‘jps’ command do?

417


What is the functionality of jobtracker in hadoop?

440


How is hadoop different from other data processing tools?

402


what should be the ideal replication factor in hadoop?

386


Explain the features of pseudo mode?

416


Why we cannot do aggregation (addition) in a mapper? Why we require reducer for that?

379