Hadoop Interview Questions
Questions Answers Views Company eMail

Is it important for Hadoop MapReduce jobs to be written in Java?

478

what job does the conf class do?

484

What is the key- value pair in Hadoop MapReduce?

399

What is InputFormat in Hadoop MapReduce?

357

What are the various InputFormats in Hadoop?

368

Explain InputSplit in Hadoop MapReduce?

385

How much space will the split occupy in Mapreduce?

393

What is a RecordReader in Hadoop MapReduce?

388

What is the difference between HDFS block and input split?

455

How to write MapReduce Programs?

393

What is Apache Hive?

459

What kind of applications is supported by Apache Hive?

442

How can you configure remote metastore mode in Hive?

464

What is indexing and why do we need it?

454

What is the use of Hcatalog?

514


Un-Answered Questions { Hadoop }

Explain Creating an Index?

5


List the various HDFS daemons in HDFS cluster?

20


What is the use of ZooKeeper?

181


In a given spark program, how will you identify whether a given operation is Transformation or Action ?

248


Do you need to install spark on all nodes of yarn cluster?

1866






Why not just use zookeeper for everything?

1


What is the next step after Mapper or MapTask?

379


How would you use Map/Reduce to split a very large graph into smaller pieces and parallelize the computation of edges according to the fast/dynamic change of data?

410


Clarify how job tracker schedules an assignment?

241


How rdd can be created in spark?

181


How is it different from doing machine learning in r or sas?

35


Which companies are mostly using Hive ?

444


What do you understand by node in cassandra?

57


Explain the top() and takeordered() operation?

225


How does NameNode tackle DataNode failures?

869