Hadoop Interview Questions, Answers for Freshers and Experienced asked in various Company Job Interviews

Apache Hadoop (394)
MapReduce (354)
Apache Hive (345)
Apache Pig (225)

Apache Ambari (93)
Apache HCatalog (34)
Apache HDFS Hadoop Distributed File System (214)
Apache Kafka (189)

Hadoop Interview Questions

Questions Answers Views Company eMail

what do you mean by the worker node?

326

What is rdd lineage graph? How is it useful in achieving fault tolerance?

322

Explain about trformations and actions in the context of rdds?

324

What is the key difference between textfile and wholetextfile method?

292

What do you understand by the parquet file?

294

If there is certain data that we want to use again and again in different transformations, what should improve the performance?

322

Explain partitions?

293

Explain api create or replace tempview()?

351

Define parquet file format? How to convert data to parquet format?

342

Explain mappartitions() and mappartitionswithindex()?

440

Explain pipe() operation. How it writes the result to the standard output?

302

Explain transformation in rdd. How is lazy evaluation helpful in reducing the complexity of the system?

360

How to identify that given operation is transformation/action in your program?

301

explain the use of blinkdb?

313

How do you parse data in xml? Which kind of class do you use with java to parse data?

357

Un-Answered Questions { Hadoop }

What are the uses and applications of mahout ?

Can we set the number of reducers to zero in MapReduce?

752

Hadoop achieves parallelism by dividing the tasks across many nodes, it is possible for a few slow nodes to rate-limit the rest of the program and slow down the program. What mechanism Hadoop provides to combat this?

499

What are the types of hive ddl commands?

808

what Hive query processor does?

712

What is the job of blend () and repartition () in Map Reduce?

684

What is structured and unstructured data?

496

What is the need of MapReduce?

690

What is HBase?

367

What is the use of flume in hadoop?

Explain the concept of bloom filter?

What language is apache spark?

311

What is Distributed Cache in Hadoop?

499

How does executor work in spark?

331

Can you explain how to minimize data transfers while working with Spark?

548

For More Un-Answered { Hadoop } Questions Click Here