Hadoop Interview Questions, Answers for Freshers and Experienced asked in various Company Job Interviews

Apache Hadoop (394)
MapReduce (354)
Apache Hive (345)
Apache Pig (225)

Apache Ambari (93)
Apache HCatalog (34)
Apache HDFS Hadoop Distributed File System (214)
Apache Kafka (189)

Hadoop Interview Questions

Questions Answers Views Company eMail

What does map transformation do? Provide an example.

332

What are the different ways of representing data in Spark?

287

What are the features of Spark?

305

What are shared variables in Apache Spark?

341

What are the various libraries available on top of Apache Spark?

326

Explain the operations of Apache Spark RDD?

302

What are the limitations of Apache Spark?

295

State the difference between persist() and cache() functions.

333

What is Directed Acyclic Graph(DAG)?

329

What are Actions? Give some examples.

330

What is the difference between DSM and RDD?

316

What do you mean by Persistence?

330

How to create a Sparse vector from a dense vector?

374

What are common uses of Apache Spark?

311

In a very huge text file, you want to just check if a particular keyword exists. How would you do this using Spark?

416

Un-Answered Questions { Hadoop }

What is CQL?

What is HDFS block size and what did you chose in your project?

1016

Explain about trformations and actions in the context of rdds?

324

Name the most common input formats defined in hadoop?

481

What is Sqoop Job?

What is the primary purpose of flume in the hadoop architecture?

875

Explain caching in spark streaming.

329

Why is Apache Spark faster than Apache Hadoop?

881

Explain the lookup() operation in Spark?

245

How rdd persist the data?

309

What is map in apache spark?

287

Describe how hbase uses zookeeper?

What is Rack Awareness? What is its need in Hadoop?

572

Name job control options specified by mapreduce.

758

Differentiate between drop and truncate in cqlsh

For More Un-Answered { Hadoop } Questions Click Here