Hadoop Interview Questions
Questions Views

What are the advantages of the Pig language?

563

What are the different execution modes available in Pig?

729

Define Spark Streaming.

309

Hadoop uses replication to achieve fault tolerance. How is this achieved in Apache Spark?

300

What is a lineage graph?

345

What are the benefits of Spark over MapReduce?

332

List the functions of Spark SQL.

377

What is an RDD?

396

How do you create an RDD?

361

Does Apache Spark provide checkpointing?

313

Explain the popular use cases of Apache Spark.

340

Do you need to install Spark on all nodes of a YARN cluster when running Spark on YARN?

418

What are the different string functions available in Pig?

433

Differentiate between the physical plan and the logical plan of a Pig script.

497

What are the use cases of Apache Pig?

493


Un-Answered Questions { Hadoop }

What is HDFS High Availability?

702


What do you mean by InputFormat?

345


What is the Lambda architecture in Spark?

188


For a Hadoop job, how will you write a custom partitioner?

381


What type of data should we put in the distributed cache? When should we put data in the distributed cache? How much data should we put in it?

243






What are the main components of a Hadoop Application?

734


What is the function of COGROUP in Pig?

556


Specify the different types of tables available in Hive.

458


What is the latest version of Ambari that is available in the market?

42


What are the best features of Apache Avro?

52


What is the command for archiving a group of files in HDFS?

26


Can you give some examples of Big Data?

267


What is the difference between the ZooKeeper ensemble and ZooKeeper quorum?

5


Explain the sum(), max(), and min() operations in Apache Spark.

206


How are HDFS blocks replicated?

636