Big Data Interview Questions
Questions Answers Views Company eMail

List the functions of Spark SQL?

377

What is RDD?

396

How to create RDD?

361

Does Apache Spark provide check pointing?

313

Explain about the popular use cases of Apache Spark

340

Do you need to install Spark on all nodes of Yarn cluster while running Spark on Yarn?

418

What are the different String functions available in pig?

435

Differentiate between the physical plan and logical plan in Pig script?

501

What are the use cases of Apache Pig?

497

What do you understand by an inner bag and outer bag in Pig?

574

Explain different execution modes available in Pig?

526

How do users interact with HDFS in Apache Pig ?

521

what are the basic parameters of a Mapper?

504

What is a MapReduce Combiner?

508

Where is Mapper output stored?

501


Un-Answered Questions { Big Data }

What does rack awareness algorithm means?

248


In which language apache kafka is written?

279


What is data cleansing?

246


Explain the Reducer's reduce phase?

699


What is the difference between kafka and mq?

271






What is the importance of eval tool?

5


How do you stop a running job gracefully?

361


Explain the operations of Apache Spark RDD?

198


What is a yaml file in cassandra?

62


What Mapper does?

659


Which one is the master node in HDFS? Can it be commodity hardware?

39


What are all stats classes in the java api package available?

302


What happens to existing data in my cluster when I add new nodes?

121


In which language is the Ambari Shell is developed?

41


What are the different life cycle commands in ambari?

63