Hadoop Interview Questions
Questions | Views

Discuss the various running modes of Apache Spark?

200

Describe Spark SQL?

214

Explain SparkContext in Apache Spark?

214

What are the types of transformations in Spark RDD operations?

196

Explain first() operation in Apache Spark RDD?

239

What are the ways in which Apache Spark handles accumulated Metadata?

254

Explain the various transformations on an Apache Spark RDD, such as distinct(), union(), intersection(), and subtract()?

195

Is it possible to run Apache Spark without Hadoop?

190

What is Apache Spark Streaming?

198

How can you implement machine learning in Spark?

179

List some commonly used machine learning algorithms in Apache Spark?

186

What are the commands to start and stop Spark in an interactive shell?

207

List the ways of creating an RDD in Apache Spark?

192

What are the advantages of DataFrames over RDDs in Apache Spark?

193

What is flatMap in Apache Spark?

203


Un-Answered Questions { Hadoop }

Explain what happens if rack 2 and a DataNode fail?

344


What are the main benefits of using Cassandra?

49


Can we change the replication factor on a live cluster?

61


Name three data sources available in Spark SQL

206


Is Spark a programming language?

195






Can you join multiple fields in Apache

312


What is a Cassandra CQL collection?

49


What does SerDes mean in Apache Kafka?

344


If a DataNode is full, how is it identified?

643


Explain job scheduling through JobTracker

400


What job does the conf class do?

484


What are the main features of SPM in Cassandra?

48


What is the default input type in MapReduce?

381


What are the different logging levels in Cassandra?

54


Does Impala performance improve as it is deployed to more hosts in a cluster, in much the same way that Hadoop performance does?

94