Big Data Interview Questions
Questions Answers Views Company eMail

Does Impala performance improve as it is deployed to more hosts in a cluster, in much the same way that Hadoop performance does?

94

What is an RDD in Apache Spark? How are RDDs computed in Spark? What are the various ways in which an RDD can be created?

219

What role does a worker node play in an Apache Spark cluster? And what is the need to register a worker node with the driver program?

222

What is SparkSession in Apache Spark? Why is it needed?

217

What is the task of the Spark engine?

230

What is the use of SparkContext?

221

How is the processing of streaming data achieved in Apache Spark? Explain.

191

Can you do real-time processing with Spark SQL?

193

Discuss the role of the Spark driver in a Spark application.

198

What are the features of RDDs that make them an important abstraction in Spark?

192

What is Apache Spark? What is the reason behind the evolution of this framework?

183

What are accumulators in Apache Spark?

222

Why are transformations lazy operations on Apache Spark RDDs? How is this useful?

272

Explain the different types of transformations on DStreams.

188

Describe the run-time architecture of Spark.

188


Un-Answered Questions { Big Data }

What is the difference between Pig Latin and HiveQL?

395


What are the characteristics of the Hadoop framework?

367


What is the latest version of Ambari that is currently available?

39


How is Hadoop different from other data processing tools?

380


Describe DataStax OpsCenter.

88






Where are RDDs stored?

194


What do we mean by Partitions or slices?

196


What is fluming?

106


What is the difference between a MapReduce InputSplit and HDFS block?

411


Explain how Hive deserializes and serializes data.

430


When creating an RDD, what goes on internally?

194


How to change replication factor of files already stored in HDFS?

697


How can you use the producer API in code?

300


Which type of tool is Hadoop Sqoop?

5


Is Apache Spark a programming language?

220