Big Data Interview Questions
Questions Views

How do you save an RDD?

218

What common mistakes do developers make when using Apache Spark?

195

When creating an RDD, what goes on internally?

196

What is Spark MLlib?

239

What is meant by Transformation? Give some examples.

212

On which platforms can Apache Spark run?

177

What do we mean by Parquet?

337

Explain the various cluster managers in Apache Spark.

291

What is the difference between DAG and Lineage?

1306

What are the file formats supported by Spark?

195

List some use cases where Spark outperforms Hadoop in processing.

204

Explain the use of the file system API in Apache Spark.

202

How can you minimize data transfers when working with Spark?

231

Explain the common workflow of a Spark program.

181

What do you understand by receivers in Spark Streaming?

216


Un-Answered Questions { Big Data }

Give the difference between Column and SuperColumn?

86


What is HBase HMaster?

294


What is the use of the export command in Hadoop Sqoop?

5


What is JMX?

141


What is Apache Spark used for?

164


Name the operations supported by RDD.

206


What is "map" and what is "reducer" in Hadoop?

367


What relational operators can we use that are related to combining and splitting in Pig language?

320


What are the other components of Cassandra?

52


What do masters consist of?

405


Is Hive a requirement for Impala?

42


What is the use of context object?

250


What are the different composite keys in Cassandra?

47


When Hive is run in embedded mode

1575


How is it different from doing machine learning in R or SAS?

35