Big Data Interview Questions
Questions Answers Views Company eMail

What is the flatMap transformation in Apache Spark RDDs?

199

Can you run Apache Spark on Apache Mesos?

211

Describe Partition and Partitioner in Apache Spark.

215

Describe accumulators in Apache Spark in detail.

200

List the languages supported by Apache Spark.

187

Discuss the various running modes of Apache Spark.

196

Describe Spark SQL.

212

Explain SparkContext in Apache Spark.

208

What are the types of transformations in Spark RDD operations?

194

Explain the first() operation on an Apache Spark RDD.

235

In what ways does Apache Spark handle accumulated metadata?

252

Explain the various transformations on an Apache Spark RDD, such as distinct(), union(), intersection(), and subtract().

190

Is it possible to run Apache Spark without Hadoop?

190

What is Apache Spark Streaming?

198

How can you implement machine learning in Spark?

177


Un-Answered Questions { Big Data }

Which classes does Hive use to read and write HDFS files?

32


How is a DAG created in Spark?

184


Write a MapReduce program for character count.

691


What is the ZooKeeper Atomic Broadcast (ZAB) protocol?

5


What is a Hadoop custom partitioner?

719


How can we see all the clusters that are available in Ambari?

123


Name the operating system(s) supported for production Hadoop deployment.

233


What are the use cases of Apache Flume?

66


How is file splitting invoked in Hadoop?

258


What are combiners, and what is their purpose?

616


What is the difference between HBase and Hive?

400


What are a cluster, a node, and a keyspace in Cassandra?

137


How can you override the replication factor in HDFS?

933


What is the difference between coalesce and repartition in Spark?

192


Which one would you choose for a project, Hadoop MapReduce or Apache Spark?

198