Big Data Interview Questions
Questions Answers Views Company eMail

List some commonly used Machine Learning Algorithm Apache Spark?

186

What is the command to start and stop the Spark in an interactive shell?

207

List out the ways of creating RDD in Apache Spark?

192

What are the various advantages of DataFrame over RDD in Apache Spark?

193

What is flatmap in apache spark?

203

What is the standalone mode in spark cluster?

164

Explain apache spark streaming? How is the processing of streaming data achieved in apache spark?

190

In what ways sparksession different from sparkcontext?

236

Explain fold() operation in spark?

200

Define sparkcontext in apache spark?

190

List out the various advantages of dataframe over rdd in apache spark?

192

What is map in apache spark?

184

Write the command to start and stop the spark in an interactive shell?

185

Define various running modes of apache spark?

189

What are the ways to run spark over hadoop?

181


Un-Answered Questions { Big Data }

What makes Apache Spark good at low-latency workloads like graph processing and machine learning?

234


What is namenode?

285


UPPER or UCASE function in Hive with example?

433


What is client mode in spark?

195


What are the execution modes in the apache pig?

294






Define a worker node?

239


State some impala hadoop benefits?

36


What is the sequence of execution of map, reduce, recordreader, split, combiner, partitioner?

390


Explain values() operation in apache spark?

270


What are file permissions in HDFS and how HDFS check permissions for files or directory?

24


What is a generic UDF in the hive?

397


What is the Use of Sqoop?

5


What is python spark?

206


List down the segments of a hive question processor?

363


Does Cassandra support ACID transactions?

80