Big Data Interview Questions
Questions Answers Views Company eMail

List some commonly used machine learning algorithms in Apache Spark.

132

What is the command to start and stop Spark in an interactive shell?

142

List the ways of creating an RDD in Apache Spark.

135

What are the various advantages of DataFrame over RDD in Apache Spark?

135

What is flatMap in Apache Spark?

150

What is standalone mode in a Spark cluster?

111

Explain Apache Spark Streaming. How is the processing of streaming data achieved in Apache Spark?

140

In what ways is SparkSession different from SparkContext?

173

Explain the fold() operation in Spark.

141

Define SparkContext in Apache Spark.

133

List the various advantages of DataFrame over RDD in Apache Spark.

145

What is map in Apache Spark?

133

Write the command to start and stop Spark in an interactive shell.

131

Define the various running modes of Apache Spark.

137

What are the ways to run Spark over Hadoop?

131


Un-Answered Questions { Big Data }

Can a region server be located on all DataNodes?

119


Why does HDFS store data on commodity hardware despite the higher chance of failures?

5


Define the role of veracity in big data.

83


Why Apache Spark?

156


Define a partitioning key.

198






Discuss the various running modes of Apache Spark.

150


When creating an RDD, what goes on internally?

146


What is the Tungsten engine in Spark?

154


How can you start a consumer in Kafka?

194


What are different modes of metastore deployment in Hive?

304


How to overwrite an existing output file/dir during execution of Hadoop MapReduce jobs?

284


What is the difference between an RDBMS and Hadoop?

159


What are SparkContext and SparkSession?

132


How can businesses benefit from big data?

137


How can you improve the throughput of a remote consumer?

230