Hadoop Interview Questions
Questions Views

What is standalone mode in a Spark cluster?

164

Explain Apache Spark Streaming. How is streaming data processed in Apache Spark?

190

In what ways is SparkSession different from SparkContext?

238

Explain the fold() operation in Spark.

200

Define SparkContext in Apache Spark.

190

List the advantages of DataFrames over RDDs in Apache Spark.

192

What is map in Apache Spark?

184

Write the commands to start and stop Spark in an interactive shell.

187

Define the various running modes of Apache Spark.

189

What are the ways to run Spark over Hadoop?

181

What is the Catalyst query optimizer in Apache Spark?

195

What are the various types of shared variables in Apache Spark?

185

What are the common mistakes developers make when using Apache Spark?

199

What is the role of the Spark driver, and where does it execute on the cluster?

213

What is speculative execution in Spark?

235


Un-Answered Questions { Hadoop }

What does /*streamtable(table_name)*/ do?

477


Can the NameNode and DataNode run on commodity hardware?

1392


Explain what happens if rack 2 and a DataNode fail.

347


What is executor memory in Spark?

221


Is Spark a programming language?

195


Can Ambari manage multiple clusters?

56


Explain reliability and failure handling in Apache Flume.

80


What is the function of HMaster?

148


Which technique can you use to access an HFile directly, without going through HBase?

123


Define a cell in HBase.

117


What is a key-value pair in MapReduce?

384


What is the Spark architecture?

191


How can you identify whether a given operation is a transformation or an action?

185


What language is Apache Kafka written in?

282


How is Hadoop different from other data processing tools?

402