Big Data Interview Questions
Questions Answers Views Company eMail

How many ways we can create rdd?

196

What does repartition do in spark?

195

What is the driver program in spark?

181

What is spark submit?

188

How do I clear my spark cache?

178

What is a partition in spark?

207

What is spark vectorization?

186

What is off heap memory in spark?

182

What is a tuple in spark?

191

Is spark an etl?

190

How is rdd distributed?

197

What are the common transformations in apache spark?

184

What is the difference between dataset and dataframe in spark?

221

What is distributed cache in spark?

199

What is catalyst framework in spark?

189


Un-Answered Questions { Big Data }

Compare Spark vs Hadoop MapReduce

199


What is the task of Spark Engine

230


Explain about the different channel types in Flume. Which channel type is faster?

145


List out the different stream grouping in apache storm?

217


According to IBM, what are the three characteristics of Big Data?

246






Can any impala query also be executed in hive?

74


What do you know about nlineinputformat?

399


Can you explain about the cluster manager of apache spark?

178


What is a rack?

233


What does hdfs mean?

24


What is graph db? Explain with an example.

55


What is difference between dataset and dataframe?

214


what is the traditional method of message trfer?

314


What is a dataset? What are its advantages over dataframe and rdd?

210


Explain the hadoop configuration files at present?

363