Big Data Interview Questions
Questions Answers Views Company eMail

What is apache spark for beginners?

185

What is deploy mode in spark?

198

What is pair rdd?

198

What is data pipeline in spark?

197

What is a spark rdd?

217

What are the optimization techniques in spark?

178

Can you run spark on windows?

193

Why is spark good?

195

Do I need to know hadoop to learn spark?

204

Is a distributed machine learning framework on top of spark?

192

How does hadoop achieve fault tolerance?

212

Is hadoop still in demand?

212

What is winutils hadoop?

229

Is hive a nosql database?

365

Is hive similar to sql?

399


Un-Answered Questions { Big Data }

How can we create RDD in Apache Spark?

206


What are Paired RDD?

221


What are different logging levels in cassandra?

54


In Hadoop, which file controls reporting in Hadoop?

479


What hadoop does in safe mode?

377






What is structured data?

368


What do you understand by Executor Memory in a Spark application?

258


What is the History of Cassandra Database ?

44


How can a developer utilize hive?

396


What is pre-requisites for contributing to apache mahout ?

57


What are producer-consumer queues?

1


What are all stats classes in the org.apache.pig.tools.pigstats package?

279


What is faster than apache spark?

188


How is the option in Hadoop to skip the bad records?

663


Explain a common use case for Flume?

60