Big Data Interview Questions
Questions Answers Views Company eMail

Is databricks an etl tool?

11

What is a databricks cluster?

7

What is coarsegrainedexecutorbackend?

15

What is skew data?

7







Un-Answered Questions { Big Data }

What is a Task instance in Hadoop? Where does it run?1

293


What is Immutable?

21


Can You Use Apache Spark To Analyze and Access Data Stored In Cassandra Databases?

19


What are all stats classes in the java api package available?

39


What is hive on spark?

11






What is HDFS ? How it is different from traditional file systems?

259


What is a partitioner and how the user can control which key will go to which reducer?

313


How can you manually partition the rdd?

15


Can you briefly explain the apache mahout?

1


What is the design architecture of Cassandra?

9


How would you tackle counting words in several text documents?

404


What is difference between coalesce and repartition?

12


What are the most memory-intensive operations?

3


What is partitioning key?

39


What are the different ways of executing Pig script?

146