Big Data Interview Questions
Questions Answers Views Company eMail

What is gossip protocol in Cassandra?

57

How Cassandra provide High availability feature?

40

What is NoSQL database?

62

When to use Cassandra?

50

Is there an update statement?

94

Does cloudera offer a vm for demonstrating impala?

38

How do I try impala out?

50

How does impala compare to hive and pig?

34

How does impala achieve its performance improvements?

33

Can I use impala to query data already loaded into hive and hbase?

45

What happens when the data set exceeds available memory?

85

How much memory is required?

42

Are results returned as they become available, or all at once when a query completes?

52

Why do I have to use refresh and invalidate metadata, what do they do?

31

Why does my select statement fail?

41


Un-Answered Questions { Big Data }

What are the core apis in kafka?

286


How to submit extra files(jars, static files) for Hadoop MapReduce job during runtime?

388


Is it possible to run Apache Spark on Apache Mesos?

200


What is LazyOutputFormat in MapReduce?

385


Explain about the execution plans of a pig script?
or
differentiate between the logical and physical plan of an apache pig script?

244






What is the difference between TextinputFormat and KeyValueTextInputFormat class?

250


What are the four basic parameters of a mapper?

303


What are Flume events?

72


Which java class handles the Input record encoding into files which store the tables in Hive?

423


What are Paired RDD?

225


What are the languages supported by apache spark?

190


What are the all tasks we can perform for managing services using the ambari service tab?

51


What are possible types of Channel Selectors?

76


What is the purpose of textinputformat?

434


How to handle record boundaries in Text files or Sequence files in MapReduce InputSplits?

392