Big Data Interview Questions
Questions Answers Views Company eMail

What is the maximum number of rows in a table?

3

Describe impala shell (impala-shell command)?

5

Can any impala query also be executed in hive?

47

State some advantages of impala?

4

Does impala use caching?

7

How are joins performed in impala?

52

What are distinct operators in impala?

9

What is troubleshooting for impala?

29

Can impala be used for complex event processing?

5

How do I configure hadoop high availability (ha) for impala?

11

How do I know how many impala nodes are in my cluster?

39

What are the differences between relational databases and impala?

5

On which hosts does impala run?

4

State some disadvantages of impala?

3

What is impala data types?

9







Un-Answered Questions { Big Data }

What do you know about keyvaluetextinputformat?

43


How is 0xdata's h2o different from apache mahout ?

1


What are problems with small files and hdfs?

1


What is spark driver application?

11


What is hbase fsck?

13






What is Cassandra?

9


What is a Cluster, Node and Key space in Cassandra ?

21


how would you modify that solution to only count the number of unique words in all the documents?

374


What does the "USE" command in hive do?

84


On which hosts does impala run?

4


What is a IdentityMapper and IdentityReducer in MapReduce ?

306


What are combiners and its purpose?

276


What is Fault Tolerance?

19


Do we need scala for spark?

9


Explain the concept of Tunable Consistency in Cassandra?

51