Big Data Interview Questions
Questions Answers Views Company eMail

What is the maximum number of rows in a table?

40

Describe impala shell (impala-shell command)?

36

Can any impala query also be executed in hive?

76

State some advantages of impala?

30

Does impala use caching?

38

How are joins performed in impala?

89

What are distinct operators in impala?

36

What is troubleshooting for impala?

65

Can impala be used for complex event processing?

90

How do I configure hadoop high availability (ha) for impala?

45

How do I know how many impala nodes are in my cluster?

74

What are the differences between relational databases and impala?

46

On which hosts does impala run?

37

State some disadvantages of impala?

23

What is impala data types?

33


Un-Answered Questions { Big Data }

What are the default configuration files that are used in hadoop?

401


What do you mean by “data centre” in cassandra?

76


What are the data components used by Hadoop?

428


Use of create-hive-table command in hadoop sqoop?

5


What are the exact differences between reduce and fold operation in Spark?

285






List various commonly used machine learning algorithm?

192


Give examples of the SerDe classes whihc hive uses to Serializa and Deserilize data?

445


Who created spark?

183


Compare Pig vs Hive vs Hadoop MapReduce?

368


What is jmx connector?

5


Explain the concept of bloom filter?

64


What is meant by rdd lazy evaluation?

311


What are the different data formats supported by apache tajo?

5


What is NameNode and DataNode in HDFS?

35


What are the different components of a Hive query processor?

1102