Big Data Interview Questions
Questions Answers Views Company eMail

Name some independent extensions that contribute to the Ambari codebase?

50

How can we see all the hosts that are available in Ambari?

49

Explain future growth of Apache Ambari?

49

Can we use Ambari Python Client to use of Ambari API’s?

52

Which command is used to list all the tables in a database or list all the columns in a table?

5

Explain HCatLoader APIs?

5

Explain HCatInputFormat and HCatOutputFormat?

5

Explain HCatStorer APIs?

5

Explain HCatWriter?

5

Name all HCatalog Features?

5

State syntax of the command to drop an index?

5

Name Applications and Use Cases of HCatalog?

5

Which command is used to SHOW PARTITIONS lists in HCatalog?

5

Explain HCatReader?

5

What is the role of data transfer API in HCatalog?

5


Un-Answered Questions { Big Data }

Do I need to know hadoop to learn spark?

206


What is the relation between MapReduce and Hive?

376


What are the different Data Types available in Hive?

442


What is hbase in hadoop?

385


What is the concept of SuperColumn in Cassandra?

52






Please explain apache kafka?

325


Is spark based on hadoop?

201


Explain use cases where SequenceFile class can be a good fit?

671


What is bag?

422


how you can improve the throughput of a remote consumer?

317


What can be optimum value for Reducer?

580


What is SSTable? How is it different from other relational tables?

91


Can I run an ensemble cluster behind a load balancer?

1


Do we require two servers for the namenode and the datanodes?

268


What is meant by streaming access?

284