Why do we perform partitioning in Hive?
What are some disadvantages of Avro?
What are the different commands used to start up and shut down Hadoop daemons?
What are the independent extensions that are contributed to the Ambari codebase?
What tools are used in Ambari monitoring?
What is Spark Core?
What is the Cassandra data model?
Does Apache Spark provide checkpoints?
When should secondary indexes be avoided?
How is Hadoop different from other data processing tools?
How does HDFS ensure the integrity of the data blocks it stores?
What are some of the different modes used in Hadoop?
What is the difference between cache and persist in Spark?
What is Apache Hadoop? Why is Hadoop essential for every Big Data application?
What are the prerequisites for contributing to Apache Mahout?