Hadoop (4218)
Big Data General (104)
Big Data AllOther (3) Why do we use HDFS for applications having large data sets and not when there are lot of small files?
1 2770
Specify the partitions in hive?
Clarify what a task tracker is in hadoop?
What is the jobtracker?
What is Starvation scenario in spark streaming?
What is the Virtual Node in Cassandra ?
Give examples of the SerDe classes which hive uses to Serialize and Deserialize data?
What is atom in pig?
What are broker configs?
How is recovery achieved in Ambari?
What is the role of data transfer API in HCatalog?
What are the independent extensions that are contributed to the ambari codebase?
What happens when you issue a delete command in hbase?
what needs to be taken care while adding a Column?
Can we run Apache Spark without Hadoop?
Pig Features ?