Hadoop (4218)
Big Data General (104)
Big Data AllOther (3) Name the management tools in Cassandra?
What is spark reducebykey?
What language is apache kafka written in?
What is the problem in having lots of small files in hdfs?
what is partitions in hive?
Why rack awareness algorithm is used in hadoop?
What is flume agent?
Define yum?
What is Apache Spark? What is the reason behind the evolution of this framework?
name few other popular column oriented databases like hbase.
Mention how hadoop is different from other data processing tools?
What is the difference between TextInputFormat and KeyValueInputFormat class?
Can we write map reduce program in other than java programming language. How?
What is the concept of SuperColumn in Cassandra?
Explain the process of spilling in MapReduce?