Hadoop (4218)
Big Data General (104)
Big Data AllOther (3)
What is the difference between kafka and mq?
What is the default input type in MapReduce?
How Mapper is instantiated in a running job?
How do we write our own custom serde?
Can you define yarn?
What do you understand by the partitions in spark?
Define replication strategy?
What is the optimal block size in HDFS?
What do you understand by column family?
What is a MapReduce Combiner?
What is spark yarn executor memoryoverhead?
What are the differences between relational databases and impala?
Who uses Cassandra?
What does rdd stand for in logistics?
Mention some important components of cassandra data models?