Hadoop (4218)
Big Data General (104)
Big Data AllOther (3)
Is kafka a message queue?
Define primary key in Apache Cassandra?
Why MapReduce uses the key-value pair to process the data?
Explain sum(), max(), min() operation in Apache Spark?
How to use combiner in hadoop ?
What is difference between secondary namenode, checkpoint namenode & backupnod secondary namenode, a poorly named component of hadoop?
What is Bucketing and Clustering in Hive?
What is data processing in big data?
What is the unit of data that flows through a flume agent?
Explain the terms Spark Partitions and Partitioners?
What are some of the apache pig use cases you can think of?
What do you mean by consistency in Cassandra?
Explain the Scope operators used in hbase?
How can I install Cloudera VM in my system?
What is the difference between leader and follower in kafka?