What are the benefits of Apache Kafka over traditional techniques?
How do you check if a particular partition exists?
Is Spark better than MapReduce?
How many layers of Hadoop components are supported by Apache Ambari and what are they?
What is a bag?
In how many ways can we create an RDD in Spark?
How do you invoke the Command Line Interface?
What is a bucket in Hive?
Why is Hive not suitable for OLTP systems?
Explain count_star.
Compare traditional queuing systems with Apache Kafka.
Define a record reader.
Does Spark work with Python 3?
How do you identify whether a given operation is a transformation or an action?
Can you explain broadcast variables?