Hadoop (4218)
Big Data General (104)
Big Data AllOther (3)
What is the use of foreach operation in Pig scripts?
When do you have to avoid secondary indexes?
What is Apache Spark and what are the benefits of Spark over MapReduce?
Explain what is storage and compute nodes?
What is the difference between kafka and mq?
What is partitioner spark?
Is Hive supports Temporary Tables?
Can flume provide 100% reliability to the data flow?
What are the additional benefits YARN brings in to Hadoop?
What do you use spark for?
What infrastructure do we need to process 100 TB data using Hadoop?
How to specify more than one directory as input to the MapReduce Job?
Can you explain apache kafka?
How can multi-hop agent be set up in Flume?
How to enable/configure the compression of map output data in hadoop?