Answer Posted / Anil Singh
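"reduceByKey" is a transformation in Apache Spark that works on pair RDDs, that is, RDDs (Resilient Distributed Datasets) of (key, value) tuples. It first combines the values for each key locally within every partition, then shuffles those partial results across the cluster and merges them so that each key ends up with a single value. The reduction function takes two values belonging to the same key and returns one value of the same type; it should be associative and commutative, because Spark may apply it in any order. Because most of the combining happens before the shuffle, reduceByKey usually moves less data over the network than groupByKey followed by a reduction. It is useful for per-key summaries such as counts and sums. An average cannot be reduced directly, since the average of averages is not the overall average; reduce (sum, count) pairs instead and divide at the end.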
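To make the two phases concrete, here is a minimal plain-Python sketch that imitates the behaviour on a small list of partitions. It is only a simulation, not Spark's implementation: the helper name reduce_by_key and the sample data are made up for illustration. In PySpark itself the equivalent call on a pair RDD would be rdd.reduceByKey(lambda a, b: a + b).

```python
from operator import add


def reduce_by_key(partitions, func):
    """Hypothetical helper: imitate reduceByKey on a list of partitions,
    where each partition is a list of (key, value) pairs."""
    # Phase 1: combine values per key inside each partition (local combine).
    partials = []
    for part in partitions:
        local = {}
        for key, value in part:
            local[key] = func(local[key], value) if key in local else value
        partials.append(local)

    # Phase 2: merge the per-partition results (what happens after the shuffle).
    merged = {}
    for local in partials:
        for key, value in local.items():
            merged[key] = func(merged[key], value) if key in merged else value
    return merged


# Made-up sample data: two partitions of (key, value) pairs.
partitions = [[("a", 1), ("b", 2), ("a", 3)], [("b", 4), ("c", 5)]]

# Sum per key.
sums = reduce_by_key(partitions, add)

# Average per key: reduce (sum, count) pairs, then divide at the end.
pairs = [[(k, (v, 1)) for k, v in part] for part in partitions]
totals = reduce_by_key(pairs, lambda x, y: (x[0] + y[0], x[1] + y[1]))
averages = {k: s / n for k, (s, n) in totals.items()}
```

With this sample data, sums comes out as {"a": 4, "b": 6, "c": 5} and averages as {"a": 2.0, "b": 3.0, "c": 5.0}. The same function is used in both phases, which is why it has to be associative and commutative.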