What does repartition do in spark?
Answer / Sandeep Shandilya
Repartition in Apache Spark is a function used to change the number of partitions for a DataFrame or RDD. It helps to balance the data distribution across nodes by either increasing or decreasing the number of partitions.
| Is This Answer Correct ? | 0 Yes | 0 No |
What is rdd lineage graph? How is it useful in achieving fault tolerance?
What is RDD?
What languages support spark?
What is mlib?
How does Apache Spark handles accumulated Metadata?
Can a spark cause a fire?
What are the various advantages of DataFrame over RDD in Apache Spark?
What are the libraries of spark sql?
What does apache spark stand for?
Does spark need hdfs?
Does spark sql use hive?
What is apache spark in big data?
Apache Hadoop (394)
MapReduce (354)
Apache Hive (345)
Apache Pig (225)
Apache Spark (991)
Apache HBase (164)
Apache Flume (95)
Apache Impala (72)
Apache Cassandra (392)
Apache Mahout (35)
Apache Sqoop (82)
Apache ZooKeeper (65)
Apache Ambari (93)
Apache HCatalog (34)
Apache HDFS Hadoop Distributed File System (214)
Apache Kafka (189)
Apache Avro (26)
Apache Presto (15)
Apache Tajo (26)
Hadoop General (407)