What does repartition do in spark?

What does repartition do in spark?

Question Posted / mahesh singh

1 Answers
426 Views
I also Faced
E-Mail Answers

What does repartition do in spark?..

Answer / Sandeep Shandilya

Repartition in Apache Spark is a function used to change the number of partitions for a DataFrame or RDD. It helps to balance the data distribution across nodes by either increasing or decreasing the number of partitions.

Is This Answer Correct ?

0 Yes

0 No

Post New Answer

More Apache Spark Interview Questions

What is rdd lineage graph? How is it useful in achieving fault tolerance?

What is RDD?

What languages support spark?

What is mlib?

How does Apache Spark handles accumulated Metadata?

Can a spark cause a fire?

What are the various advantages of DataFrame over RDD in Apache Spark?

What are the libraries of spark sql?

What does apache spark stand for?

Does spark need hdfs?

Does spark sql use hive?

What is apache spark in big data?

For more Apache Spark Interview Questions Click Here

Categories

Apache Hadoop (394)
MapReduce (354)
Apache Hive (345)
Apache Pig (225)
Apache Spark (991)
Apache HBase (164)
Apache Flume (95)
Apache Impala (72)
Apache Cassandra (392)
Apache Mahout (35)
Apache Sqoop (82)
Apache ZooKeeper (65)
Apache Ambari (93)
Apache HCatalog (34)
Apache HDFS Hadoop Distributed File System (214)
Apache Kafka (189)
Apache Avro (26)
Apache Presto (15)
Apache Tajo (26)
Hadoop General (407)