What is coalesce in spark?
Answer / Sakshi Upadhyay
Coalesce in Spark is an operation that re-partitions a DataFrame or Dataset into a specified number of partitions while ensuring that the total amount of data remains the same. This can help to balance the workload among executors.
| Is This Answer Correct ? | 0 Yes | 0 No |
What is hadoop technology?
What is data skew and how do you fix it?
Name the operations supported by rdd?
What is master node in spark?
What are the ways in which Apache Spark handles accumulated Metadata?
Define RDD?
List out the various advantages of dataframe over rdd in apache spark?
Can you explain apache spark?
List some commonly used Machine Learning Algorithm Apache Spark?
What is map side join?
What are the different ways of representing data in Spark?
Explain distnct(),union(),intersection() and substract() transformation in Spark?
Apache Hadoop (394)
MapReduce (354)
Apache Hive (345)
Apache Pig (225)
Apache Spark (991)
Apache HBase (164)
Apache Flume (95)
Apache Impala (72)
Apache Cassandra (392)
Apache Mahout (35)
Apache Sqoop (82)
Apache ZooKeeper (65)
Apache Ambari (93)
Apache HCatalog (34)
Apache HDFS Hadoop Distributed File System (214)
Apache Kafka (189)
Apache Avro (26)
Apache Presto (15)
Apache Tajo (26)
Hadoop General (407)