Describe Partition and Partitioner in Apache Spark?

Golgappa.net | Golgappa.org | BagIndia.net | BodyIndia.Com | CabIndia.net | CarsBikes.net | CarsBikes.org | CashIndia.net | ConsumerIndia.net | CookingIndia.net | DataIndia.net | DealIndia.net | EmailIndia.net | FirstTablet.com | FirstTourist.com | ForsaleIndia.net | IndiaBody.Com | IndiaCab.net | IndiaCash.net | IndiaModel.net | KidForum.net | OfficeIndia.net | PaysIndia.com | RestaurantIndia.net | RestaurantsIndia.net | SaleForum.net | SellForum.net | SoldIndia.com | StarIndia.net | TomatoCab.com | TomatoCabs.com | TownIndia.com
Interested to Buy Any Domain ? << Click Here >> for more details...

Describe Partition and Partitioner in Apache Spark?

Question Posted / daizy sagar

1 Answers
318 Views
I also Faced
E-Mail Answers

Describe Partition and Partitioner in Apache Spark?..

Answer / Hariom Akash Sahay

In Apache Spark, a partition refers to a logical subset of data within a Resilient Distributed Dataset (RDD). Each RDD partition is stored on one or more worker nodes. A Partitioner is responsible for determining how the data is distributed across partitions. By default, Spark uses a HashPartitioner, which evenly distributes data based on a hash function.

Is This Answer Correct ?

0 Yes

0 No

Post New Answer

More Apache Spark Interview Questions

Why Apache Spark?

What happens when an action is executed in spark?

What rdd stands for?

Why is spark popular?

Can you explain spark streaming?

How can apache spark be used alongside hadoop?

How to start and stop spark in interactive shell?

Compare Transformation and Action in Apache Spark?

What is the difference between cache and persist in spark?

How does Apache Spark handles accumulated Metadata?

What are the various types of shared variable in apache spark?

When to use spark sql?

For more Apache Spark Interview Questions Click Here

Categories

Apache Hadoop (394)
MapReduce (354)
Apache Hive (345)
Apache Pig (225)
Apache Spark (991)
Apache HBase (164)
Apache Flume (95)
Apache Impala (72)
Apache Cassandra (392)
Apache Mahout (35)
Apache Sqoop (82)
Apache ZooKeeper (65)
Apache Ambari (93)
Apache HCatalog (34)
Apache HDFS Hadoop Distributed File System (214)
Apache Kafka (189)
Apache Avro (26)
Apache Presto (15)
Apache Tajo (26)
Hadoop General (407)