How would you determine the number of partitions of an RDD? How can the number of partitions be specified?
Answer / Ms Varsha
In PySpark, the number of partitions of a Resilient Distributed Dataset (RDD) can be checked with rdd.getNumPartitions(); note that count() returns the number of records, not the number of partitions. The number of partitions can be specified when the RDD is created, for example when reading data from a file:
rdd = sc.textFile("data.txt", 4)
print(rdd.getNumPartitions())
In this example, the textFile function takes two arguments: the path to the data and the minimum number of partitions (here 4).
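To see what "splitting into partitions" means, here is a minimal plain-Python sketch of how parallelize-style chunking divides a dataset into roughly equal slices. The helper name split_into_partitions is hypothetical, for illustration only; it is not a PySpark API.

```python
def split_into_partitions(data, num_partitions):
    """Split `data` into `num_partitions` contiguous, roughly equal chunks.

    Illustration only: this mimics the spirit of
    sc.parallelize(data, numSlices), which distributes a local
    collection across the given number of partitions.
    """
    n = len(data)
    chunks = []
    for i in range(num_partitions):
        # Integer arithmetic spreads any remainder across the chunks.
        start = (i * n) // num_partitions
        end = ((i + 1) * n) // num_partitions
        chunks.append(data[start:end])
    return chunks

partitions = split_into_partitions(list(range(10)), 4)
print(len(partitions))              # 4 partitions
print([len(p) for p in partitions])  # sizes [2, 3, 2, 3]
```

With a real SparkContext, the equivalent check would be sc.parallelize(range(10), 4).getNumPartitions().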
What are the different levels of persistence in Apache Spark?
What is parallelize in pyspark?
What is sparkcontext in pyspark?
What is YARN?
What is Pyspark?
Name the parts of Spark Ecosystem?
What is map in pyspark?
What are the optimizations that a developer can make while working with Spark?
Explain the key features of Apache Spark?
What is DStream?
What is PageRank Algorithm?
Explain the components of the Spark Architecture?