How might you minimize data movement (shuffles) when working with Spark?
Answer / Kavita Bhasker
To limit data movement in Spark, you can employ strategies such as: 1) Caching intermediate results (using cache() or persist()) so they can be reused without recomputing and re-shuffling them. 2) Using repartition() or coalesce() judiciously: repartition() triggers a full shuffle, while coalesce() reduces the number of partitions without one. 3) Using broadcast variables (or broadcast joins) to ship a small dataset to every executor once, instead of shuffling the large dataset across the network.
Explain the key features of Apache Spark.
How does Spark SQL differ from HQL and SQL?
What is Lazy Evaluation?
Do you have to install Spark on all nodes of a YARN cluster?
Is PySpark a framework?
What is the use of PySpark?
What is PageRank Algorithm?
How can Spark be connected to Apache Mesos?
What is PySpark?
Is PySpark a language?
What is a DataFrame?
How do I open the PySpark shell on Windows?