Can you explain how to minimize data transfers while working with Spark?
Answer / Sushant Panigrahi
To minimize data transfers while working with Spark, avoid operations that trigger shuffles where possible (for example, prefer `reduceByKey` over `groupByKey`, since it combines values locally on each partition before any data moves across the network), use broadcast variables to ship small lookup tables to every executor instead of shuffling them in a join, and cache or persist RDDs that are reused so they are not recomputed and refetched, choosing an appropriate storage level (`MEMORY_ONLY`, `MEMORY_ONLY_SER`, `MEMORY_AND_DISK`, or `DISK_ONLY`).