

Explain the concept of RDD (Resilient Distributed Dataset). Also, state how you can create RDDs in Apache Spark.




Answer / Sadhana Dubey

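An RDD (Resilient Distributed Dataset) is an immutable, partitioned collection of objects that Apache Spark distributes across the nodes of a cluster and processes in parallel. It is fault-tolerant because each RDD records the lineage of transformations that produced it, so a lost partition can be recomputed instead of being restored from a replica. The RDD is the fundamental data structure of Spark's core API. RDDs can be created in three ways: by loading an external dataset (a local file, an HDFS file, or any other Hadoop-supported source), by parallelizing an existing in-memory collection in the driver program, or by applying a transformation such as map() or filter() to an existing RDD. The creation methods belong to SparkContext, not SparkSession: textFile(path), wholeTextFiles(path), and parallelize(collection) are called on SparkContext in Scala and Python, and on JavaSparkContext in Java. When only a SparkSession is available, the underlying context can be reached through its sparkContext property.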
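
The three ideas in the answer above (immutability, partitioning, and recovery through lineage) can be illustrated without a cluster. The following is only a toy model in plain Python, not Spark's API: the names ToyRDD, parallelize, text_file, and compute_partition are hypothetical and merely mirror the shape of the real calls (parallelize and textFile on SparkContext). It is a sketch, assuming that every partition fits in memory on one machine.

```python
from dataclasses import dataclass
from typing import Callable, Optional, Tuple


@dataclass(frozen=True)
class ToyRDD:
    """Toy stand-in for an RDD (hypothetical, not Spark's class)."""
    source: Optional[Tuple[tuple, ...]] = None  # partitions of a root dataset
    parent: Optional["ToyRDD"] = None           # lineage: the dataset this one derives from
    fn: Optional[Callable] = None               # per-element function applied to the parent

    def num_partitions(self) -> int:
        if self.parent is None:
            return len(self.source)
        return self.parent.num_partitions()

    def compute_partition(self, index: int) -> list:
        # A lost partition is rebuilt by replaying the lineage back to the root.
        if self.parent is None:
            return list(self.source[index])
        return [self.fn(x) for x in self.parent.compute_partition(index)]

    def map(self, fn: Callable) -> "ToyRDD":
        # A transformation never changes this dataset; it returns a new one.
        return ToyRDD(parent=self, fn=fn)

    def collect(self) -> list:
        # Nothing is computed until an action such as collect() is called.
        return [x for i in range(self.num_partitions())
                for x in self.compute_partition(i)]


def parallelize(items, num_slices: int = 2) -> ToyRDD:
    """Create a root dataset from an in-memory collection."""
    items = list(items)
    size = max(1, -(-len(items) // num_slices))  # ceiling division
    parts = tuple(tuple(items[i:i + size]) for i in range(0, len(items), size))
    return ToyRDD(source=parts)


def text_file(path: str, num_slices: int = 2) -> ToyRDD:
    """Create a root dataset with one element per line of a local text file."""
    with open(path, encoding="utf-8") as handle:
        return parallelize((line.rstrip("\n") for line in handle), num_slices)


numbers = parallelize([1, 2, 3, 4, 5], num_slices=2)  # from a collection
squares = numbers.map(lambda x: x * x)                # from another dataset
print(squares.collect())
print(numbers.collect())  # the original dataset is unchanged
```

In real Spark the same steps would be written against a SparkContext, for example parallelize(...) followed by map(...) and collect(); the toy version only shows why an immutable dataset plus a recorded lineage is enough to recompute any partition on demand.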


More Apache Spark Interview Questions

Can you use Spark to access and analyse data stored in Cassandra databases?

How does a Spark RDD work?

How are tasks created in Spark?

How is RDD in Apache Spark different from Distributed Storage Management?

Define "PageRank".

What is lazy evaluation and how is it useful?

Describe the distinct(), union(), intersection() and subtract() transformations in Apache Spark RDD.

Can we run Apache Spark without Hadoop?

Explain the distinct(), union(), intersection() and subtract() transformations in Spark.

How do I start a Spark cluster?

How do you integrate Spark and Hive?

What is the need for Spark DAG?
