What is an RDD in Apache Spark? How are RDDs computed in Spark? What are the various ways in which an RDD can be created?
Answer / Neeraj Kumar Soni
An RDD (Resilient Distributed Dataset) in Apache Spark is an immutable, fault-tolerant, distributed collection of data that can be manipulated using transformations and actions. An RDD is divided into smaller chunks called partitions, which are distributed across the nodes of the cluster and processed in parallel. Computation is lazy: transformations only build up a lineage graph, and the actual work is performed when an action is invoked. An RDD can be created in three main ways: by parallelizing an in-memory collection (sc.parallelize), by loading data from external storage such as a local file or HDFS (sc.textFile), or by applying a transformation to an existing RDD.
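The three creation methods above can be sketched in Scala as follows (a minimal example assuming a local Spark runtime; the file path is a placeholder, not a real dataset):

```scala
import org.apache.spark.{SparkConf, SparkContext}

object RddCreationExamples {
  def main(args: Array[String]): Unit = {
    // Local master for illustration only; in a cluster this would differ
    val conf = new SparkConf().setAppName("rdd-creation").setMaster("local[*]")
    val sc = new SparkContext(conf)

    // 1. Parallelize an in-memory collection into an RDD with 2 partitions
    val numbers = sc.parallelize(Seq(1, 2, 3, 4, 5), numSlices = 2)

    // 2. Load an external dataset (placeholder path; uncomment with a real file)
    // val lines = sc.textFile("hdfs:///path/to/input.txt")

    // 3. Derive a new RDD from an existing one via a transformation (lazy)
    val squares = numbers.map(n => n * n)

    // Nothing has been computed yet; the action below triggers the lineage
    println(squares.reduce(_ + _)) // sums the squared values

    sc.stop()
  }
}
```

Note that `squares` is not materialized until `reduce` runs; this laziness is what lets Spark recover lost partitions by replaying the lineage.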
Related questions:

What is a tuple in spark?
What is dataproc cluster?
Which is the best spark certification?
Can you define rdd lineage?
What is spark lineage?
What are broadcast variables in spark?
Is apache spark going to replace hadoop?
How is Apache Spark better than Hadoop?
What is the difference between dataframe and dataset in spark?
How spark works on hadoop?
Compare Transformation and Action in Apache Spark?
What does MLlib do?