What is a lineage graph?
Answer / Amit Singh Harit
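In Apache Spark, a lineage graph is the record of the transformations that were applied to source data to produce an RDD. Each RDD keeps references to its parent RDDs and to the operation that derived it, so the chain of dependencies forms a directed acyclic graph. Spark uses this graph to recompute lost partitions after a failure instead of relying on replicated copies of the data. It also helps with debugging and with tracing data provenance, because it shows exactly how a dataset was derived.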
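To make the idea concrete, here is a minimal sketch in plain Python. It is not Spark code: `ToyRDD` and its methods are hypothetical names invented for illustration, and the model ignores partitions, laziness details, and wide dependencies. It only shows the core point that a dataset can store how it was built (its parent and the transformation) and be rebuilt from that record on demand.

```python
from dataclasses import dataclass
from typing import Callable, List, Optional


@dataclass(frozen=True)
class ToyRDD:
    """Hypothetical stand-in for an RDD: it stores the recipe, not the data."""
    name: str
    parent: Optional["ToyRDD"] = None
    source: Optional[tuple] = None  # only set on the root node
    transform: Optional[Callable[[List[int]], List[int]]] = None

    def map(self, fn, name):
        # Record a new node whose parent is this one; nothing is computed yet.
        return ToyRDD(name, parent=self,
                      transform=lambda rows: [fn(r) for r in rows])

    def filter(self, pred, name):
        return ToyRDD(name, parent=self,
                      transform=lambda rows: [r for r in rows if pred(r)])

    def lineage(self):
        # Walk parent links back to the source, then report root-first.
        node, chain = self, []
        while node is not None:
            chain.append(node.name)
            node = node.parent
        return list(reversed(chain))

    def compute(self):
        # Rebuild the data by replaying the recorded transformations.
        if self.parent is None:
            return list(self.source)
        return self.transform(self.parent.compute())


numbers = ToyRDD("source(1..6)", source=(1, 2, 3, 4, 5, 6))
evens = numbers.filter(lambda x: x % 2 == 0, "filter(even)")
squares = evens.map(lambda x: x * x, "map(square)")

print(" -> ".join(squares.lineage()))
print(squares.compute())
```

Because `squares` holds only its lineage, calling `compute()` again after the result is discarded rebuilds the same values from the source, which is the recovery behavior described above. In Spark itself, the lineage of a real RDD can be inspected with `RDD.toDebugString()`.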