What is the role of the file system in a framework?
Answer / Rajan Kumar Jaiswal
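In Apache Spark, a data source is any store of data that can be read and processed. Spark has no storage layer of its own, so the file system supplies the storage that jobs read from and write to. Common data sources include the Hadoop Distributed File System (HDFS), the local file system, Amazon S3, and databases such as Cassandra and MongoDB.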
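As a rough illustration of how one framework can sit on top of several file systems: Spark, through Hadoop's file system layer, picks the storage backend from the URI scheme of the path it is given. The sketch below only mimics that idea with the Python standard library. The `SCHEME_TO_STORAGE` table and the `storage_for` function are hypothetical names made up for this example and are not part of Spark.

```python
from urllib.parse import urlparse

# Hypothetical lookup table for illustration only. Spark resolves schemes
# through Hadoop's FileSystem classes, not through a dictionary like this.
SCHEME_TO_STORAGE = {
    "hdfs": "Hadoop Distributed File System (HDFS)",
    "file": "Local File System",
    "s3a": "Amazon S3",
}


def storage_for(path: str) -> str:
    """Return the storage system implied by a path's URI scheme."""
    scheme = urlparse(path).scheme
    if not scheme:
        # In Hadoop and Spark, a path without a scheme is resolved against
        # the configured default file system (the fs.defaultFS setting).
        return "default file system"
    return SCHEME_TO_STORAGE.get(scheme, "unknown scheme: " + scheme)


if __name__ == "__main__":
    # Example paths are made up; no file is actually opened.
    for p in ("hdfs://namenode:8020/data/events",
              "file:///tmp/events",
              "s3a://my-bucket/events",
              "/data/events"):
        print(p, "->", storage_for(p))
```

In a real PySpark job the same path strings would be handed to a reader such as `spark.read.text(path)`, and the job code stays the same whichever file system holds the data.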