What is a hive on spark?
Answer / Vivek Bharti
"Hive is a data warehousing SQL-like query language built on top of Apache Spark for reading, writing, and managing large datasets stored in Hadoop Distributed File System (HDFS) or local file system. It provides an SQL-like interface to query data stored in various databases and execute MapReduce jobs under the covers"
| Is This Answer Correct ? | 0 Yes | 0 No |
What is hdfs spark?
How does pipe operation writes the result to standard output in Apache Spark?
What is the difference between persist() and cache()?
Why should I use spark?
On what all basis can you differentiate rdd, dataframe, and dataset?
Explain parquet file?
What is dataframe api?
What is the significance of Sliding Window operation?
What is the difference between spark and hive?
What is skew data?
Please provide an explanation on DStream in Spark.
Explain textFile Vs wholeTextFile in Spark?
Apache Hadoop (394)
MapReduce (354)
Apache Hive (345)
Apache Pig (225)
Apache Spark (991)
Apache HBase (164)
Apache Flume (95)
Apache Impala (72)
Apache Cassandra (392)
Apache Mahout (35)
Apache Sqoop (82)
Apache ZooKeeper (65)
Apache Ambari (93)
Apache HCatalog (34)
Apache HDFS Hadoop Distributed File System (214)
Apache Kafka (189)
Apache Avro (26)
Apache Presto (15)
Apache Tajo (26)
Hadoop General (407)