Hadoop (4218)
Big Data General (104)
Big Data AllOther (3)
Is Apache Spark a good fit for Reinforcement learning?
Why do we use persist () on links rdd?
While installing, why does apache have three config files - srm.conf, access.conf and httpd.conf?
What is the Difference SparkSession vs SparkContext in Apache Spark?
What is the purpose of Sqoop List Tables?
Replication causes data redundancy and consume a lot of space, then why is it pursued in hdfs?
Can we run unix shell commands from the hive? Give example?
How spark is used in hadoop?
Is it legal to set the number of reducer task to zero? Where the output will be stored in this case?
When to choose "External Table" in Hive?
Explain about the different channel types in Flume.
List some use cases where Spark outperforms Hadoop in processing.
What is rack-aware replica placement policy?
What does meta-store means in hive?
What is Reducer in MapReduce?