Hadoop (4218)
Big Data General (104)
Big Data AllOther (3)
How does Apache Spark handles accumulated Metadata?
What does /var/hadoop/pids do?
Why is spark good?
What are the types of traditional method of message transfer?
Can you explain edge nodes in hadoop?
What is apache spark sql?
What is the use of flatmap in spark?
Explain first() operation in Apache Spark?
Does spark require hadoop?
What is the difference between reducebykey and groupbykey?
Why Ambari?
What is Sqoop Import Mainframe Tool and its Purpose?
Describe the distnct(),union(),intersection() and substract() transformation in Apache Spark RDD?
List the advantage of Parquet files?
What are the bookkeeper elements and concepts?