Hadoop (4218)
Big Data General (104)
Big Data AllOther (3)
How to set up local repository manually?
Define data lake?
What is the command for archiving a group of files in hdfs.
What is Apache Spark Streaming?
What are the exact differences between reduce and fold operation in Spark?
Do you need to install spark on all nodes of yarn cluster?
What is stage and task in spark?
How is transformation on rdd different from action?
Why is Kafka technology significant to use?
Is the keyword 'DEFINE' like a function name?
What is Identity reducer?
What do you think about the speculative execution?
When to use Cassandra?
What combiners is and when you should use a combiner in a MapReduce Job?
When a large data set is maintained?