Hadoop (4218)
Big Data General (104)
Big Data AllOther (3) What is the use of cassandra and why to use cassandra?
How can you use producer api code?
Which among the two is preferable for the project- Hadoop MapReduce or Apache Spark?
How is data represented in Spark?
Define functions of SparkCore?
What are the roles of the file system in any framework?
What are the various storages from which Spark can read data?
What do you mean by data center in Cassandra?
What OS Cassandra supports?
When we write a= load …, what does 'a' called?
What is spark vcores?
State the limitations of Apache Pig?
What is the roadmap for apache driver version one.0?
What webdav is in hadoop?
If reducers do not start before all mappers finish then why does the progress on mapreduce job shows something like map(50%) reduce(10%)? Why reducers progress percentage is displayed when mapper is not finished yet?