How would you modify that solution to count only the number of unique words across all the documents?
What is the best hardware configuration to run Hadoop?
How do you enable the trash/recycle bin in Hadoop?
What factors does the block size take into account before creation?
What is the meaning of the term "non-DFS used" in Hadoop web-console?
Can Hive run without Hadoop?
How can we change the split size if our commodity hardware has less storage space?
What is Apache Hadoop? Why is Hadoop essential for every Big Data application?
What is a DataNode?
Is it necessary to write jobs for Hadoop in the Java language?
What is the JobTracker?
How do you categorize big data?
Should we use RAID in Hadoop or not?
Do we need to give a password even if the key has been added in SSH?
What is the HDFS block size, and what did you choose in your project?
What is the rack-aware replica placement policy?