The current task is to compute the word counts on a huge number of books in the Hadoop MapReduce environment. A detailed description of the problem is given below: This assignment has been split into three tasks:
- To count the occurrences of words in a book on a per-book basis.
- To count the number of books in which a particular word occurs.
- Find the execution times of above programs on 2,5,10 number of reducers.