Programming background and a working Hadoop environment. The text of the novel War and Peace can be downloaded from http://onlinebooks .library.upenn.edu/ and used as the dataset for these exercises. However, other datasets can easily be substituted. Document all processing steps applied to the data.
Use Pig to perform a word count on the specified dataset.