Assignment 8

This assignment is worth 35 points.

Write a program which will analyze a text document.

When you have finished this assignment, you should have:

Implement a program which can read text from a file, classify the words and count their use frequency. For this program, a word consists of a contiguous collection of letters, surrounded by white space. We will not allow words such as contraction (don't is not a word ), or words with digits in them (BR549, NCC-1701 are not words), and we will not allow hyphenated words (low-budget is not a word). You do not need to decide if a sequence of characters is a word, if it meets the above criteria, it is a word ( kajigger is a word ). You may need to remove punctuation from a word (hello! should be hello). Words are surrounded by white space, so the stream extraction operator will be sufficient for extracting words from a file. You may, however, have to decided if a set of input is a word (according to the above definition). For the purposes of this program, please reduce all words to lower case. (World is the same as world).

Your program should request the name of a text file and should present the user with the following information and abilities:

In addition to your code, please submit a report which Please be advised, the input files might be quite large.

Please do not use hash tables to solve this problem.

You should use other tools, if necessary in your analysis.

You should submit your code via email to danbennett360@gmail.com on the due date. Submit your report in class.