Worksheets
The purpose of this assignment is to demonstrate that you can:
- collect, clean, and organize large datasets.
- conduct basic analysis of large datasets.
- create functional and well formatted electronic spreadsheet with a graph.
- developed mathematical, computer and quantitative skills
You must supply your project worksheets for grading. This should be in the form of an excel spreadsheet. Your spreadsheet should conform to the following standards.
- Label all data.
- Columns should have headers.
- Worksheets should be named with appropriate names.
- Any computed values should be labeled
- Each original dataset should be contained in a worksheet. This data should be labeled, but otherwise untouched. The worksheet should be named appropriately.
- Each cleaned dataset should be contained in a worksheet. This data should be labeled. This worksheet should be named appropriately.
- Perform each type of analysis in a new worksheet. Unless you have an extremely large dataset, make a new copy of the clean data, then perform the analysis. Make sure all fields are clearly labeled.
- Keep any graphs produced with the data used to produce those graphs.
- Provide a description of the analysis at the top of the worksheet (cell A1 for example)
- Provide a "cover" worksheet.
- Make this the first worksheet.
- Place your name and course section at the top.
- For each additional worksheet, provide a brief description of the contents of the worksheet.
- Provide a "summary" worksheet which summarizes the cleaned data
- The number of records.
- For each numeric field compute the five number summary.
- For each non numeric field, provide an appropriate description.
- When appropriate provide a simple graphic to describe the data.
If you wish, you may used multiple spreadsheets. If you do, please provide a single spreadsheets that contains the raw data, do not duplicate this across all spreadsheets. Provide some form of documentation, most likely a word document, which describes the each file you have provided. This is probably the best option for very large datasets.
This is the actual work you perform in cleaning and analyzing your data. This work will probably have occurred across the entire semester. The work submitted here should correspond directly to the description in the methods document.
All data should be appropriately labeled. Each worksheet should be appropriately named. All computations should be done in an efficient manner. For example, absolute and mixed references should be used where appropriate. Functions should be used when they exist. All graphs should be labeled. All computations should be done using excel.
Item | Weight | Full | Partial |
Cover worksheet | 10% | Present, other worksheets described. | Some information is missing |
Contents |
Original Data | 20% | Present, labeled, unchanged. | |
Clean Data | 20% | Present. Cleaned according to description in methods document. | Not consistent with methods document. |
Summary worksheet | 10% | Present, required computations present, graphics included. | Some statistics are missing, poor or missing graphics. |
Analysis | 40% | Each new analysis in a worksheet. Work conforms to description in methods document. | |
Please note, points will be taken off for the following
- Unlabeled or poorly data.
- Unnamed or poorly named worksheets.
- Work performed manually or in a grossly inefficient manner.
- Descriptions not provided.
- Graphs missing labels.
This work is due November 25 and should be submitted to the D2L folder Project Worksheets.