Test Review and Proposal Document
Please note this is my attempt to build a review sheet.
- It is based on the notes for the class.
- I expect that you are able to perform all tasks assigned in the homework.
- I may have missed some items. In general this should not be a problem unless they are fundamental to higher level items.
- I have not completely detailed all items. You should review both my and your notes.
- I may have missed some excel functions/operations we have discussed.
- Proposal Document
- Make sure you do all that the specification asks.
- You don't need to match all of my contents, but you should address all of the issues.
- Read the details on formatting the paper and follow them.
- Make sure you look at the rubrics and do all that is asked there.
- You do not need a complete question. Do your best, but that will come with exploring the data.
- You do not need a complete data dictionary
- Especially if your dataset has many records.
- But do please explore some of these records to show that you have the ability to do this.
- I would do the ones most directly related to the question.
- Remember this document has several purposes (see the top of the assignment)
- Test Review
- Look through the notes.
- What have we discussed?
- Did we spend a little time on it or real concentrated effort?
- What are the steps in doing a data project
- Form a question
- Find the data to investigate
- Explore the data
- Understand the dataset
- Find/clean "dirty" data
- Find a answer to the question
- Interpret results make recommendations.
- Email
- Discuss the importance of email in a professional setting.
- Discuss importance of using email professionally.
- Discuss important points of proper email etiquette
- Discuss the proper use of the fields in an email message
- Demonstrate the ability to send a professional email message with attachments.
- What is Data Science?/ Data
- Discuss the disciplines involved in data science
- What is data? A set of values
- What is information? Data that has been processed, organized, structured or presented to make them meaningful or useful
- Structured data is data that is broken into a number of unique records each record consists of a number of regular fields.
- unstructured data is data that can not be broken into fields easily.
- Electronic data is measured in Bits and Bytes
- Further measurements are derived using the SI prefixes.
- The amount of data available has exploded during the last 20 years.
- Moore's Law
- Moore's Law stated that the power of computers will double about every year and a half.
- We have sort of hit the end of Moore's law.
- When using a spreadsheet
- Isolate assumptions
- Label Everything
- Use the spreadsheet to compute values.
- Have an idea of the result and double check to see if the computation is valid.
- Use excel as a tool, not electronic grid paper.
- Demonstrate the ability to use a spreadsheet
- Enter labels
- Enter data
- Enter simple formulas
- Copy/fill data/formulas
- Identify what $ does in a cell reference and when to use it.
- Format cells
- Super Hero Data
- Ideas:
- Documentation is a constant process, not a step at the end of the project.
- Using document level formatting in Word is preferred to paragraph and text level formatting.
- Word should be used as a document preparation system, not an electronic typewriter.
- Describe the importance of section breaks.
- Charts should be labeled.
- Demonstrate the ability to use word to
- Insert screen shots.
- Change the way a special items interacts with text
- Apply text, paragraph and document level formatting.
- Use the format painter
- Add citations and a bibliography in a selected style to a document using the citation manager.
- Insert section breaks to logically divide a document.
- Build a table of contents.
- Demonstrate the ability to do the following in excel
- Use the freeze panes feature to keep headers visible.
- Adjust column widths to fit given data.
- Use find and replace to replace selected data
- Produce a list of unique values from a set of data.
- use the countif, count, counta, countblank functions
- Produce various charts illustrating data values.
- Circle or Pie
- Box and Whiskers
- Column, Row
- The Height field
- Ideas:
- Explain what a deprecated function is.
- Explain quartile, outlier
- Explain what dirty data is.
- Demonstrate the ability to do the following in excel
- Use the min, max, quartile, average functions
- Use the frequency function
- Finishing the Proposal
- Ideas
- Discuss the difference between linking and embedding
- Demonstrate the ability to
- Modify a style
- Select multiple disjoints cells/data/text
- Build tables in word.
- Format tables in word.
- Insert charts from excel in word.
- Insert tables from excel in word.
- Change the orientation of a section of text in word.
- Page Numbers
- Demonstrate the ability to
- Apply different page numbering styles to different sections of a document.
- Use multilevel builted and numbered lists.
- Find based on formatting