CGPydata2.md
Plan

  • Walk through an example: find, read in, clean, and manipulate a data set using pandas.

  • Use matplotlib for simple visualisation.

  • Break into groups.

  • Each group takes a dataset/question.

  • Use an IPython notebook to prepare an analysis.

  • Finish with each group briefly presenting their solution and any problems they encountered.
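As a rough illustration of the read in / clean / manipulate steps in the walk-through, here is a minimal sketch using pandas. The data, the column names (`city`, `date`, `temp_c`), and the helper functions `load_and_clean` and `mean_temp_by_city` are all made up for this example; a real session would read a file found in one of the data repositories instead of an in-memory string.

```python
import io

import pandas as pd

# Hypothetical raw data standing in for a downloaded CSV file.
RAW_CSV = """city,date,temp_c
Dublin,2015-01-01,5.0
Dublin,2015-01-02,
Cork,2015-01-01,7.5
Cork,2015-01-02,8.5
cork ,2015-01-03,9.0
"""


def load_and_clean(source):
    """Read in a CSV, tidy the text column, and drop rows with no reading."""
    df = pd.read_csv(source)
    # Clean: remove stray whitespace and normalise capitalisation.
    df["city"] = df["city"].str.strip().str.title()
    # Clean: discard rows where the measurement is missing.
    df = df.dropna(subset=["temp_c"])
    return df.reset_index(drop=True)


def mean_temp_by_city(df):
    """Manipulate: average temperature per city."""
    return df.groupby("city")["temp_c"].mean()


df = load_and_clean(io.StringIO(RAW_CSV))
summary = mean_temp_by_city(df)
print(summary)
```

With matplotlib installed, calling `summary.plot(kind="bar")` in a notebook would be one simple way to visualise the result.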

Syllabus

  1. Data Wrangling (Revision)

  2. Dates and Times

  3. Summary / Descriptive statistics

  4. Plots and Graphical Methods

  5. Cluster Analysis

  6. Logistic Regression (Prediction of a "success" or "failure")

  7. Linear Regression / Variable Selection Procedures
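To give a flavour of items 2 and 3 only, the sketch below parses date strings and computes a few descriptive statistics with pandas. The `sales` table and its column names are invented for illustration and are not part of the course material; the later topics (clustering, regression) are not covered here.

```python
import pandas as pd

# Hypothetical daily sales figures; dates arrive as plain strings.
sales = pd.DataFrame({
    "date": ["2015-03-30", "2015-03-31", "2015-04-01", "2015-04-02"],
    "units": [10, 14, 9, 15],
})

# Dates and times: parse the strings into datetime values,
# then derive calendar fields through the .dt accessor.
sales["date"] = pd.to_datetime(sales["date"], format="%Y-%m-%d")
sales["month"] = sales["date"].dt.month
sales["weekday"] = sales["date"].dt.day_name()

# Summary / descriptive statistics: count, mean, std, quartiles, etc.
overall = sales["units"].describe()

# A grouped summary: total units sold per calendar month.
monthly_total = sales.groupby("month")["units"].sum()

print(overall)
print(monthly_total)
```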

Data Repositories

References