8-APRIL-2020: Created for SP20
29-APRIL-2020:Updated with DB links to Lisa's tutorial and sqlite3
Analytics with python
[sqlite3, numpy, pandas, matplotlib]
Dietel provides step-by-step examples with numpy and some pandas in chapter 7: Array-Oriented programming with NumPy
crunching base: numpy
bookData sets for exercises
- Library building blocks: numpy and pandas
- Hello world
- Exploring health code violations with pandas
emoji_objectsCore objects: Index, Series, and DataFrame
The screen clippings of the API documentation below link to the object-specific overview subpages within.
- Code to the specification below. Then upload your a python files and any related documents to your GitHub account oragnized in sensible ways with informative commit messages
Write a script that answers basic data-based questions concerning health violations in Allgheny county
questions to pursue in Allegheny county health code violations
choose your own dataset
Choose a dataset of your own or one listed on the Western PA Regional Data Center. It will be easiest to choose a dataset published as CSV file type for ingestion by pandas.
Write a script that uses pandas to describe the value counts of each key variable.
Then slice your dataset using a value in a column of your choosing and compare a pattern of your choosing between your sliced sub-data-sets.