The following table maps course session dates, lesson topics, references, and content links for DAT-102, Introduction to Data Analytics
course | date | wk no. | session links | learning objectives | out-of-class work |
---|---|---|---|---|---|
DAT-102 | Sat 6-FEB'21 |
1 |
Introduction to data analyticsFurther optional readingFamiliarize yourself with the range of data types provided by Python environments. |
|
Please develop a "strip survey" containing a categorical question and a opinion/spectrum question. Compose the tiny survey in a text document and upload to a folder named with your public ID in our shared drive. |
DAT-102 | Sat 13-FEB'21 |
2 |
RecordingRecommended pre-reading from textbook for next weekLock-5 Stats book, ed. 1, Ch. 2, sections 1-4 only (sections 5-6 explore two quant variables which we'll cover later) |
|
TODO for Spring 2021Completion target: before class, 20-FEB
|
DAT-102 | Sat 20-FEB'21 |
3 |
RecordingsGraph Data StructuresThis week we continued our work from last week's data structures module: in part 1 of the session, we encoded our graphical representations of a graph data structure in tabular format for easy exporting to a text file In Part 2, We built a survey instrument for administering a question of interest to our peers in the course. |
Phase 1: Encode a peer's graph in tabular format
Complete ALL peer surveys by Tue, 23-FEB at Midnight
|
|
DAT-102 | Sat 27-FEB'21 |
4 |
SP'21 RecordingStrip survey analysisSummary-based descriptive stats: mean and standard deviationExtra |
SP21: TODO
|
|
DAT-102 | Sat 6-MAR'21 |
5 |
SP'21 RecordingsLock-5 Pre-ReadingEdition 1, Sections 2.3 (Spread) and 2.4 (Box Plots) Lock^5 Book sectionsChapter 2, Sections 1-Sections 4 |
Draw conclusions about a data set based on box plots Compute the standard deviation of a data set, interpret the results, and make inferences using Z-scores |
|
DAT-102 | Sat 13-MAR'21 |
6 |
Applying mean, median, and standard deviationMatch up the Distribution, stats blocks, box plot, and data source in this file |
|
Task 1: Strip Survey AnalysisPlease populate column B - U in our 'Strip_Survey' tab of our master tracker, including most importantly Column U which asks you to describe the relationship between your box plots Task 2: Stdev, Z-score practice packetNOTE: Several pages are in inverted order! (9 before 8, etc.) Key will be posted next week |
DAT-102 | Sat 20-MAR'21 |
7 |
Session RecordingBegin library section sampling, to be continued next week. Sampling!Begin library section sampling, to be continued next week. |
|
Step 1: Please sample 30 books from each of your two library sections: record the call number, number of pages, and some creative variable for each book in each section. Please create a data analysis home in our shared google drive sp21_librarySampling subdirectory. Also, share your chosen Library of Congress sections in our class master tracker on the library_sampling tab Step 2:Begin populating our analysis guide, either in your spreadsheet itself or in the editable document Step 3:Preview the Learning Resources in the library module page so your brain is ready to apply confidence intervals to our estimates. |
DAT-102 | Sat 27-MAR'21 |
8 |
Session RecordingPopulation inference from samplingLibrary samples continued |
Use the boostrap sampling procedure to make an estimate of a population parameter from sample data. |
Spring break TODO: library analysisNOTE: Skip hypothesis testing questions/sectionsDedicate a few hours hours to carefully responding to the analysis questions from your library sample. See our sampling module, and choose the library sampling mini-project. Uplod all your work in our shared google drive (load your google.com account, then navigate to drive.google.com then select "shared with me" in the left sidebar then locate our shared directory for SP'21 DAT-102) for formal submission Be sure to name your files with your public first name and your library section prefixes. |
DAT-102 | Sat 3-APR'21 |
- | Spring break; No class all week | ||
DAT-102 | Thu 15-APR'21 from 6:00 - 9:00pm |
9 & 10 |
This is a combined class session held from 6pm-9pm which is a make-up of the cancelled 10-APR'21 session and the rescheduled 17-APR'21 session due to instructor family funeral. Since this is not our normal meeting time, please attend if you can, but don't sweat if you can't. I'll post the recording straight away. Session RecordingReview of CI Fundamentals Socrative QuizReview Library Sample FindingsReview of ENDS article confidence intervalsLog our final project ideas |
|
Wrap a bow on library samplingSTEP 1:Complete as much as feasible of the library analysis questions and data sheets and upload them to our shared google drive. STEP 2: Transfer essential attributes of your page count sampling to our master SP21 tracker Library_Sampling tab: specifically the population point estimate, the estimated population standard deviation (called the standard error) and your 95% confidence interval bounds. Conf. Interval article studyPlease study the two American Journal of Public Health articles distributed in class. Prepare to dig into their confience intervals for each sub-population:
|
DAT-102 | Sat 24-APR'21 |
11 |
Session RecordingIntroduce BiVariate analysisSee Lock5 Stats Sections: 2.5: two quant variables with scatter plots US Cesus and ACSThe longest-running and most comprehensive sample-based data set is the US Census American Community Survey (ACS), the data from which is publicly accessible and incredibly rich. |
|
Step 1: Study AJPH Articles from last week's homework and Prepare for socrative quiz on conf intervalsStep 2: Populate our master tracker Library_sampling sheet with your library dataStep 3: Familiarize yourself with American factfinder for next week's worktime and propse soe variables of interest to youStart thinking about your final project |
DAT-102 | Sat 1-MAY'21 |
12 |
RecordingsCensus GroupworkFinal project practice and design |
1 |
Begin final projectOPTIONAL Out of class:Digest PGH Inquality reportDue to COVID-19 reorganiation, we will be unable to discuss the data and the sociology behind Pittsburgh's Inequality Across Gender and Race Report issued by the Pittsburgh Gender Equity Commission. As you desire, please engage with the report on your own and with others in your various circles. These discussion questions may be a guide for your discussion:
|
DAT-102 | Sat 8-MAY'21 |
13 |
Session RecordingExperiments: Mind FoodRandomized Controlled TrialsFinal project concept development |
Identify experimental design components in several novel experiments. |
Undertake and document your final project for sharing next week. Create a sub-folder with your first name and your topic of study inside the directory in our shared drive called dat102_sp21_finalProjects |
DAT-102 | Sat 15-MAY'21 |
14 |
FINAL EXAM PERIOD from 10:00am-12:00noon |