- Description
- Learning Outcomes
- Assignments
- Contacts
- Delivery of the Learning Module
- Schedule
- Requirements
- Resources
- Folder Structure
This module introduces the essentials of sampling, probability, and survey methodology. This includes simple probability samples, stratified sampling, cluster sampling, dealing with non-response, estimating, and survey quality. Participants will consider the theoretical foundations of different sampling approaches, as well as practical applications of this knowledge towards contexts such as market research, political polling, and the Canadian census. Participants will engage in analysis using the Python programming language.
By the end of the module, participants will be able to: * Implement simple probability samples. * Evaluate complicated sampling procedures and the tradeoffs involved. * Identify and understand sources of error or inaccuracies in data as a result of sampling strategies. * Assess survey quality.
Assignment | Due Date |
---|---|
Sampling and Reproducibility in Python | Friday, November 8 |
Questionnaire Design (Part A) | Saturday, November 9 |
Questionnaire Design (Part B) | Friday, November 15 |
Data Documentation Comparison Worksheet | Saturday, November 16 |
Questions can be submitted to the #cohort-3-help channel on Slack
- Technical Facilitator:
- name and pronouns:
Ciara
,She/Her
- email:
[email protected]
- name and pronouns:
- Learning Support Staff:
- name:
Farzana
- email:
[email protected]
- name:
Kasra
- email:
[email protected]
- name:
Amanda
- email:
[email protected]
- name:
This module will include live learning sessions and optional, asynchronous work periods. During live learning sessions, the Technical Facilitator will introduce and explain key concepts and demonstrate core skills. Learning is facilitated during this time. Before and after each live learning session, the instructional team will be available for questions related to the core concepts of the module. Optional work periods are to be used to seek help from peers, the Learning Support team, and to work through the homework and assignments in the learning module, with access to live help. Content is not facilitated, but rather this time should be driven by participants. We encourage participants to come to these work periods with questions and problems to work through. Participants are encouraged to engage actively during the learning module. They key to developing the core skills in each learning module is through practice. The more participants engage in coding along with the instructional team, and applying the skills in each module, the more likely it is that these skills will solidify.
Live Learning Session | Topic | Assignments | Resources |
---|---|---|---|
1 | Introduction, Probability | Slides 1 Slides 2 |
|
2 | Populations, censuses, surveys, and observational data; Essentials of sampling, asking and observing | Sampling and Reproducibility in Python | Slides 1 |
3 | Simple probability samples, Stratified sampling, Systematic Sampling | Questionnaire Design (Part A) | Slides 1 Slides 2 |
4 | Cluster Sampling, Errors | Slides 1 Slides 2 |
|
5 | Survey Quality, Questionnaire Design, Ethics | Questionnaire Design (Part B) | Slides 1 Slides 2 |
6 | Privacy | Data Documentation Comparison Worksheet | Slides 1 |
- Participants are expected to have completed Shell, Git, and Python learning modules.
- Participants are encouraged to ask questions, and collaborate with others to enhance their learning experience.
- Participants must have a computer and an internet connection to participate in online activities.
- Participants must not use generative AI such as ChatGPT to generate code in order to complete assignments. It should be used as a supportive tool to seek out answers to questions you may have.
- Participants must have VSCode installed with the following extensions:
- We expect Participants to have completed the steps in the onboarding repo.
- We encourage participants to default to having their camera on at all times, and turning the camera off only as needed. This will greatly enhance the learning experience for all participants and provides real-time feedback for the instructional team.
Feel free to use the following as resources:
- Probability Cheatsheet 1
- Probability Cheatsheet 2
- Probability Cheatsheet 3
- LLN Demo
- Amazon Exercises
- Multiple Imputation Exercises
- Survey Sampling Methods
- Population vs Sample
- Survey Margin of Error: What is it? How does it relate to sample size?
- Reproducibility in Research
- Probability Playlist
- Copy and paste your error message
- Copy and paste the code that caused the error, and the last few commands leading up to the error
- Write down what you are trying to accomplish with your code. Include both the specific action, and the bigger picture and context
- (optional) Take a screenshot of your entire workspace
- Sometimes, the error has common solutions that can be easy to find!
- This will be faster than waiting for an answer
- If none of the solutions apply, consider asking a Generative AI tool
- Paste your code, the error message, and a description of your overall goals
- Since we're all working through the same material, there's a good chance one of your peers has encountered the same error, or has already solved it
- Try searching in the DSI Certificates Slack help channel for whether a similar query has been posted
- If the question has not yet been answered, post your question!
- Describe your the overall goals, the context, and the specific details of what you were trying to accomplish
- Make sure to copy and paste your code, your error message
- Copying and pasting helps:
- Your peers and teaching team quickly try out your code
- Others to find your question in the future
- Asking for Help - The Odin Project
- How do I ask a good question? - Stack Overflow
- The XY problem: A question pitfall that won't get useful answers
- How to create a minimal reproducible example
.
├── .github
├── 01_materials
├── 02_activities
├── 03_instructional_team
├── 04_cohort_three
├── .gitignore
├── LICENSE
└── README.md
- .github: Contains issue templates and pull request templates for the repository.
- materials: Module slides and interactive notebooks (.ipynb files) used during learning sessions.
- activities: Contains graded assignments, exercises, and homework to practice concepts covered in the learning module.
- instructional_team: Resources for the instructional team.
- cohort_three: Additional materials and resources for cohort three.
- .gitignore: Files to exclude from this folder, specified by the Technical Facilitator
- LICENSE: The license for this repository.
- README.md: This file.
- steps_to_ask_for_help.png: Guide on how to ask for help.