Processing & transformation
Transform and manipulate data through sorting, grouping, filtering, and combining datasets.
K–2 Competencies
Sort case cards so that observations with similar values for a variable are grouped together.
Order case cards so that a numerical variable is ordered from smallest to largest or largest to smallest.
Classroom resources
Data Science Starter Kit Module 2: Getting Data Ready - Creation and Curation
Welcome to the hands-on world of data collection and organization! This module focuses on where data comes from and how to make it useful for investigation.🔗
Creation and Curation isn’t about turning your students into professional researchers. It’s about helping them understand that data doesn’t just appear—it’s collected by people making decisions about what to measure and how. Whether students are conducting their own surveys or using existing datasets, they need to understand how data gets from the messy real world into organized, analyzable formats.
3–5 Competencies
Manipulate tabular data by grouping cases based on categorical variables (e.g., grouping roller coaster cases so that all wood coasters are together and all steel coasters are together) and ordering cases based on numerical variables (e.g., ordering roller coaster cases "top speed" from slowest to fastest).
Classroom resources
Data Science Starter Kit Module 2: Getting Data Ready - Creation and Curation
Welcome to the hands-on world of data collection and organization! This module focuses on where data comes from and how to make it useful for investigation.🔗
Creation and Curation isn’t about turning your students into professional researchers. It’s about helping them understand that data doesn’t just appear—it’s collected by people making decisions about what to measure and how. Whether students are conducting their own surveys or using existing datasets, they need to understand how data gets from the messy real world into organized, analyzable formats.
6–8 Competencies
Use existing numerical variables to create bins or groups based on benchmark values appropriate for the context, or bins based on numerical ranges (e.g., 0-4, 5-10, 11-15, etc...).
Create a new variable from an existing variable that transforms (e.g., uses a formula to convert units of measure) or recodes data (e.g., blue-->B, red--> R).
Classroom resources
Data Science Starter Kit Module 2: Getting Data Ready - Creation and Curation
Welcome to the hands-on world of data collection and organization! This module focuses on where data comes from and how to make it useful for investigation.🔗
Creation and Curation isn’t about turning your students into professional researchers. It’s about helping them understand that data doesn’t just appear—it’s collected by people making decisions about what to measure and how. Whether students are conducting their own surveys or using existing datasets, they need to understand how data gets from the messy real world into organized, analyzable formats.
9–10 Competencies
Use calculations and logic statements to create new categorical variables based on existing categorical (e.g., if(employment=”employed”, Yes, No)) or quantitative variables (e.g., if(weight<30, light, if(weight>60,heavy ,medium))
Filter data based on groups or subsets of data relevant to the problem and context.
Classroom resources
Data Science Starter Kit Module 2: Getting Data Ready - Creation and Curation
Welcome to the hands-on world of data collection and organization! This module focuses on where data comes from and how to make it useful for investigation.🔗
Creation and Curation isn’t about turning your students into professional researchers. It’s about helping them understand that data doesn’t just appear—it’s collected by people making decisions about what to measure and how. Whether students are conducting their own surveys or using existing datasets, they need to understand how data gets from the messy real world into organized, analyzable formats.
11–12 Competencies
Use an identifying variable (e.g., index, case ID) to merge two separate datasets that have the same observation, but contain different variables to merge datasets together.
Use appropriate procedures to join two datasets together that have different observations with the same variables measured.
Classroom resources
Data Science Starter Kit Module 2: Getting Data Ready - Creation and Curation
Welcome to the hands-on world of data collection and organization! This module focuses on where data comes from and how to make it useful for investigation.🔗
Creation and Curation isn’t about turning your students into professional researchers. It’s about helping them understand that data doesn’t just appear—it’s collected by people making decisions about what to measure and how. Whether students are conducting their own surveys or using existing datasets, they need to understand how data gets from the messy real world into organized, analyzable formats.
Advanced Competencies
Classroom resources
Support other teachers by sharing a resource
Do you have a lesson plan, video, or tip that could help others teaching this topic?
Share feedback on the Learning Progressions
Your feedback helps us improve these progressions for teachers around the world. Thank you!
Share feedback on the Learning Progressions
Your feedback helps us improve these progressions for teachers around the world. Thank you!
Share a classroom resource
Suggesting a resource helps students around the world learn essential data science skills.