Concept B.4.1

Cleanliness

Work with datasets at increasing levels of cleanliness and identify how datasets need to be curated to address messiness issues.

K–2 Competencies

Work with datasets that are relatively clean (e.g., don't have missing data or errors).

K-2.B.4.1a

Classroom resources

Classroom Tip
Getting Started

Data Science Starter Kit Module 2: Getting Data Ready - Creation and Curation

Welcome to the hands-on world of data collection and organization! This module focuses on where data comes from and how to make it useful for investigation.🔗

Creation and Curation isn’t about turning your students into professional researchers. It’s about helping them understand that data doesn’t just appear—it’s collected by people making decisions about what to measure and how. Whether students are conducting their own surveys or using existing datasets, they need to understand how data gets from the messy real world into organized, analyzable formats.

3–5 Competencies

Work with datasets that require some cleaning (e.g., resolution of missing data or blank cells).

3-5.B.4.1a

Verify data by comparing recorded values to original sources when possible.

3-5.B.4.1b

Classroom resources

Classroom Tip
Getting Started
Thank you for your feedback.
Write more feedback

Data Science Starter Kit Module 2: Getting Data Ready - Creation and Curation

Welcome to the hands-on world of data collection and organization! This module focuses on where data comes from and how to make it useful for investigation.🔗

Creation and Curation isn’t about turning your students into professional researchers. It’s about helping them understand that data doesn’t just appear—it’s collected by people making decisions about what to measure and how. Whether students are conducting their own surveys or using existing datasets, they need to understand how data gets from the messy real world into organized, analyzable formats.

6–8 Competencies

Identify and handle missing values marked by special codes (-99) or blank cells.

6-8.B.4.1a

Distinguish between true zero values and blank cells.

6-8.B.4.1b

Classroom resources

Classroom Tip
Getting Started
Thank you for your feedback.
Write more feedback

Data Science Starter Kit Module 2: Getting Data Ready - Creation and Curation

Welcome to the hands-on world of data collection and organization! This module focuses on where data comes from and how to make it useful for investigation.🔗

Creation and Curation isn’t about turning your students into professional researchers. It’s about helping them understand that data doesn’t just appear—it’s collected by people making decisions about what to measure and how. Whether students are conducting their own surveys or using existing datasets, they need to understand how data gets from the messy real world into organized, analyzable formats.

9–10 Competencies

Work with datasets requiring multiple types of cleaning such as missing values, errors, and anomalies.

9-10.B.4.1a

Clean and prepare datasets before merging multiple sources.

9-10.B.4.1b

Classroom resources

Classroom Tip
Getting Started
Thank you for your feedback.
Write more feedback

Data Science Starter Kit Module 2: Getting Data Ready - Creation and Curation

Welcome to the hands-on world of data collection and organization! This module focuses on where data comes from and how to make it useful for investigation.🔗

Creation and Curation isn’t about turning your students into professional researchers. It’s about helping them understand that data doesn’t just appear—it’s collected by people making decisions about what to measure and how. Whether students are conducting their own surveys or using existing datasets, they need to understand how data gets from the messy real world into organized, analyzable formats.

11–12 Competencies

Apply advanced data cleaning techniques to handle complex data quality issues such as outliers, inconsistencies, and systematic errors.

11-12.B.4.1a

Develop and document reproducible data cleaning workflows that maintain data integrity.

11-12.B.4.1b

Evaluate and validate cleaned datasets using statistical methods and domain knowledge.

11-12.B.4.1c

Classroom resources

Classroom Tip
Getting Started
Thank you for your feedback.
Write more feedback

Data Science Starter Kit Module 2: Getting Data Ready - Creation and Curation

Welcome to the hands-on world of data collection and organization! This module focuses on where data comes from and how to make it useful for investigation.🔗

Creation and Curation isn’t about turning your students into professional researchers. It’s about helping them understand that data doesn’t just appear—it’s collected by people making decisions about what to measure and how. Whether students are conducting their own surveys or using existing datasets, they need to understand how data gets from the messy real world into organized, analyzable formats.

Classroom resources

Support other teachers by sharing a resource

Do you have a lesson plan, video, or tip that could help others teaching this topic?

Developed by our coalition

Coalition organizers

Share feedback on the Learning Progressions

Your feedback helps us improve these progressions for teachers around the world. Thank you!

Thank you! We’ve received your submission.
Oops! Something went wrong while submitting the form.

Share feedback on the Learning Progressions

Your feedback helps us improve these progressions for teachers around the world. Thank you!

Thank you! We’ve received your submission.
Oops! Something went wrong while submitting the form.

Share a classroom resource

Suggesting a resource helps students around the world learn essential data science skills.

Thank you! We’ve received your submission.
Oops! Something went wrong while submitting the form.