Rr 26 add schemas #29

Open
ryanraaschCDC wants to merge 13 commits into main from rr-26-add-schemas

@ryanraaschCDC
Collaborator

No description provided.

ryanraaschCDC linked an issue on Feb 27, 2026 that may be closed by this pull request

Contributor

Copilot AI left a comment

Pull request overview

Adds Pandera dataframe schema modules (with mock data generators) for STF datasets and wires the param_estimates dataset config to a schema file, along with a dependency update.

Changes:

  • Add new schema modules under cfa/catalog/public/datasets/stf/schemas/ (e.g., nwss, nssp_gold, nhsn_hrd, param_estimates)
  • Reference the new param_estimates schema from cfa/catalog/public/datasets/stf/param_estimates.toml
  • Update pyproject.toml dependencies (notably faker)

Reviewed changes

Copilot reviewed 5 out of 7 changed files in this pull request and generated 9 comments.

| File | Description |
| --- | --- |
| pyproject.toml | Updates project dependencies (adds an extra faker entry). |
| cfa/catalog/public/datasets/stf/schemas/param_estimates.py | Introduces a Pandera schema + mock data generator for param_estimates. |
| cfa/catalog/public/datasets/stf/schemas/nwss.py | Introduces a Pandera schema + mock data generator for nwss. |
| cfa/catalog/public/datasets/stf/schemas/nssp_gold.py | Introduces a Pandera schema + mock data generator for nssp_gold. |
| cfa/catalog/public/datasets/stf/schemas/nhsn_hrd.py | Introduces Pandera schemas + mock data generators for nhsn_hrd extract/load. |
| cfa/catalog/public/datasets/stf/param_estimates.toml | Adds a schemas pointer to the new param_estimates schema module. |

Comment thread cfa/catalog/public/datasets/stf/schemas/nssp_gold.py Outdated
Comment thread pyproject.toml Outdated
Comment thread cfa/catalog/public/datasets/stf/schemas/nssp_gold.py Outdated
Comment thread cfa/catalog/public/datasets/stf/schemas/nwss.py
Comment thread cfa/catalog/public/datasets/stf/schemas/nwss.py
Comment thread cfa/catalog/public/datasets/stf/schemas/nwss.py
Comment thread cfa/catalog/public/datasets/stf/schemas/nwss.py Outdated
Comment thread cfa/catalog/public/datasets/stf/schemas/nwss.py Outdated
Comment thread cfa/catalog/public/datasets/stf/schemas/param_estimates.py
ryanraaschCDC and others added 4 commits February 27, 2026 18:07
Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com>
Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com>
Contributor

Copilot AI left a comment

Pull request overview

Copilot reviewed 5 out of 7 changed files in this pull request and generated 11 comments.


Comment thread cfa/catalog/public/datasets/stf/schemas/param_estimates.py
Comment thread cfa/catalog/public/datasets/stf/schemas/nwss.py
Comment thread cfa/catalog/public/datasets/stf/schemas/nwss.py Outdated
Comment thread cfa/catalog/public/datasets/stf/schemas/nwss.py
Comment thread cfa/catalog/public/datasets/stf/schemas/nwss.py Outdated
Comment thread cfa/catalog/public/datasets/stf/schemas/nwss.py Outdated
Comment thread cfa/catalog/public/datasets/stf/schemas/nwss.py
Comment thread cfa/catalog/public/datasets/stf/schemas/param_estimates.py
Comment thread cfa/catalog/public/datasets/stf/schemas/param_estimates.py
Comment thread cfa/catalog/public/datasets/stf/schemas/nwss.py Outdated
Contributor

Copilot AI left a comment

Pull request overview

Copilot reviewed 5 out of 7 changed files in this pull request and generated 2 comments.


Comment thread cfa/catalog/public/datasets/stf/schemas/nwss.py
Comment thread cfa/catalog/public/datasets/stf/schemas/nhsn_hrd.py
ryanraaschCDC requested a review from xop5 on March 2, 2026 19:13
ryanraaschCDC requested a review from damonbayer on March 3, 2026 16:07
@damonbayer
Contributor

@ryanraaschCDC Is there some documentation about how I can use this?

@ryanraaschCDC
Collaborator Author

There is documentation in cfa-dataops. I can add documentation here as well if you think that would be helpful.
https://github.com/CDCgov/cfa-dataops/blob/main/docs/data_developer_guide.md

@ryanraaschCDC
Collaborator Author

I added some documentation on using schemas and mock data.

Development

Successfully merging this pull request may close these issues.

add schemas

5 participants