Skip to content

Conversation

turbomam
Copy link
Member

No description provided.

@turbomam turbomam requested a review from mslarae13 May 27, 2025 13:56
@turbomam turbomam linked an issue May 27, 2025 that may be closed by this pull request
Copy link

LinkML Linting Results

Summary

Count
Schemas Checked 1
Schemas with Error 1
Schemas with Warning 0
Total Errors 15
Total Warnings 0

Problems per Schema

/home/runner/work/mixs/mixs/src/mixs/schema/mixs.yaml

Errors

  • recommended: Slot 'soil_pH' does not have recommended slot 'description'
  • recommended: Slot 'water_pH' does not have recommended slot 'description'
  • recommended: Subset 'combination_classes' does not have recommended slot 'description'
  • recommended: Subset 'sequencing' does not have recommended slot 'description'
  • recommended: Subset 'environment' does not have recommended slot 'description'
  • recommended: Subset 'nucleic acid sequence source' does not have recommended slot 'description'
  • recommended: Subset 'investigation' does not have recommended slot 'description'
  • standard_naming: Slot has name 'HACCP_term'
  • standard_naming: Slot has name 'IFSAC_category'
  • standard_naming: Slot has name 'air_PM_concen'
  • standard_naming: Slot has name 'ferm_pH'
  • standard_naming: Slot has name 'microb_start_taxID'
  • standard_naming: Slot has name 'soil_pH'
  • standard_naming: Slot has name 'spikein_AMR'
  • standard_naming: Slot has name 'water_pH'

Copy link

github-actions bot commented May 27, 2025

PR Preview Action v1.6.1

🚀 View preview at
https://GenomicsStandardsConsortium.github.io/mixs/pr-preview/pr-972/

Built to branch gh-pages at 2025-06-06 20:49 UTC.
Preview will be ready when the GitHub Pages deployment is complete.

cities:
conifers:
examples:
- value: cypress
Copy link
Member Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

probably want to keep "conifers: pine"

Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

TIL the practice of adding examples on PV's! ✅

Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I don't think PVs should ever have examples

Copy link
Member Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Will switch all examples to PVs of their own and assert is_a between them (hemlock) and their super-classes (conifer)

@lschriml
Copy link
Member

As discussed we need to enumerate these longer lists as option values:
e.g. conifer
conifer:pine

Instead of having them listed as 'examples' in LinkML.

@turbomam
Copy link
Member Author

As discussed we need to enumerate these longer lists as option values: e.g. conifer conifer:pine

Instead of having them listed as 'examples' in LinkML.

Thanks @lschriml

Could you please look though these observed values

I would like to analyze how much of the hierarchy should be captured in MIxS per se

And reach out to the original contributors of this term to see what kinds of revisions they are comfortable with vs what really needs to be left as is

Copy link
Collaborator

@sujaypatil96 sujaypatil96 left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Awesome job turning the string_serialization on cur_land_use to a enumeration. I added a few comments on if/how we can possibly further enrich the CurLandUseEnum (by adding meanings)?

cities:
conifers:
examples:
- value: cypress
Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

TIL the practice of adding examples on PV's! ✅

@@ -25,6 +25,99 @@ subsets:
nucleic acid sequence source:
investigation:
enums:
CurLandUseEnum:
permissible_values:
badlands:
Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Can we add meanings to some of these PV's?

For example, is this an appropriate meaning for the badlands PV? http://purl.obolibrary.org/obo/ENVO_00000127

Copy link
Member Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

This is a really good idea @sujaypatil96

@cmungall and I discussed it today. Chris suggested focusing our PV meaning efforts on

  1. enums where virtually all of the PVs would have a mapping to a well-maintained ontology
  2. slots that are already being frequently used in INSDC Biosamples

I don't think 2 is true here, and I think 1 might be very difficult to accomplish because the current land use PVs are a mixture of land uses (which might be in EnvO) and organisms that might be found on land being used in some way. The organism names are a mixture of singular and plural, and may have many to many mappings within NCBI taxonomy

So I am going to skip PV meaning sin the PR

CurLandUseEnum:
permissible_values:
badlands:
cities:
Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Similarly, meaning for cities – http://purl.obolibrary.org/obo/ENVO_00000856

@sujaypatil96
Copy link
Collaborator

Looks like there's a merge conflict that needs resolution on this PR @turbomam?

@turbomam
Copy link
Member Author

turbomam commented Jun 3, 2025

Copy link

github-actions bot commented Jun 6, 2025

LinkML Linting Results

Summary

Count
Schemas Checked 1
Schemas with Error 1
Schemas with Warning 0
Total Errors 13
Total Warnings 0

Problems per Schema

/home/runner/work/mixs/mixs/src/mixs/schema/mixs.yaml

Errors

  • recommended: Subset 'combination_classes' does not have recommended slot 'description'
  • recommended: Subset 'sequencing' does not have recommended slot 'description'
  • recommended: Subset 'environment' does not have recommended slot 'description'
  • recommended: Subset 'nucleic acid sequence source' does not have recommended slot 'description'
  • recommended: Subset 'investigation' does not have recommended slot 'description'
  • standard_naming: Slot has name 'HACCP_term'
  • standard_naming: Slot has name 'IFSAC_category'
  • standard_naming: Slot has name 'air_PM_concen'
  • standard_naming: Slot has name 'ferm_pH'
  • standard_naming: Slot has name 'microb_start_taxID'
  • standard_naming: Slot has name 'soil_pH'
  • standard_naming: Slot has name 'spikein_AMR'
  • standard_naming: Slot has name 'water_pH'

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

cur_land_use was an enum before LinkML
4 participants