Skip to content

current land use enum with URL safe characters#972

Open
turbomam wants to merge 3 commits intomainfrom
969-cur_land_use-was-an-enum-before-linkml
Open

current land use enum with URL safe characters#972
turbomam wants to merge 3 commits intomainfrom
969-cur_land_use-was-an-enum-before-linkml

Conversation

@turbomam
Copy link
Copy Markdown
Member

No description provided.

@turbomam turbomam requested a review from mslarae13 May 27, 2025 13:56
@turbomam turbomam linked an issue May 27, 2025 that may be closed by this pull request
@github-actions
Copy link
Copy Markdown
Contributor

LinkML Linting Results

Summary

Count
Schemas Checked 1
Schemas with Error 1
Schemas with Warning 0
Total Errors 15
Total Warnings 0

Problems per Schema

/home/runner/work/mixs/mixs/src/mixs/schema/mixs.yaml

Errors

  • recommended: Slot 'soil_pH' does not have recommended slot 'description'
  • recommended: Slot 'water_pH' does not have recommended slot 'description'
  • recommended: Subset 'combination_classes' does not have recommended slot 'description'
  • recommended: Subset 'sequencing' does not have recommended slot 'description'
  • recommended: Subset 'environment' does not have recommended slot 'description'
  • recommended: Subset 'nucleic acid sequence source' does not have recommended slot 'description'
  • recommended: Subset 'investigation' does not have recommended slot 'description'
  • standard_naming: Slot has name 'HACCP_term'
  • standard_naming: Slot has name 'IFSAC_category'
  • standard_naming: Slot has name 'air_PM_concen'
  • standard_naming: Slot has name 'ferm_pH'
  • standard_naming: Slot has name 'microb_start_taxID'
  • standard_naming: Slot has name 'soil_pH'
  • standard_naming: Slot has name 'spikein_AMR'
  • standard_naming: Slot has name 'water_pH'

@github-actions
Copy link
Copy Markdown
Contributor

github-actions bot commented May 27, 2025

PR Preview Action v1.8.1

QR code for preview link

🚀 View preview at
https://GenomicsStandardsConsortium.github.io/mixs/pr-preview/pr-972/

Built to branch gh-pages at 2026-03-11 02:39 UTC.
Preview will be ready when the GitHub Pages deployment is complete.

cities:
conifers:
examples:
- value: cypress
Copy link
Copy Markdown
Member Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

probably want to keep "conifers: pine"

Copy link
Copy Markdown
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

TIL the practice of adding examples on PV's! ✅

Copy link
Copy Markdown
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I don't think PVs should ever have examples

Copy link
Copy Markdown
Member Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Will switch all examples to PVs of their own and assert is_a between them (hemlock) and their super-classes (conifer)

@lschriml
Copy link
Copy Markdown
Member

As discussed we need to enumerate these longer lists as option values:
e.g. conifer
conifer:pine

Instead of having them listed as 'examples' in LinkML.

@turbomam
Copy link
Copy Markdown
Member Author

As discussed we need to enumerate these longer lists as option values: e.g. conifer conifer:pine

Instead of having them listed as 'examples' in LinkML.

Thanks @lschriml

Could you please look though these observed values

I would like to analyze how much of the hierarchy should be captured in MIxS per se

And reach out to the original contributors of this term to see what kinds of revisions they are comfortable with vs what really needs to be left as is

Copy link
Copy Markdown
Collaborator

@sujaypatil96 sujaypatil96 left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Awesome job turning the string_serialization on cur_land_use to a enumeration. I added a few comments on if/how we can possibly further enrich the CurLandUseEnum (by adding meanings)?

cities:
conifers:
examples:
- value: cypress
Copy link
Copy Markdown
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

TIL the practice of adding examples on PV's! ✅

enums:
CurLandUseEnum:
permissible_values:
badlands:
Copy link
Copy Markdown
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Can we add meanings to some of these PV's?

For example, is this an appropriate meaning for the badlands PV? http://purl.obolibrary.org/obo/ENVO_00000127

Copy link
Copy Markdown
Member Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

This is a really good idea @sujaypatil96

@cmungall and I discussed it today. Chris suggested focusing our PV meaning efforts on

  1. enums where virtually all of the PVs would have a mapping to a well-maintained ontology
  2. slots that are already being frequently used in INSDC Biosamples

I don't think 2 is true here, and I think 1 might be very difficult to accomplish because the current land use PVs are a mixture of land uses (which might be in EnvO) and organisms that might be found on land being used in some way. The organism names are a mixture of singular and plural, and may have many to many mappings within NCBI taxonomy

So I am going to skip PV meaning sin the PR

CurLandUseEnum:
permissible_values:
badlands:
cities:
Copy link
Copy Markdown
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Similarly, meaning for cities – http://purl.obolibrary.org/obo/ENVO_00000856

@sujaypatil96
Copy link
Copy Markdown
Collaborator

Looks like there's a merge conflict that needs resolution on this PR @turbomam?

@turbomam
Copy link
Copy Markdown
Member Author

turbomam commented Jun 3, 2025

@github-actions

This comment was marked as outdated.

@turbomam turbomam force-pushed the 969-cur_land_use-was-an-enum-before-linkml branch from 9b697aa to 21588f5 Compare March 10, 2026 23:41
Copilot AI review requested due to automatic review settings March 10, 2026 23:41
@github-actions
Copy link
Copy Markdown
Contributor

LinkML Linting Results

Summary

Count
Schemas Checked 1
Schemas with Error 0
Schemas with Warning 0
Total Errors 0
Total Warnings 0

Copy link
Copy Markdown
Contributor

Copilot AI left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Pull request overview

This PR updates the MixS LinkML schema to model current land use as a proper enumeration, aiming to use URL-safe tokens for land-use values.

Changes:

  • Added a new CurLandUseEnum with hierarchical is_a relationships for land-use categories.
  • Updated the cur_land_use slot to use range: CurLandUseEnum and removed the prior string_serialization-based enumeration description.

💡 Add Copilot custom instructions for smarter, more guided reviews. Learn how to get started.

Comment on lines +124 to +127
cypress:
is_a: conifers
crop trees:
christmas trees:
Copy link

Copilot AI Mar 10, 2026

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

PR title says the current land use enum is using URL-safe characters, but many newly added permissible values still contain spaces (e.g., "crop trees", "industrial areas", "permanent snow or ice"). If these values are intended to be URL-safe identifiers, consider normalizing them (e.g., kebab/snake case) and using aliases/title for human-readable labels; otherwise the PR title/intent should be updated to match the actual change (only removing /).

Copilot uses AI. Check for mistakes.
Copy link
Copy Markdown
Member Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

The PR scope is specifically replacing / characters with - for URL safety — the spaces in other permissible values are a separate concern and consistent with the existing MIxS convention for enum labels.

mines/quarries → mines-quarries and roads/railroads → roads-railroads
were renamed for URL safety but the old forms need aliases so existing
data remains valid.
@github-actions
Copy link
Copy Markdown
Contributor

LinkML Linting Results

Summary

Count
Schemas Checked 1
Schemas with Error 0
Schemas with Warning 0
Total Errors 0
Total Warnings 0

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

cur_land_use was an enum before LinkML

5 participants