Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

qc-duplicate-exact-synonym-no-abbrev related updates #767

Open
wants to merge 5 commits into
base: develop
Choose a base branch
from

Conversation

joeflack4
Copy link
Contributor

@joeflack4 joeflack4 commented Feb 7, 2025

Note

Overview

Further updates to scripting that prevents multiple exact synonyms from appearing on different Mondo IDs.

Most important change:

  • Now no longer filters any such conflicts from -updated templates.

Other changes:

  • Now doesn't mutate inputs. Inputs/outputs are different files.
  • Handled edge cases where if there were no conflicts of a certain type, would get error.
  • Docs for reports/sync-synonym/review-qc-duplicate-exact-synonym-no-abbrev.tsv into reports/README.md

Pre-merge checklist

Documentation

Was the documentation added/updated under docs/?

  • Yes
  • No, updates to the docs were not necessary after careful consideration

QC

Was the full pipeline run before submitting this PR using sh run.sh make build-mondo-ingest on this branch (after
docker pull obolibrary/odkfull:dev), and no errors occurred?

  • Yes
  • No, there are no functional (code-related) changes to the pipeline in the PR, so no re-run is necessary

Mini build:

Build:

New Packages

Were any new Python packages added?

Were any other non-Python packages added?

PR Review and Conversations Resolved

Has the PR been sufficiently reviewed by at least 1 team member of the Mondo Technical team and all threads resolved?

  • Yes

Additional notes

- Further updates to scripting that prevents multiple exact synonyms from appearing on different Mondo IDs.
- Bug fix: synonyms-scope-type-xref(.sparql/.tsv) was missing cls_labe. This was causing a KeyError. Not sure how the code was running successfully without this before.
- Update: No longer filtering -updated.
@joeflack4 joeflack4 requested a review from twhetzel February 7, 2025 02:25
@joeflack4 joeflack4 changed the base branch from main to develop February 7, 2025 02:25
@joeflack4 joeflack4 self-assigned this Feb 7, 2025
@joeflack4 joeflack4 added the enhancement New feature or request label Feb 7, 2025
@joeflack4 joeflack4 marked this pull request as draft February 7, 2025 03:08
- Update: Now has different paths for input/output files, instead of mutating the inputs.
- Delete: Code for backing up the inputs. No longer needed, as there are no longer mutations.
- Add: Docs for reports/sync-synonym/review-qc-duplicate-exact-synonym-no-abbrev.tsv
- Update: Finished updating parameterization regrarding change of input/output dirs
  - Note that this update also gets fixes the spurious / confusing diffs issue
- Update: Added logic to handle errors in case of no issues detected
@joeflack4 joeflack4 marked this pull request as ready for review February 8, 2025 02:57
@joeflack4 joeflack4 changed the title qc-duplicate-exact-synonym-no-abbrev related upates qc-duplicate-exact-synonym-no-abbrev related updates Feb 8, 2025
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
enhancement New feature or request stability
Projects
None yet
Development

Successfully merging this pull request may close these issues.

1 participant