Categorical Score Calibrations #589

bencap · 2025-11-24T20:24:53Z

This pull request implements a major migration of functional range data in MaveDB, moving from a legacy JSONB-based representation to a new normalized relational schema with explicit tables for ACMG classifications, functional classifications, and their associations to variants. The migration includes new Alembic migrations, a data migration script, and supporting code changes for model and enum usage.

The changes also include two related supporting changes: an improvement to the way we handle Pydantic forward references that rebuilds models dynamically upon the import of any Pydantic class and a model loading utility that simplifies route definitions when both JSON and Form data must be supported by routes with optional file uploads.

Key changes:

Database Schema Migration:

Introduced new tables: acmg_classifications, score_calibration_functional_classifications, and an association table for linking functional classifications to variants. The old functional_ranges JSONB column is renamed and then dropped after migration. [1]], [2]])
Renamed columns and added new fields to support the new schema, including a class_ column and renaming classification to functional_classification in the relevant table. ([alembic/versions/0520dfa9f2db_rename_functional_ranges_to_functional_.pyR1-R45])

Data Migration Script:

Added alembic/manual_migrations/migrate_jsonb_ranges_to_table_rows.py, a comprehensive script to migrate existing JSONB functional ranges to the new tables, including logic to create ACMG classification records, functional classification rows, and variant associations. The script also supports verification and rollback of the migration. ([alembic/manual_migrations/migrate_jsonb_ranges_to_table_rows.pyR1-R374])

Model and Enum Refactoring:

Refactored ACMG-related enums (ACMGCriterion, StrengthOfEvidenceProvided) out of src/mavedb/lib/acmg.py to their own modules, and updated imports to use the new locations. ([src/mavedb/lib/acmg.pyL1-R8])

Pydantic Model Circularity:

Ensured that model forward references are resolved upfront on module import by importing model_rebuild in src/mavedb/__init__.py. ([src/mavedb/init.pyR12-R14])

Flexible Loader for multipart/form-data

Added a library module flexible_module_loader.py with a generic dependency generator create_flexible_model_loader and convenience method json_or_form_loader. These dependency generators can be used to dynamically create parsers for routes that require support for JSON data and optional file uploads. ([src/mavedb/lib/flexible_calibration_loader.py.pyR185])

…cular dependencies Implements a centralized model rebuilding strategy for Pydantic model rebuilding. Instead of maintaing these model rebuilds in each file, we now can import circular dependencies in an `if TYPE_CHECKING:` block. The model rebuild module will then automatically handle model rebuilds, walking the view_model module and dynamically rebuilding our models based on their sub-classes. This should substantially increase ease of maintainability when adding dependent Pydantic models.

… module

… multipart form data

- Added new SQLAlchemy model `ScoreCalibrationFunctionalClassification` to represent functional classifications associated with score calibrations. - Established relationships between `ScoreCalibration` and `ScoreCalibrationFunctionalClassification`. - Created an association table for many-to-many relationships between functional classifications and variants. - Updated view models to accommodate new functional classification structures, including validation for inclusive bounds. - Enhanced tests to cover new functionality, including creation and validation of functional classifications. - Refactored existing code to ensure compatibility with new models and relationships.

… ScoreCalibration model

…ariants by score range

- Add a property `class_` to score calibration functional classifications. One of `range` or `class_` must be defined - Add validation logic to class based score ranges - Refactor lib code to support both range types - Refactor tests to support both range types TODO: Support for creating variant associations in class based score ranges.

…and adjust related tests

…ClassificationBase

…ionFunctionalClassification

…frame

- Added router functionality for validation and standardization of class based calibration files. - Added lib functionality for creation/modification of class based calibrations. - Invoked lib functionality from routers to allow client creation/modification of class based calibrations. - Introduced a new CSV file `calibration_classes.csv` containing variant URNs and their corresponding class names. - Implemented tests for creating and updating score calibrations using class-based classifications. - Enhanced existing test suite with parameterized tests to validate score calibration creation and modification. - Ensured that the response includes correct functional classifications and variant counts.

…ss validation

…ration creation and modification

…dification routes

…_pro - Allow class-based calibration to be defined via hgvs strings - Introduced new test CSV files for calibration classes based on HGVS nucleotide, HGVS protein, and URN. - Enhanced test coverage for score calibration creation and updating, including scenarios for decoding errors and validation errors. - Refactored tests to utilize parameterization for different calibration class files. - Added validation checks for index column selection in calibration dataframes. - Improved error messages for missing or invalid calibration classes.

…tion format

jstone-dev · 2025-12-26T17:33:00Z

I ran into two unexpected (I think) validation errors.

For functional classes with evidence strength, I'm seeing this:

When that error isn't present, I run into this:

bencap linked an issue Nov 24, 2025 that may be closed by this pull request

Calibrations without score ranges #538

Open

bencap requested a review from sallybg November 24, 2025 20:28

bencap mentioned this pull request Nov 26, 2025

Categorical Calibration Support VariantEffect/mavedb-ui#587

Open

bencap added 22 commits December 18, 2025 12:13

feat: rebuilt Pydantic models up front for availability within entire…

2e02b93

… module

feat: add flexible model loader for Pydantic models from JSON body or…

d1641de

… multipart form data

feat: remove deprecated functional_ranges_deprecated_json column from…

4b99dc4

… ScoreCalibration model

feat: add variants_for_functional_classification function to filter v…

a47ec9b

…ariants by score range

feat: update standardize_dataframe to accept custom standard columns …

9dafe45

…and adjust related tests

fix: update inclusive bound checks to allow None values in Functional…

605f746

…ClassificationBase

refactor: remove default values for inclusive bounds in ScoreCalibrat…

407377b

…ionFunctionalClassification

feat: add validation and standardization for calibration classes data…

3fda888

…frame

refactor: replace ValueError with ValidationError for calibration cla…

33d78cd

…ss validation

feat: add error handling for validation of class files in score calib…

be4402b

…ration creation and modification

feat: add file presence/absence checks in calibration creation and mo…

126c591

…dification routes

fixup

c4f3a3f

feat: don't allow class-based calibrations during score set creatoin

fc20f40

fix: only check resource existence for index columns

edc2058

fix: improperly renamed functional_ranges property in alembic downgrade

bd54cd2

fix: correct index column reference in validation function

969bf45

refactor: update calibrated variant effects script for new classifica…

4962f4f

…tion format

bencap force-pushed the feature/bencap/538/categorical-calibrations branch from d524416 to 4962f4f Compare December 18, 2025 20:43

fix: use functional classification enum in place of old style strings

8ee4c1a

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Categorical Score Calibrations #589

Categorical Score Calibrations #589

Uh oh!

bencap commented Nov 24, 2025 •

edited

Loading

Uh oh!

jstone-dev commented Dec 26, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Categorical Score Calibrations #589

Are you sure you want to change the base?

Categorical Score Calibrations #589

Uh oh!

Conversation

bencap commented Nov 24, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

jstone-dev commented Dec 26, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

bencap commented Nov 24, 2025 •

edited

Loading