Fix save_model to include fine-tuning head weights by LiudengZhang · Pull Request #371 · helicalAI/helical

LiudengZhang · 2026-04-24T23:29:44Z

Summary

save_model() currently calls torch.save(self.model.state_dict(), path), which only persists backbone weights and silently drops the fine_tuning_head
Users who fine-tune, save, and reload get random head weights instead of trained ones
Affects all 7 model types (Geneformer, scGPT, UCE, HyenaDNA, Caduceus, HelixmRNA, Mamba2mRNA)
Changed to self.state_dict() which includes both backbone and head
Updated load_model to auto-detect three checkpoint formats for backward compatibility (full, backbone-only, legacy pickle)

Test plan

Fine-tune a model, save, reload — verify head weights are preserved
Load a v2.0.0 backbone-only checkpoint — verify warning is logged and backbone loads correctly
Load a pre-v2.0.0 legacy pickle checkpoint — verify backward compatibility

🤖 Generated with Claude Code

dmiv-helical · 2026-04-27T09:41:12Z

Hi @LiudengZhang and welcome to the community!

Please open this PR against the main branch.
Also, this item in your Test plan:

Fine-tune a model, save, reload — verify head weights are preserved

is better as a unit test.

save_model only persisted self.model.state_dict(), silently discarding the trained fine-tuning head (ClassificationHead / RegressionHead) weights. Switch to self.state_dict() so both the backbone and the head are saved. load_model now auto-detects the checkpoint format: - full checkpoint (model + head keys) -> self.load_state_dict() - backbone-only (v2.0.0 checkpoint) -> strict=False, warn - legacy pickle (pre-v2.0.0) -> extract & load backbone

LiudengZhang · 2026-04-28T01:22:07Z

Thanks for the welcome and the feedback! I've retargeted the PR to main and added a unit test that verifies fine-tuning head weights survive save/reload (sets all head params to a sentinel value, saves, loads into a fresh model, and asserts equality). Let me know if anything else needs adjusting.

bputzeys · 2026-04-28T07:38:28Z

Thanks for this PR! Good one :)

…g head (#372) * Merge pull request #371 from LiudengZhang/fix/save-load-fine-tuning-head Fix save_model to include fine-tuning head weights * Bump version from 2.0.1 to 2.0.2 --------- Co-authored-by: LiudengZhang <99156394+LiudengZhang@users.noreply.github.com>

LiudengZhang · 2026-04-28T17:37:03Z

Thanks @bputzeys and @dmiv-helical for the reviews! Good catch on the comment — noted for next time.

LiudengZhang added 2 commits April 27, 2026 20:07

Add unit test for save/load fine-tuning head weights

1235729

LiudengZhang force-pushed the fix/save-load-fine-tuning-head branch from 28e5ba6 to 1235729 Compare April 28, 2026 01:10

LiudengZhang changed the base branch from release to main April 28, 2026 01:10

bputzeys reviewed Apr 28, 2026

View reviewed changes

Comment thread ci/tests/test_geneformer/test_fine_tuning.py Outdated

Update ci/tests/test_geneformer/test_fine_tuning.py

6d10497

bputzeys approved these changes Apr 28, 2026

View reviewed changes

dmiv-helical approved these changes Apr 28, 2026

View reviewed changes

bputzeys merged commit f133d5a into helicalAI:main Apr 28, 2026
4 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Fix save_model to include fine-tuning head weights#371

Fix save_model to include fine-tuning head weights#371
bputzeys merged 3 commits into
helicalAI:mainfrom
LiudengZhang:fix/save-load-fine-tuning-head

LiudengZhang commented Apr 24, 2026

Uh oh!

dmiv-helical commented Apr 27, 2026

Uh oh!

LiudengZhang commented Apr 28, 2026

Uh oh!

Uh oh!

bputzeys commented Apr 28, 2026

Uh oh!

Uh oh!

LiudengZhang commented Apr 28, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Conversation

LiudengZhang commented Apr 24, 2026

Summary

Test plan

Uh oh!

dmiv-helical commented Apr 27, 2026

Uh oh!

LiudengZhang commented Apr 28, 2026

Uh oh!

Uh oh!

bputzeys commented Apr 28, 2026

Uh oh!

Uh oh!

LiudengZhang commented Apr 28, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants