
Conversation

@albertvillanova (Member)

Set dtype default to float32.

Follow-up to:

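For illustration only: the change presumably amounts to a dataclass-field default of "float32" in the model configuration. The sketch below uses a hypothetical ModelConfig with a dtype field; the actual class and field names touched by this PR may differ.

from dataclasses import dataclass
from typing import Optional


# Hypothetical sketch of the kind of default this PR sets; the real class
# and field names in TRL may differ.
@dataclass
class ModelConfig:
    # Dtype used when loading the model weights, e.g. "float32" or "bfloat16";
    # previously unset, now defaulting to "float32".
    dtype: Optional[str] = "float32"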
@albertvillanova marked this pull request as ready for review on January 6, 2026 at 19:07.
@HuggingFaceDocBuilderDev

The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.

@qgallouedec (Member)

For the record, QLoRA with DPO will still force the model dtype to be in fp32:

from datasets import load_dataset
from peft import LoraConfig
from transformers import AutoModelForCausalLM, BitsAndBytesConfig
from trl import DPOTrainer

model = AutoModelForCausalLM.from_pretrained(
    "trl-internal-testing/tiny-Qwen2ForCausalLM-2.5",
    dtype="float32",
    quantization_config=BitsAndBytesConfig(load_in_4bit=True),
)
dataset = load_dataset("trl-internal-testing/zen", "standard_preference", split="train")

trainer = DPOTrainer(
    model=model,
    train_dataset=dataset,
    peft_config=LoraConfig(),
)
trainer.train()

But in my opinion it's fine; it will be fixed by #3906.
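A quick way to check the behavior described above (a sketch, assuming the standard trainer.model attribute exposed by the transformers Trainer) is to inspect the floating-point parameters after the trainer has wrapped the quantized model:

# Collect the dtypes of the non-quantized (floating-point) parameters after
# PEFT's k-bit preparation; per the comment above, they end up in float32.
dtypes = {p.dtype for p in trainer.model.parameters() if p.is_floating_point()}
print(dtypes)  # expected: {torch.float32}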

@albertvillanova (Member, Author)

Thanks for your review, @qgallouedec.

Although the CI was green, I see that you set the float32 dtype in some tests but not in others. I am wondering what criteria you used.

@albertvillanova merged commit 4d52e02 into huggingface:main on January 12, 2026 (9 of 10 checks passed).