Conversation
Cursor Bugbot has reviewed your changes and found 2 potential issues.
Bugbot Autofix prepared fixes for both issues found in the latest run.
- ✅ Fixed: Missing CHANGELOG entry for new config option
  - Added a CHANGELOG entry documenting `sdpa_fa4` as a new attention implementation option.
- ✅ Fixed: Unregistered attention implementation passed to HuggingFace transformers
  - Modified `get_model()` to map `sdpa_fa4` to `sdpa` when passing to `AutoConfig.from_pretrained()`, since FA4 backend activation is handled separately.
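The second fix reduces to a small name-mapping step before the HuggingFace call. A minimal sketch of that mapping (the helper name is illustrative, not taken from the diff):

```python
def attn_impl_for_hf(attn: str) -> str:
    # Map trainer-internal attention names to values HuggingFace
    # transformers recognizes. "sdpa_fa4" is shorthand for "use the
    # sdpa path, then enable the FA4 backend separately", so HF only
    # ever sees "sdpa". All other names pass through unchanged.
    return "sdpa" if attn == "sdpa_fa4" else attn
```

Keeping the mapping in one place means any future internal-only attention names can be translated at the same seam, without HF ever seeing an unregistered value.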
Or push these changes by commenting:
@cursor push e2f556bda3
Preview (e2f556bda3)
diff --git a/CHANGELOG.md b/CHANGELOG.md
--- a/CHANGELOG.md
+++ b/CHANGELOG.md
@@ -50,6 +50,7 @@
- **`orchestrator.tasks_per_minute`**: Added optional rate limiting for sandbox tasks per environment worker. Uses token bucket algorithm. If None (default), no rate limiting is applied (2026-02-02)
- **`model.cp`**: When `cp>1` with `attn="flash_attention_3"`, require `model.impl="custom"` (FA3 ring-attention kernel only in custom path) (2026-02-06)
- **`model.attn`**: Added `fa4` as an attention implementation option. Flash attention 4 is only supported with the custom implementation (#1726, 2026-02-06)
+- **`model.attn`**: Added `sdpa_fa4` as an attention implementation option. Uses PyTorch SDPA with FA4 backend (2026-03-04)
- **`inference.model.enable_prefix_caching`**: Added flag to enable prefix caching in vLLM. Passed to vLLM as `--enable-prefix-caching` (default: None) (2026-02-08)
- **`orchestrator.env.address`**: Added address field on `EnvConfig`. If set, connect to an environment server at this address; if None, spawn a server in a subprocess (2026-02-06)
- **`orchestrator.env.extra_env_kwargs`**: Added on `EnvConfig`. Extra kwargs passed to the env (e.g. seq_len, interleaved_rollouts, score_rollouts). Auto-populated by the orchestrator for training envs; generally not recommended for user override. Main use case is to match these kwargs when running an env in an isolated environment server (default: {}) (2026-02-06)
diff --git a/src/prime_rl/trainer/model.py b/src/prime_rl/trainer/model.py
--- a/src/prime_rl/trainer/model.py
+++ b/src/prime_rl/trainer/model.py
@@ -165,10 +165,11 @@
if is_vlm:
logger.info(f"Detected vision-language model: {config.name}")
+ attn_for_hf = "sdpa" if config.attn == "sdpa_fa4" else config.attn
model_config = cast(
PretrainedConfig,
AutoConfig.from_pretrained(
- config.name, attn_implementation=config.attn, trust_remote_code=config.trust_remote_code
+ config.name, attn_implementation=attn_for_hf, trust_remote_code=config.trust_remote_code
),
)
    model_config.use_cache = False

# -- Shared trainer configs (used by both SFT and RL trainers) --

-AttnImplementation: TypeAlias = Literal["sdpa", "flash_attention_2", "flash_attention_3", "fa4"]
+AttnImplementation: TypeAlias = Literal["sdpa", "sdpa_fa4", "flash_attention_2", "flash_attention_3", "fa4"]
Missing CHANGELOG entry for new config option
Low Severity
Adding "sdpa_fa4" to the AttnImplementation type alias introduces a new valid value for the model.attn config field, but CHANGELOG.md is not updated. A precedent exists at line 52 of the changelog for the earlier fa4 addition. The project rule requires a changelog entry when configuration structures or usage patterns are modified.
Triggered by project rule: BugBot Instructions
    _register_fa4_attention_interface()

+if config.attn == "sdpa_fa4":
+    _activate_sdpa_fa4_backend()
Unregistered attention implementation passed to HuggingFace transformers
High Severity
When config.attn is "sdpa_fa4", the string "sdpa_fa4" is passed as attn_implementation to AutoConfig.from_pretrained, but HuggingFace transformers doesn't recognize this value. Unlike "fa4", which has _register_fa4_attention_interface() to register a dummy implementation and flash_attention_4_only_with_custom_impl to restrict it to the custom impl, "sdpa_fa4" has neither a registration nor an impl restriction. This will cause model loading to fail.
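The failure mode here is a registry lookup: transformers validates `attn_implementation` against the set of registered names and rejects unknown ones, which is why registering even a dummy implementation (as the `fa4` path does) makes a name valid. A hypothetical, self-contained sketch of that pattern (names and internals are illustrative, not the actual transformers code):

```python
# Hypothetical name-validated attention registry, mirroring the pattern
# that makes an unregistered "sdpa_fa4" fail at model-load time.
ATTN_REGISTRY: dict[str, object] = {
    "sdpa": object(),
    "flash_attention_2": object(),
}

def register_attention(name: str, impl: object) -> None:
    # Registering an implementation (even a dummy) makes the name valid,
    # analogous to what _register_fa4_attention_interface() does for "fa4".
    ATTN_REGISTRY[name] = impl

def resolve_attention(name: str) -> object:
    # Unknown names are rejected outright rather than silently ignored.
    if name not in ATTN_REGISTRY:
        raise ValueError(f"Unknown attn_implementation: {name!r}")
    return ATTN_REGISTRY[name]
```

Under this model the two possible fixes are symmetric: either register "sdpa_fa4" before model construction, or map it to an already-registered name ("sdpa") as the prepared fix does.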



Note
Cursor Bugbot is generating a summary for commit cae5038.