Skip to content

Vectorized training#67

Open
rohanpsingh wants to merge 7 commits intomainfrom
dev/vectorized-training
Open

Vectorized training#67
rohanpsingh wants to merge 7 commits intomainfrom
dev/vectorized-training

Conversation

@rohanpsingh
Copy link
Copy Markdown
Owner

No description provided.

@rohanpsingh rohanpsingh force-pushed the dev/vectorized-training branch from 8c87618 to 7f88d88 Compare January 17, 2026 18:19
@rohanpsingh rohanpsingh force-pushed the dev/vectorized-training branch 3 times, most recently from cc8003a to 0b06db4 Compare March 7, 2026 06:03
When num_envs_per_worker > 1, the code correctly computed fewer workers
but then created n_proc workers anyway, spawning far more environments
than intended.
get_data() returns a BatchData dataclass, not a dict. The dict-style
subscript access would crash with TypeError at runtime.
The auto-reset was replacing the terminal obs with the reset obs,
causing incorrect value bootstrapping for truncated episodes. Now
stores terminal_observation in info dict and uses it for bootstrapping.
The vectorized _sample_vectorized method wasn't tracking completed
episode lengths and rewards, resulting in nan metrics during training.
@rohanpsingh rohanpsingh force-pushed the dev/vectorized-training branch from f117109 to 4c665fa Compare March 8, 2026 18:53
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant