ICLR 2026 paper on state-dependent reward shaping with multi-view videos and ViCLIP for reinforcement learning.
-
Updated
Mar 6, 2026 - Python
ICLR 2026 paper on state-dependent reward shaping with multi-view videos and ViCLIP for reinforcement learning.
Official implementation of SPHERE: Mitigating the Loss of Spectral Plasticity in Mixture-of-Experts for Deep Reinforcement Learning.
Add a description, image, and links to the humanoidbench topic page so that developers can more easily learn about it.
To associate your repository with the humanoidbench topic, visit your repo's landing page and select "manage topics."