You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Debugged this with Claude (Anthropic) over multiple sessions analyzing dmesg logs and testing configurations. The SVM thrashing pattern in the logs led us to try HSA_USE_SVM=0.
cc @lamikr (if that's the right handle for TheROCk maintainer)
reacted with thumbs up emoji reacted with thumbs down emoji reacted with laugh emoji reacted with hooray emoji reacted with confused emoji reacted with heart emoji reacted with rocket emoji reacted with eyes emoji
Uh oh!
There was an error while loading. Please reload this page.
-
Hardware
Software
The Problem
VAE decode on Hunyuan Video would trigger SVM thrashing, leading to GPU ring timeout and MODE2 reset:
The Fix
export HSA_USE_SVM=0This disables Shared Virtual Memory management that was fighting with the 128GB unified memory architecture.
Results
Full Launch Script
Notes 25 frames 480X832 (tiktok) previous on 7.9 was 48s/it
model weight dtype torch.float16, manual cast: None
model_type FLOW
Requested to load HunyuanVideo15
loaded completely; 95367431640625005117571072.00 MB usable, 15881.76 MB loaded, full load: True
93%|█████████████████████████████████████████████████████████████████████████████████████████████████ | 14/15 [05:25<00:24, 24.18s/it]
Credit
Debugged this with Claude (Anthropic) over multiple sessions analyzing dmesg logs and testing configurations. The SVM thrashing pattern in the logs led us to try HSA_USE_SVM=0.
cc @lamikr (if that's the right handle for TheROCk maintainer)
Beta Was this translation helpful? Give feedback.
All reactions