CS Ph.D. @ SJTU | ML Research intern @ ByteDance, @ miHoYo
Shanghai Jiao Tong University
- Shanghai
Popular repositories
mixture-of-depths (Public, forked from sramshetty/mixture-of-depths)
An unofficial implementation of "Mixture-of-Depths: Dynamically allocating compute in transformer-based language models"
Python
-
Mixture-of-depths-test (Public, forked from astramind-ai/Mixture-of-depths)
Unofficial implementation for the paper "Mixture-of-Depths: Dynamically allocating compute in transformer-based language models"
Python