Pinned Loading
-
-
-
Transformer-Sandbox
Transformer-Sandbox PublicForked from karpathy/nanoGPT
Investigation of parallelized Transformer architectures and other sequence-to-sequence generative models.
Python 1
-
hippo-s4-mamba-operator-dynamics
hippo-s4-mamba-operator-dynamics PublicForked from state-spaces/mamba
Investigating alternative methods for defining operator dynamics in s4/mamba; examining the effect on trainability.
Python
Something went wrong, please refresh the page to try again.
If the problem persists, check the GitHub status page or contact support.
If the problem persists, check the GitHub status page or contact support.