🤒
Pinned Loading
-
-
Bitnet-C-benchmark
Bitnet-C-benchmark PublicSingle-thread, end-to-end C++ implementation of the Bitnet (1.58-bit weight) model
C++ 1
-
cornell-zhang/allo
cornell-zhang/allo PublicAllo: A Programming Model for Composable Accelerator Design
-
pytorch-labs/gpt-fast
pytorch-labs/gpt-fast PublicSimple and efficient pytorch-native transformer text generation in <1000 LOC of python.
-
-
mobiusml/hqq
mobiusml/hqq PublicOfficial implementation of Half-Quadratic Quantization (HQQ)
Something went wrong, please refresh the page to try again.
If the problem persists, check the GitHub status page or contact support.
If the problem persists, check the GitHub status page or contact support.