-
slapo Public
Forked from awslabs/slapoA schedule language for large model training
Python Apache License 2.0 UpdatedJun 14, 2024 -
cutlass Public
Forked from NVIDIA/cutlassCUDA Templates for Linear Algebra Subroutines
C++ Other UpdatedAug 15, 2023 -
CLIP Public
Forked from openai/CLIPCLIP (Contrastive Language-Image Pretraining), Predict the most relevant text snippet given an image
Jupyter Notebook MIT License UpdatedApr 24, 2023 -
evals Public
Forked from openai/evalsEvals is a framework for evaluating OpenAI models and an open-source registry of benchmarks.
Python MIT License UpdatedMar 14, 2023 -
matxscript Public
Forked from bytedance/matxscriptThe model pre- and post-processing framework
C++ Apache License 2.0 UpdatedDec 26, 2022 -
DALI Public
Forked from NVIDIA/DALIA GPU-accelerated library containing highly optimized building blocks and an execution engine for data processing to accelerate deep learning training and inference applications.
C++ Apache License 2.0 UpdatedSep 18, 2022 -
veGiantModel Public
Forked from volcengine/veGiantModelPython Apache License 2.0 UpdatedMar 23, 2022 -
pytorch Public
Forked from pytorch/pytorchTensors and Dynamic neural networks in Python with strong GPU acceleration
Python Other UpdatedDec 25, 2021 -
elpa Public
Forked from marekandreas/elpaA scalable eigensolver for dense, symmetric (hermitian) matrices (fork of https://gitlab.mpcdf.mpg.de/elpa/elpa.git)
Fortran Other UpdatedNov 30, 2021 -
builder Public
Forked from pytorch/builderContinuous builder and binary build scripts for pytorch
Shell BSD 2-Clause "Simplified" License UpdatedNov 18, 2021 -
ps-lite Public
Forked from dmlc/ps-liteA lightweight parameter server interface
-
byteps Public
Forked from bytedance/bytepsA high performance and general PS framework for distributed training
Python Other UpdatedOct 5, 2021 -
Megatron-LM Public
Forked from NVIDIA/Megatron-LMOngoing research training transformer language models at scale, including: BERT & GPT-2
Python Other UpdatedJul 16, 2021 -
ucx Public
Forked from openucx/ucxUnified Communication X (mailing list - https://elist.ornl.gov/mailman/listinfo/ucx-group)
C Other UpdatedMar 11, 2021 -
-
ucx-py Public
Forked from rapidsai/ucx-pyPython bindings for UCX
Python BSD 3-Clause "New" or "Revised" License UpdatedOct 3, 2020 -
DeepSpeed Public
Forked from microsoft/DeepSpeedDeepSpeed is a deep learning optimization library that makes distributed training easy, efficient, and effective.
Python MIT License UpdatedSep 26, 2020 -
pytorch-OpCounter Public
Forked from Lyken17/pytorch-OpCounterCount the MACs / FLOPs of your PyTorch model.
-
gossip Public
Forked from Funatiq/gossipgossip: Efficient Communication Primitives for Multi-GPU Systems
C++ MIT License UpdatedSep 3, 2020 -
HugeCTR Public
Forked from NVIDIA-Merlin/HugeCTRHugeCTR is a high efficiency GPU framework designed for Click-Through-Rate (CTR) estimating training
C++ Apache License 2.0 UpdatedAug 29, 2020 -
horovod Public
Forked from horovod/horovodDistributed training framework for TensorFlow, Keras, PyTorch, and MXNet.
C++ Other UpdatedAug 20, 2020 -
tvm Public
Forked from apache/tvmOpen deep learning compiler stack for cpu, gpu and specialized accelerators
Python Apache License 2.0 UpdatedAug 19, 2020 -
tensorflow Public
Forked from tensorflow/tensorflowAn Open Source Machine Learning Framework for Everyone
C++ Apache License 2.0 UpdatedAug 12, 2020 -
gluon-nlp Public
Forked from dmlc/gluon-nlpNLP made easy
Python Apache License 2.0 UpdatedAug 12, 2020 -
mxnet Public
Forked from apache/mxnetLightweight, Portable, Flexible Distributed/Mobile Deep Learning with Dynamic, Mutation-aware Dataflow Dep Scheduler; for Python, R, Julia, Go, Javascript and more
-
trax Public
Forked from google/traxTrax — Deep Learning with Clear Code and Speed
Python Apache License 2.0 UpdatedAug 1, 2020 -
d2l-tvm Public
Forked from d2l-ai/d2l-tvmDive into Deep Learning Compiler
Python UpdatedJul 10, 2020 -
Mini-Conf Public
Forked from Mini-Conf/Mini-ConfRun a conference from your backyard.
JavaScript MIT License UpdatedJun 23, 2020 -
-