Skip to content
View thinvy's full-sized avatar

Block or report thinvy

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Puzzles for learning Triton

Jupyter Notebook 1,117 80 Updated Sep 25, 2024

PyTorch version of Stable Baselines, reliable implementations of reinforcement learning algorithms.

Python 9,130 1,701 Updated Nov 8, 2024

FlashInfer: Kernel Library for LLM Serving

Cuda 1,430 135 Updated Nov 14, 2024

Fork of LLVM to support AMD AIEngine processors

LLVM 106 12 Updated Nov 14, 2024
C++ 394 64 Updated Sep 5, 2024
C++ 624 73 Updated Oct 11, 2024

An open-source image signal processing (ISP) pipeline implemented by C++

C++ 130 38 Updated Oct 23, 2022

Hypersim: A Photorealistic Synthetic Dataset for Holistic Indoor Scene Understanding

Python 1,715 131 Updated Oct 11, 2024

depyf is a tool to help you understand and adapt to PyTorch compiler torch.compile.

Python 498 11 Updated Nov 4, 2024

🏞️ PicX 是一款基于 GitHub API 开发的图床工具,提供图片上传托管、生成图片链接和常用图片工具箱服务。

TypeScript 4,615 482 Updated Oct 15, 2024

Quantized Attention that achieves speedups of 2.1x and 2.7x compared to FlashAttention2 and xformers, respectively, without lossing end-to-end metrics across various models.

Python 389 16 Updated Nov 15, 2024

Helpful tools and examples for working with flex-attention

Python 466 23 Updated Oct 23, 2024

Depth Pro: Sharp Monocular Metric Depth in Less Than a Second.

Python 3,644 242 Updated Oct 5, 2024

Deep insight tensorrt, including but not limited to qat, ptq, plugin, triton_inference, cuda

C++ 12 Updated Nov 9, 2024

The Free and Open Source Cross Platform YUV Viewer with an advanced analytics toolset

C++ 1,912 376 Updated Nov 7, 2024

[CVPR 2024 Highlight] Putting the Object Back Into Video Object Segmentation

Python 728 71 Updated Nov 8, 2024

A wheel-mounted MEMS IMU-based dead reckoning system.

C++ 332 64 Updated Dec 7, 2023

CameraSDK-Cpp is a C++ library to control Insta360 cameras.

126 18 Updated Oct 23, 2024

PyTorch/TorchScript/FX compiler for NVIDIA GPUs using TensorRT

Python 2,592 350 Updated Nov 14, 2024

Compass Apache TVM is enhanced based on the Apache TVM for wide range of Neural Network (NN) models quick support, optimization and heterogeneous execution.

Python 12 3 Updated Sep 26, 2024

The official implementation of "NAS-BNN: Neural Architecture Search for Binary Neural Networks"

Python 7 Updated Aug 30, 2024

Everything in Torch Fx

Python 341 63 Updated Jun 7, 2024

A easy tool for generating Tensor Program from Torch(besd on Torch FX & TVM Relax)

Python 10 Updated Mar 24, 2023

TVM Relay IR Visualization Tool (TVM 可视化工具)

Python 58 3 Updated Aug 8, 2023

PyTorch native quantization and sparsity for training and inference

Python 1,566 169 Updated Nov 15, 2024

A pytorch quantization backend for optimum

Python 822 61 Updated Nov 12, 2024
Python 3,283 634 Updated Dec 5, 2023
Jupyter Notebook 623 47 Updated Sep 18, 2024

Training library for local feature detection and matching

Python 754 98 Updated Aug 6, 2024
Next