Stars
[NeurIPS 2024🔥] DreamClear: High-Capacity Real-World Image Restoration with Privacy-Safe Dataset Curation
The official code for "ToolAlpaca: Generalized Tool Learning for Language Models with 3000 Simulated Cases"
Next generation of automated data exploratory analysis and visualization platform.
PyGWalker: Turn your pandas dataframe into an interactive UI for visual analysis
"LightRAG: Simple and Fast Retrieval-Augmented Generation"
Efficient and general syntactical decoding for Large Language Models
Use natural language to generate Amazon Athena SQL queries to fetch data.
On-device AI across mobile, embedded and edge for PyTorch
One-line data quality profiling and exploratory data analysis for Pandas and Spark DataFrames.
Self-serve BI to 10x your data team ⚡️
LLM based data scientist, AI native data application. AI-driven infinite thinking redefines BI.
This repository is intended for those looking to dive deep on advanced Text-to-SQL concepts.
Official repository for the paper “The Dawn of Natural Language to SQL: Are We Fully Ready?” (VLDB'24)
A continuously updated handbook that helps readers track the latest NL2SQL (Text2SQL) techniques in the literature and provides practical guidance for researchers and practitioners.
An efficient and effective few-shot NL2SQL method on GPT-4.
We write your reusable computer vision tools. 💜
Apache Superset is a Data Visualization and Data Exploration Platform
The official implementation of Self-Play Fine-Tuning (SPIN)
A library for efficient similarity search and clustering of dense vectors.
"Deep Generative Modeling": Introductory Examples
TAG-Bench: A benchmark for table-augmented generation (TAG)
[CVPR 2023 Best Paper Award] Planning-oriented Autonomous Driving
Transformers-compatible library for applying various compression algorithms to LLMs for optimized deployment with vLLM
📚 Tensor/CUDA Cores, 📖150+ CUDA Kernels, toy-hgemm library🔥(achieve the performance of cuBLAS 🎉🎉).
Efficient Triton Kernels for LLM Training