Stars
The official GitHub page for the survey paper "A Survey of Large Language Models".
Implementation of CAN: Revisiting Feature Co-Action for Click-Through RatePrediction
Awesome Deep Learning papers for industrial Search, Recommendation and Advertisement. They focus on Embedding, Matching, Ranking (CTR/CVR prediction), Post Ranking, Large Model (Generative Recommen…
mPLUG-Owl: The Powerful Multi-modal Large Language Model Family
Easy-to-use,Modular and Extendible package of deep-learning based CTR models .
推荐/广告/搜索领域工业界经典以及最前沿论文集合。A collection of industry classics and cutting-edge papers in the field of recommendation/advertising/search.
[NeurIPS 2021 Spotlight] & [IJCV 2024] SOFT: Softmax-free Transformer with Linear Complexity
The largest collection of PyTorch image encoders / backbones. Including train, eval, inference, export scripts, and pretrained weights -- ResNet, ResNeXT, EfficientNet, NFNet, Vision Transformer (V…
High-quality implementations of standard and SOTA methods on a variety of tasks.
ICDAR 2019: MaskRCNN on PubLayNet datasets. Paragraph detection, table detection, figure detection,...
Implementation of ConvMixer for "Patches Are All You Need? 🤷"
This repository contains an overview of important follow-up works based on the original Vision Transformer (ViT) by Google.
Google Research
Voice Activity Detection based on Deep Learning & TensorFlow
Must-read papers on improving efficiency for pre-trained language models.
A PyTorch-based knowledge distillation toolkit for natural language processing
MMSA is a unified framework for Multimodal Sentiment Analysis.
FlatNCE: A Novel Contrastive Representation Learning Objective
Code for ALBEF: a new vision-language pre-training method
DPT: Deformable Patch-based Transformer for Visual Recognition (ACM MM2021)
Code for paper "DeepEMD: Few-Shot Image Classification with Differentiable Earth Mover's Distance and Structured Classifiers", CVPR2020
[CVPR 2021 Best Student Paper Honorable Mention, Oral] Official PyTorch code for ClipBERT, an efficient framework for end-to-end learning on image-text and video-text tasks.
Pytorch port of Google Research's VGGish model used for extracting audio features.