Implementation of the CLIP4Clip model, rewritten from the official implementation

ptpyip/CLIP4Clip-v2

CLIP4Clip: An Empirical Study of CLIP for End to End Video Clip Retrieval

A modified implementation of the paper "CLIP4Clip: An Empirical Study of CLIP for End to End Video Clip Retrieval". (Official repo can be found here)

CLIP4Clip is a video-text retrieval model based on CLIP (ViT-B). This work investigates three approaches to similarity calculation: the parameter-free type, the sequential type, and the tight type. The paper reports state-of-the-art results on MSR-VTT, MSVD, LSMDC, ActivityNet, and DiDeMo.
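To make the parameter-free type concrete, here is a minimal sketch of it, which the paper describes as mean pooling over frame features followed by cosine similarity with the text feature. It uses NumPy instead of this repository's PyTorch code so that it runs on its own. The function names, argument names, and array shapes are assumptions chosen for illustration and are not taken from the repository.

```python
import numpy as np


def l2_normalize(x, axis=-1, eps=1e-12):
    """Scale vectors along `axis` to unit L2 norm (eps guards against division by zero)."""
    norm = np.linalg.norm(x, axis=axis, keepdims=True)
    return x / np.maximum(norm, eps)


def mean_pool_similarity(text_emb, frame_emb, frame_mask):
    """Hypothetical helper: parameter-free text-video similarity.

    text_emb:   (T, D) one embedding per caption
    frame_emb:  (V, F, D) one embedding per frame for each video
    frame_mask: (V, F) True for real frames, False for padding
    returns:    (T, V) cosine similarity between every caption and every video
    """
    mask = frame_mask[..., None].astype(frame_emb.dtype)  # (V, F, 1)
    summed = (frame_emb * mask).sum(axis=1)                # (V, D), padding contributes 0
    counts = np.maximum(mask.sum(axis=1), 1.0)             # (V, 1), avoid dividing by 0
    video_emb = summed / counts                            # mean over real frames only
    return l2_normalize(text_emb) @ l2_normalize(video_emb).T


# Random stand-ins for CLIP text and frame features (illustrative data only).
rng = np.random.default_rng(0)
text_emb = rng.normal(size=(2, 8))
frame_emb = rng.normal(size=(3, 4, 8))
frame_mask = np.ones((3, 4), dtype=bool)
frame_mask[2, 2:] = False  # the third video has only two real frames

sim = mean_pool_similarity(text_emb, frame_emb, frame_mask)
print(sim.shape)  # (2, 3)
```

This variant adds no learnable weights on top of CLIP, which is why it is called parameter-free. The sequential and tight types add trainable modules on top of the frame features and are not sketched here.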

[Image: CLIP4Clip.png]

Requirements

# From CLIP
conda install --yes -c pytorch pytorch=1.7.1 torchvision cudatoolkit=11.0
pip install ftfy regex tqdm
pip install opencv-python boto3 requests pandas
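
After running the commands above, a quick check such as the following can confirm that each package is importable. This is only a sketch: the helper name `missing_packages` and the `REQUIRED_MODULES` table are assumptions for illustration and are not part of this repository. The table maps each installed package to the module name it is imported under (for example, `opencv-python` is imported as `cv2`).

```python
from importlib.util import find_spec

# Package name as installed -> top-level module name used in `import`.
REQUIRED_MODULES = {
    "pytorch": "torch",
    "torchvision": "torchvision",
    "ftfy": "ftfy",
    "regex": "regex",
    "tqdm": "tqdm",
    "opencv-python": "cv2",
    "boto3": "boto3",
    "requests": "requests",
    "pandas": "pandas",
}


def missing_packages(modules=REQUIRED_MODULES):
    """Hypothetical helper: return the packages whose module cannot be found."""
    return [pkg for pkg, mod in modules.items() if find_spec(mod) is None]


if __name__ == "__main__":
    missing = missing_packages()
    if missing:
        print("Missing packages:", ", ".join(missing))
    else:
        print("All required packages are importable.")
```

The check only verifies that the modules can be located. It does not verify versions or CUDA availability.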
