Organizations

@salesforce

A community hub for collecting and sharing real-world issues with LLMs and other models to help improve their capabilities.

1 Updated Oct 21, 2024

This repo contains the code and data for "VLM2Vec: Training Vision-Language Models for Massive Multimodal Embedding Tasks"

Python 63 1 Updated Nov 12, 2024

A PyTorch Native LLM Training Framework

Python 661 34 Updated Aug 25, 2024

Salesforce open-source LLMs with 8k sequence length.

Python 718 39 Updated Dec 20, 2023

Unified Controllable Visual Generation Model

Python 620 35 Updated Apr 22, 2024
Jupyter Notebook 295 22 Updated Jul 24, 2023

A deep learning library for identifying keyphrases from text

Python 25 3 Updated Aug 1, 2022
Python 131 17 Updated Jul 5, 2023

CodeGen is a family of open-source models for program synthesis. Trained on TPU-v4. Competitive with OpenAI Codex.

Python 4,933 381 Updated Mar 17, 2024

ACTER is a manually annotated dataset for term extraction, covering 3 languages (English, French, and Dutch), and 4 domains (corruption, dressage, heart failure, and wind energy).

19 2 Updated Apr 8, 2022

Automatically generate your résumé and various cover letters from YAML files.

Python 126 21 Updated Aug 14, 2024

Code to obtain the PMC-SA. A dataset for the summarization of scientific articles.

Python 6 3 Updated Mar 24, 2023

Everything you need to know for a Software Engineering interview

2,073 452 Updated Mar 7, 2023

BART summarization tool

Python 6 Updated Sep 11, 2020

Large, curated set of benchmark datasets for evaluating automatic keyphrase extraction algorithms.

Shell 142 29 Updated Jul 3, 2020
Jupyter Notebook 532 119 Updated Dec 30, 2021

🎨 ML Visuals contains figures and templates which you can reuse and customize to improve your scientific writing.

13,586 1,395 Updated Feb 13, 2023

AutoPhrase: Automated Phrase Mining from Massive Text Corpora

C++ 1,175 274 Updated Jan 27, 2022
Python 446 78 Updated Oct 26, 2022

Plot vector graphics for attention-based text visualisation

Python 367 58 Updated Apr 12, 2019

Keyphrase Generation

Jupyter Notebook 217 34 Updated Jul 22, 2023

Python Keyphrase Extraction module

Python 1,564 290 Updated Jul 12, 2023

An open-source NLP research library, built on PyTorch.

Python 11,757 2,253 Updated Nov 22, 2022
Python 3 Updated Dec 2, 2018

A collection of corpora for named entity recognition (NER) and entity recognition tasks. These annotated datasets cover a variety of languages, domains and entity types.

Python 1,506 247 Updated Jun 25, 2024

Full Python implementation of the ROUGE metric, producing the same results as the official Perl implementation.

Perl 157 25 Updated Jul 10, 2019

A Python wrapper for the ROUGE summarization evaluation package

Python 250 71 Updated Feb 10, 2021

Unsupervised Language Modeling at scale for robust sentiment classification

Python 1,062 202 Updated Jun 28, 2020

PyTorch original implementation of Cross-lingual Language Model Pretraining.

Python 2,889 498 Updated Feb 14, 2023

Facebook AI Research Sequence-to-Sequence Toolkit

Lua 3,742 616 Updated Sep 17, 2021