This is my attempt to create a self-correcting LLM, based on the paper "Training Language Models to Self-Correct via Reinforcement Learning" by Google.

sanowl/Self-Correcting-LLM--Reinforcement-Learning-

Self-Correction via Reinforcement Learning

This repository contains an implementation of the SCoRe (Self-Correction via Reinforcement Learning) system, based on the paper "Training Language Models to Self-Correct via Reinforcement Learning".
