Skip to content
View wwxFromTju's full-sized avatar

Organizations

@TJU-DRL-LAB

Block or report wwxFromTju

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

OpenR: An Open Source Framework for Advanced Reasoning with Large Language Models

Python 1,078 79 Updated Nov 19, 2024
Jupyter Notebook 90 9 Updated Jun 27, 2024

Multi-Agent Reinforcement Learning with JAX

Python 442 81 Updated Nov 7, 2024

Microsoft Automatic Mixed Precision Library

Python 525 43 Updated Sep 29, 2024

Official Repository for "Eureka: Human-Level Reward Design via Coding Large Language Models" (ICLR 2024)

Jupyter Notebook 2,841 258 Updated May 3, 2024

Numbers every LLM developer should know

4,106 141 Updated Jan 16, 2024

Sotopia: an Open-ended Social Learning Environment (ICLR 2024 spotlight)

Python 166 20 Updated Nov 24, 2024

A collection of LLM with RL papers

230 9 Updated Apr 24, 2024

A library for creating Gym environments with unified API to various physics simulators

Python 31 7 Updated Aug 16, 2022

a modular reinforcement learning library with JAX agents

Python 22 5 Updated Nov 15, 2023

GitHub's code repository is all you need

328 38 Updated Mar 21, 2023
Python 100 9 Updated Feb 14, 2024

Code for Powderworld: A Platform for Understanding Generalization via Rich Task Distributions

Python 63 8 Updated Aug 31, 2024

AI4U is a plugin that allows you use the Godot Game Engine to specify agents with reinforcement learning. Non-Player Characters (NPCs) of games can be designed using ready-made components.

C# 66 11 Updated Sep 13, 2024

Fast Differentiable Tensor Library in JavaScript and TypeScript with Bun + Flashlight

TypeScript 1,145 26 Updated Jul 23, 2024

High-quality single-file implementations of SOTA Offline and Offline-to-Online RL algorithms: AWAC, BC, CQL, DT, EDAC, IQL, SAC-N, TD3+BC, LB-SAC, SPOT, Cal-QL, ReBRAC

Python 1,097 131 Updated Aug 3, 2023
Python 1,264 87 Updated Sep 24, 2024
C++ 146 11 Updated Sep 14, 2022

A collection of high-quality models for the MuJoCo physics engine, curated by Google DeepMind.

Python 1,456 204 Updated Nov 23, 2024

A GTK user interface for TLP written in Python

Python 1,120 85 Updated Sep 29, 2024

🕹️ A diverse suite of scalable reinforcement learning environments in JAX

Python 645 80 Updated Nov 22, 2024

Video PreTraining (VPT): Learning to Act by Watching Unlabeled Online Videos

Python 1,358 146 Updated Jun 10, 2024

Mirror Descent Policy Optimization

Python 38 3 Updated Oct 31, 2020

Unofficial Gato: A Generalist Agent

Python 199 27 Updated Jan 14, 2024

A Python-level JIT compiler designed to make unmodified PyTorch programs faster.

Python 1,011 124 Updated Apr 17, 2024

Training and serving large-scale neural networks with auto parallelization.

Python 3,080 357 Updated Dec 9, 2023

A Multi-Task Dataset for Simulated Humanoid Control

Python 167 23 Updated Sep 13, 2024

This is the official code for the paper CodeRL: Mastering Code Generation through Pretrained Models and Deep Reinforcement Learning (NeurIPS22).

Python 499 60 Updated Sep 26, 2023

Foundation Model for MineDojo

Python 243 32 Updated Apr 2, 2023
Next