wwxFromTju

Weixun Wang wwxFromTju

Make MAS(DRL) Great Again ! 🐶

323 followers · 797 following

DRL/MAS
Tianjin China
wwxfromtju.github.io

Achievements

x2 x3

Achievements

x2 x3

Organizations

Stars

openreasoner / openr

OpenR: An Open Source Framework for Advanced Reasoning with Large Language Models

Python 1,078 79 Updated Nov 19, 2024

architsharma97 / dpo-rlaif

Jupyter Notebook 90 9 Updated Jun 27, 2024

FLAIROx / JaxMARL

Multi-Agent Reinforcement Learning with JAX

Python 442 81 Updated Nov 7, 2024

Azure / MS-AMP

Microsoft Automatic Mixed Precision Library

Python 525 43 Updated Sep 29, 2024

eureka-research / Eureka

Official Repository for "Eureka: Human-Level Reward Design via Coding Large Language Models" (ICLR 2024)

Jupyter Notebook 2,841 258 Updated May 3, 2024

ray-project / llm-numbers

Numbers every LLM developer should know

4,106 141 Updated Jan 16, 2024

sotopia-lab / sotopia

Sotopia: an Open-ended Social Learning Environment (ICLR 2024 spotlight)

Python 166 20 Updated Nov 24, 2024

floodsung / LLM-with-RL-papers

A collection of LLM with RL papers

230 9 Updated Apr 24, 2024

NVlabs / easysim

A library for creating Gym environments with unified API to various physics simulators

Python 31 7 Updated Aug 16, 2022

perrin-isir / xpag

a modular reinforcement learning library with JAX agents

Python 22 5 Updated Nov 15, 2023

wwxFromTju / awesome-reinforcement-learning-lib

GitHub's code repository is all you need

328 38 Updated Mar 21, 2023

chandar-lab / RLHive

Python 100 9 Updated Feb 14, 2024

kvfrans / powderworld

Code for Powderworld: A Platform for Understanding Generalization via Rich Task Distributions

Python 63 8 Updated Aug 31, 2024

gilzamir18 / AI4U

AI4U is a plugin that allows you use the Godot Game Engine to specify agents with reinforcement learning. Non-Player Characters (NPCs) of games can be designed using ready-made components.

C# 66 11 Updated Sep 13, 2024

facebookresearch / shumai

Fast Differentiable Tensor Library in JavaScript and TypeScript with Bun + Flashlight

TypeScript 1,145 26 Updated Jul 23, 2024

tinkoff-ai / CORL

High-quality single-file implementations of SOTA Offline and Offline-to-Online RL algorithms: AWAC, BC, CQL, DT, EDAC, IQL, SAC-N, TD3+BC, LB-SAC, SPOT, Cal-QL, ReBRAC

Python 1,097 131 Updated Aug 3, 2023