Skip to content
@LAMDA-RL

LAMDA-RL

We are a fork of reinforcement learning researchers from LAMDA Group @ Nanjing University.

LAMDA-RL Lab

LAMDA-RL Lab is at the forefront of advancing the field of reinforcement learning and its application to creating general decision-making intelligence, by pushing the boundaries of what's possible with RL techniques.

We focus on developing novel algorithms and architectures that enable RL systems to learn and make decisions in increasingly general and adaptable ways. Some key areas we are exploring include:

  • Imitation learning;
  • Offline reinforcement learning;
  • Model-based RL and world model learning;
  • Multi-agent and collaborative RL;
  • Planning and learning with large models.

Through both fundamental and application research, our aim is to create RL-based systems that exhibit truly intelligent and general decision-making capabilities. For more information about our lab and research, please refer to our website https://lamda-rl.nju.edu.cn/.

Pinned Loading

  1. OfflineRL-Lib OfflineRL-Lib Public

    Benchmarked implementations of Offline RL Algorithms.

    Python 63 7

  2. ODIS ODIS Public

    The implementation of ICLR-2023 paper "Discovering Generalizable Multi-agent Coordination Skills from Multi-task Offline Data".

    Python 37 5

  3. PRDC PRDC Public

    Forked from kimoyami/PRDC

    Author's PyTorch implementation of ICML'23 paper "Policy Regularization with Dataset Constraint for Offline Reinforcement Learning" for D4RL gym and AntMaze tasks.

    Python 14 3

  4. ACT ACT Public

    Official code for ACT: Empowering Decision Transformer with Dynamic Programming via Advantage Conditioning (AAAI'24)

    Python 11 3

  5. Pretrained_BWArea_2.7B_30G Pretrained_BWArea_2.7B_30G Public

    Pre-trained Models of BWArea Model

    Python 8

  6. CPR CPR Public

    Forked from LyndonKong/CPR

    Python 2

Repositories

Showing 10 of 30 repositories
  • ODIS Public

    The implementation of ICLR-2023 paper "Discovering Generalizable Multi-agent Coordination Skills from Multi-task Offline Data".

    LAMDA-RL/ODIS’s past year of commit activity
    Python 37 Apache-2.0 5 1 0 Updated Oct 31, 2024
  • OPT-AIL Public
    LAMDA-RL/OPT-AIL’s past year of commit activity
    Python 0 0 0 0 Updated Oct 19, 2024
  • Madoc Public Forked from qs1bb/Madoc
    LAMDA-RL/Madoc’s past year of commit activity
    Python 0 1 0 0 Updated Oct 9, 2024
  • Pretrained_BWArea_2.7B_30G Public

    Pre-trained Models of BWArea Model

    LAMDA-RL/Pretrained_BWArea_2.7B_30G’s past year of commit activity
    Python 8 0 0 0 Updated Sep 10, 2024
  • WiseRL Public Forked from typoverflow/WiseRL

    PyTorch implementations for Offline Preference-Based RL (PbRL) algorithms

    LAMDA-RL/WiseRL’s past year of commit activity
    Python 1 MIT 1 0 0 Updated Sep 5, 2024
  • .github Public
    LAMDA-RL/.github’s past year of commit activity
    0 0 0 0 Updated Sep 4, 2024
  • madac Public Forked from lamda-bbo/madac

    Official implementation of NeurIPS22 paper “Multi-agent Dynamic Algorithm Configuration”

    LAMDA-RL/madac’s past year of commit activity
    Python 0 Apache-2.0 7 0 0 Updated Sep 4, 2024
  • CPR Public Forked from LyndonKong/CPR
    LAMDA-RL/CPR’s past year of commit activity
    Python 2 1 0 0 Updated Sep 4, 2024
  • LAMDA-RL/unstable_baselines’s past year of commit activity
    Python 0 12 0 0 Updated Sep 4, 2024
  • UtilsRL Public
    LAMDA-RL/UtilsRL’s past year of commit activity
    Python 0 MIT 0 0 0 Updated Sep 4, 2024

Top languages

Loading…

Most used topics

Loading…