Commit b4ab08a

v0.5.2
1 parent ec98a33 commit b4ab08a

4 files changed: +39 -9 lines changed
CHANGELOG

+22
@@ -1,3 +1,25 @@
+2024.06.27(v0.5.2)
+- env: add taxi env (#799) (#807)
+- env: add ising model env (#782)
+- env: add new Flozen Lake env (#781)
+- env: optimize ppo continuous config in MuJoCo (#801)
+- env: fix masac smac config multi_agent=True bug (#791)
+- env: update/speed up pendulum ppo
+- algo: fix gtrxl compatibility bug (#796)
+- algo: fix complex obs demo for ppo pipeline (#786)
+- algo: add naive PWIL demo
+- algo: fix marl nstep td compatibility bug
+- feature: add GPU utils (#788)
+- feature: add deprecated function decorator (#778)
+- style: relax flask requirement (#811)
+- style: add new badge (hellogithub) in readme (#805)
+- style: update discord link and badge in readme (#795)
+- style: fix typo in config.py (#776)
+- style: polish rl_utils api docs
+- style: add constraint about numpy<2
+- style: polish macos platform test version to 12
+- style: polish ci python version
+
 2024.02.04(v0.5.1)
 - env: add MADDPG pettingzoo example (#774)
 - env: polish NGU Atari configs (#767)
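
The changelog entry "feature: add deprecated function decorator (#778)" does not show the decorator itself. As a rough illustration of the general technique only, a deprecation decorator can be sketched as below. The names `deprecated`, `since`, `alternative`, `old_helper` and `new_helper` are hypothetical and chosen for this example; they are not taken from DI-engine, and the code added in #778 may differ in name, signature, and behavior.

```python
import functools
import warnings


def deprecated(since: str, alternative: str = ""):
    """Hypothetical sketch: emit a DeprecationWarning on every call of the wrapped function."""

    def decorator(func):
        # Build the message once, at decoration time.
        message = f"`{func.__name__}` is deprecated since {since}."
        if alternative:
            message += f" Consider `{alternative}` instead."

        @functools.wraps(func)  # keep the original name and docstring
        def wrapper(*args, **kwargs):
            # stacklevel=2 attributes the warning to the caller, not to this wrapper.
            warnings.warn(message, category=DeprecationWarning, stacklevel=2)
            return func(*args, **kwargs)

        return wrapper

    return decorator


# Illustrative usage; both function names are made up for this sketch.
@deprecated(since="v0.5.2", alternative="new_helper")
def old_helper(x: int) -> int:
    return x + 1
```

Calling `old_helper(1)` still returns the function's normal result; the only side effect is the warning, which Python may hide by default unless warning filters are enabled (for example with `python -W default`).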

README.md

+15 -7
@@ -42,7 +42,7 @@
 </div>
 <br>
 
-Updated on 2024.02.04 DI-engine-v0.5.1
+Updated on 2024.06.27 DI-engine-v0.5.2
 
 ## Introduction to DI-engine
 
@@ -56,10 +56,13 @@ It provides **python-first** and **asynchronous-native** task and middleware abs
 - Multi-agent RL algorithms: such as QMIX, WQMIX, MAPPO, HAPPO, ACE
 - Imitation learning algorithms (BC/IRL/GAIL): such as GAIL, SQIL, Guided Cost Learning, Implicit BC
 - Offline RL algorithms: BCQ, CQL, TD3BC, Decision Transformer, EDAC, Diffuser, Decision Diffuser, SO2
-- Model-based RL algorithms: SVG, STEVE, MBPO, DDPPO, DreamerV3, MuZero
+- Model-based RL algorithms: SVG, STEVE, MBPO, DDPPO, DreamerV3
 - Exploration algorithms: HER, RND, ICM, NGU
-- LLM + RL Algorithms: PPO-max, DPO, MODPO,PromptPG
+- LLM + RL Algorithms: PPO-max, DPO, PromptPG
 - Other algorithms: such as PER, PLR, PCGrad
+- MCTS + RL algorithms: AlphaZero, MuZero, please refer to [LightZero](https://github.com/opendilab/LightZero)
+- Generative Model + RL algorithms: Diffusion-QL, QGPO, SRPO, please refer to [GenerativeRL](https://github.com/opendilab/GenerativeRL)
+
 
 **DI-engine** aims to **standardize different Decision Intelligence environments and applications**, supporting both academic research and prototype applications. Various training pipelines and customized decision AI applications are also supported:
 
@@ -72,6 +75,7 @@ It provides **python-first** and **asynchronous-native** task and middleware abs
 - [PPOxFamily](https://github.com/opendilab/PPOxFamily): PPO x Family DRL Tutorial Course
 - Real world decision AI applications
 - [DI-star](https://github.com/opendilab/DI-star): Decision AI in StarCraftII
+- [PsyDI](https://github.com/opendilab/PsyDI): Towards a Multi-Modal and Interactive Chatbot for Psychological Assessments
 - [DI-drive](https://github.com/opendilab/DI-drive): Auto-driving platform
 - [DI-sheep](https://github.com/opendilab/DI-sheep): Decision AI in 3 Tiles Game
 - [DI-smartcross](https://github.com/opendilab/DI-smartcross): Decision AI in Traffic Light Control
@@ -84,16 +88,20 @@ It provides **python-first** and **asynchronous-native** task and middleware abs
 - [DOS](https://github.com/opendilab/DOS): [CVPR 2023] ReasonNet: End-to-End Driving with Temporal and Global Reasoning
 - [LightZero](https://github.com/opendilab/LightZero): [NeurIPS 2023 Spotlight] A lightweight and efficient MCTS/AlphaZero/MuZero algorithm toolkit
 - [SO2](https://github.com/opendilab/SO2): [AAAI 2024] A Perspective of Q-value Estimation on Offline-to-Online Reinforcement Learning
-- [LMDrive](https://github.com/opendilab/LMDrive): LMDrive: Closed-Loop End-to-End Driving with Large Language Models
+- [LMDrive](https://github.com/opendilab/LMDrive): [CVPR 2024] LMDrive: Closed-Loop End-to-End Driving with Large Language Models
+- [SmartRefine](https://github.com/opendilab/SmartRefine): [CVPR 2024] SmartRefine: A Scenario-Adaptive Refinement Framework for Efficient Motion Prediction
+- [ReZero](https://github.com/opendilab/LightZero): Boosting MCTS-based Algorithms by Backward-view and Entire-buffer Reanalyze
+- [UniZero](https://github.com/opendilab/LightZero): Generalized and Efficient Planning with Scalable Latent World Models
 - Docs and Tutorials
 - [DI-engine-docs](https://github.com/opendilab/DI-engine-docs): Tutorials, best practice and the API reference.
 - [awesome-model-based-RL](https://github.com/opendilab/awesome-model-based-RL): A curated list of awesome Model-Based RL resources
 - [awesome-exploration-RL](https://github.com/opendilab/awesome-exploration-rl): A curated list of awesome exploration RL resources
 - [awesome-decision-transformer](https://github.com/opendilab/awesome-decision-transformer): A curated list of Decision Transformer resources
 - [awesome-RLHF](https://github.com/opendilab/awesome-RLHF): A curated list of reinforcement learning with human feedback resources
 - [awesome-multi-modal-reinforcement-learning](https://github.com/opendilab/awesome-multi-modal-reinforcement-learning): A curated list of Multi-Modal Reinforcement Learning resources
-- [awesome-AI-based-protein-design](https://github.com/opendilab/awesome-AI-based-protein-design): a collection of research papers for AI-based protein design
 - [awesome-diffusion-model-in-rl](https://github.com/opendilab/awesome-diffusion-model-in-rl): A curated list of Diffusion Model in RL resources
+- [awesome-ui-agents](https://github.com/opendilab/awesome-ui-agents): A curated list of of awesome UI agents resources, encompassing Web, App, OS, and beyond
+- [awesome-AI-based-protein-design](https://github.com/opendilab/awesome-AI-based-protein-design): a collection of research papers for AI-based protein design
 - [awesome-end-to-end-autonomous-driving](https://github.com/opendilab/awesome-end-to-end-autonomous-driving): A curated list of awesome End-to-End Autonomous Driving resources
 - [awesome-driving-behavior-prediction](https://github.com/opendilab/awesome-driving-behavior-prediction): A collection of research papers for Driving Behavior Prediction
 
@@ -482,8 +490,8 @@ We appreciate all the feedbacks and contributions to improve DI-engine, both alg
 
 ```latex
 @misc{ding,
-title={DI-engine: OpenDILab Decision Intelligence Engine},
-author={OpenDILab Contributors},
+title={DI-engine: A Universal AI System/Engine for Decision Intelligence},
+author={Yazhe Niu, Jingxin Xu, Yuan Pu, Yunpeng Nie, Jinouwen Zhang, Shuai Hu, Liangxuan Zhao, Ming Zhang, Yu Liu},
 publisher={GitHub},
 howpublished={\url{https://github.com/opendilab/DI-engine}},
 year={2021},

conda/meta.yaml

+1 -1
@@ -1,7 +1,7 @@
 {% set data = load_setup_py_data() %}
 package:
   name: di-engine
-  version: v0.5.1
+  version: v0.5.2
 
 source:
   path: ..

ding/__init__.py

+1 -1
@@ -1,7 +1,7 @@
 import os
 
 __TITLE__ = 'DI-engine'
-__VERSION__ = 'v0.5.1'
+__VERSION__ = 'v0.5.2'
 __DESCRIPTION__ = 'Decision AI Engine'
 __AUTHOR__ = "OpenDILab Contributors"
 __AUTHOR_EMAIL__ = "[email protected]"
