From b6bcb7075afcf3fb77ee4717d7e768b3826ec106 Mon Sep 17 00:00:00 2001
From: huangshiyu
Date: Thu, 21 Sep 2023 11:18:31 +0800
Subject: [PATCH 1/2] update

---
 README_zh.md           | 2 +-
 examples/sb3/README.md | 2 +-
 2 files changed, 2 insertions(+), 2 deletions(-)

diff --git a/README_zh.md b/README_zh.md
index bb121e4a..aac446a1 100644
--- a/README_zh.md
+++ b/README_zh.md
@@ -52,7 +52,7 @@ OpenRL基于PyTorch进行开发,目标是为强化学习研究社区提供一
 - 支持自博弈训练
 - 支持自然语言任务(如对话任务)的强化学习训练
 - 支持[竞技场](https://openrl-docs.readthedocs.io/zh/latest/arena/index.html)功能,可以在多智能体对抗性环境中方便地对各种智能体(甚至是[及第平台](https://openrl-docs.readthedocs.io/zh/latest/arena/index.html#openrl)上提交的智能体)进行评测。
-- 支持从[Hugging Face](https://huggingface.co/)上导入模型和数据
+- 支持从[Hugging Face](https://huggingface.co/)上导入模型和数据。支持加载Hugging Face上Stable-baseline3的模型进行测试和训练。
 - 提供用户自有环境接入OpenRL的[详细教程](https://openrl-docs.readthedocs.io/zh/latest/custom_env/index.html).
 - 支持LSTM,GRU,Transformer等模型
 - 支持多种训练加速,例如:自动混合精度训练,半精度策略网络收集数据等
diff --git a/examples/sb3/README.md b/examples/sb3/README.md
index 2b77a547..6b3ac906 100644
--- a/examples/sb3/README.md
+++ b/examples/sb3/README.md
@@ -1,4 +1,4 @@
-Load and use stable-baseline3 models from huggingface.
+Load and use [stable-baseline3 models](https://huggingface.co/sb3) from huggingface.
 ## Installation

From 995f963e1f2d7dde1f32fb608a9700e7fc5b5571 Mon Sep 17 00:00:00 2001
From: huangshiyu
Date: Thu, 21 Sep 2023 11:50:49 +0800
Subject: [PATCH 2/2] update readme

---
 README.md    | 2 +-
 README_zh.md | 2 +-
 2 files changed, 2 insertions(+), 2 deletions(-)

diff --git a/README.md b/README.md
index 167402cf..f41d9ba5 100644
--- a/README.md
+++ b/README.md
@@ -61,7 +61,7 @@ Currently, the features supported by OpenRL include:
 - Support [Arena](https://openrl-docs.readthedocs.io/en/latest/arena/index.html) , which allows convenient evaluation of various agents (even submissions for [JiDi](https://openrl-docs.readthedocs.io/en/latest/arena/index.html#performing-local-evaluation-of-agents-submitted-to-the-jidi-platform-using-openrl)) in a competitive environment.
-- Importing models and datasets from [Hugging Face](https://huggingface.co/)
+- Importing models and datasets from [Hugging Face](https://huggingface.co/). Supports loading [Stable-baselines3 models from Hugging Face](https://openrl-docs.readthedocs.io/en/latest/sb3/index.html) for testing and training.
 - [Tutorial](https://openrl-docs.readthedocs.io/en/latest/custom_env/index.html) on how to integrate user-defined environments into OpenRL.
diff --git a/README_zh.md b/README_zh.md
index aac446a1..06fa09b2 100644
--- a/README_zh.md
+++ b/README_zh.md
@@ -52,7 +52,7 @@ OpenRL基于PyTorch进行开发,目标是为强化学习研究社区提供一
 - 支持自博弈训练
 - 支持自然语言任务(如对话任务)的强化学习训练
 - 支持[竞技场](https://openrl-docs.readthedocs.io/zh/latest/arena/index.html)功能,可以在多智能体对抗性环境中方便地对各种智能体(甚至是[及第平台](https://openrl-docs.readthedocs.io/zh/latest/arena/index.html#openrl)上提交的智能体)进行评测。
-- 支持从[Hugging Face](https://huggingface.co/)上导入模型和数据。支持加载Hugging Face上Stable-baseline3的模型进行测试和训练。
+- 支持从[Hugging Face](https://huggingface.co/)上导入模型和数据。支持加载Hugging Face上[Stable-baselines3的模型](https://openrl-docs.readthedocs.io/zh/latest/sb3/index.html)来进行测试和训练。
 - 提供用户自有环境接入OpenRL的[详细教程](https://openrl-docs.readthedocs.io/zh/latest/custom_env/index.html).
 - 支持LSTM,GRU,Transformer等模型
 - 支持多种训练加速,例如:自动混合精度训练,半精度策略网络收集数据等