Mini-Uni-RLHF is a minimal, out-of-the-box annotation tool for researchers in the RLHF community, powered by Streamlit. The library contains only the core functionality of the Uni-RLHF platform and is designed with a focus on simple over easy. We recommend this version to researchers and for small tasks that require a complete workflow without heavy configuration!
- Out of the box, no heavy configuration required
- Easily scalable based on streamlit framework and universal dataset format
Roadmap:
- Enrich the documentation
- Add visual and keypoint feedback support
- Add an online RL training mode
Install dependencies in less than 30 seconds:

```bash
cd /path/to/Mini-Uni-RLHF
pip install -r requirements.txt
```
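Optionally, install into a fresh virtual environment first (standard Python tooling, not a requirement of this repository):

```bash
# create and activate an isolated environment, then install
python -m venv .venv
source .venv/bin/activate
pip install -r requirements.txt
```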
Using the test datasets:

- Download the datasets from Google Drive
- Move `0_walker_walk_medium.hdf5` to the default dataset location:

```bash
cd /path/to/Mini-Uni-RLHF
mkdir -p datasets/dataset_resource/vd4rl/walker/walker_walk_medium/
```
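Then move the downloaded file into that directory (the source path below is an assumption; adjust it to wherever your browser saved the file):

```bash
# move the downloaded test dataset into the default location
mv ~/Downloads/0_walker_walk_medium.hdf5 datasets/dataset_resource/vd4rl/walker/walker_walk_medium/
```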
😎 Enjoy the mini tool!

```bash
streamlit run main.py
```
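If the default port is already taken, you can pick another one with Streamlit's standard `--server.port` flag (plain Streamlit behavior, nothing specific to this tool):

```bash
streamlit run main.py --server.port 8502
```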
We provide a very small walker dataset at `Mini-Uni-RLHF/datasets/dataset_resource/vd4rl/walker/walker_walk_medium/0_walker_walk_medium.hdf5` for users to test. We currently support the d4rl, atari, smarts, and vd4rl domains. (TODO)
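To sanity-check the test file before annotating, you can inspect it with `h5py` (a generic HDF5 snippet; it makes no assumptions about the key names inside the file):

```python
import h5py

# print every group/dataset in the test file along with array shapes
path = "datasets/dataset_resource/vd4rl/walker/walker_walk_medium/0_walker_walk_medium.hdf5"
with h5py.File(path, "r") as f:
    f.visititems(lambda name, obj: print(name, getattr(obj, "shape", "")))
```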
To plug in a new dataset, all you need to do is write a Python file named `{$mode}_{$domain}.py` (e.g. `offline_atari`) and implement the few corresponding functions. See details in `Mini-Uni-RLHF/datasets/offline_customization_dataset.py`.
```python
class BaseOfflineDataset(object):
    def __init__(self):
        ...
    def load_offline_dataset(self):    # load the raw dataset from disk
        ...
    def get_episode_boundaries(self):  # compute the start/end index of each episode
        ...
    def sample(self):                  # sample trajectory segments to annotate
        ...
    def visualize(self):               # render a sampled segment for the annotator
        ...
    def query(self):                   # build a feedback query from the sampled segments
        ...
```
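As an illustration only (not the repository's actual implementation), a custom `{$mode}_{$domain}.py` might subclass the interface along these lines. The class name, the HDF5 key names (`terminals` in particular), and the segment-sampling logic are all assumptions for this sketch:

```python
import h5py
import numpy as np

# Hypothetical example; assumes BaseOfflineDataset (above) is in scope and that
# the dataset stores flat arrays under top-level HDF5 keys such as "terminals".
class OfflineMyDomainDataset(BaseOfflineDataset):
    def __init__(self, path):
        self.path = path
        self.data = None

    def load_offline_dataset(self):
        # read every top-level array into memory
        with h5py.File(self.path, "r") as f:
            self.data = {k: np.asarray(f[k]) for k in f.keys()}

    def get_episode_boundaries(self):
        # assumed: a boolean "terminals" array marks the last step of each episode
        ends = np.where(self.data["terminals"])[0]
        starts = np.concatenate(([0], ends[:-1] + 1))
        return list(zip(starts, ends))

    def sample(self, segment_len=50):
        # draw one fixed-length segment from a randomly chosen episode
        bounds = self.get_episode_boundaries()
        start, end = bounds[np.random.randint(len(bounds))]
        lo = np.random.randint(start, max(start + 1, end - segment_len + 2))
        return {k: v[lo:lo + segment_len] for k, v in self.data.items()}

    # visualize() and query() omitted; see
    # Mini-Uni-RLHF/datasets/offline_customization_dataset.py for the full interface.
```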
The annotation workflow has three steps:

- Create the project
- Annotate labels for the project
- Export the annotated dataset
Distributed under the MIT License. See `LICENSE.txt` for more information.
For any questions, please feel free to email [email protected].
If you find our work useful, please consider citing:
```bibtex
@inproceedings{anonymous2023unirlhf,
  title={Uni-{RLHF}: Universal Platform and Benchmark Suite for Reinforcement Learning with Diverse Human Feedback},
  author={Yuan, Yifu and Hao, Jianye and Ma, Yi and Dong, Zibin and Liang, Hebin and Liu, Jinyi and Feng, Zhixin and Zhao, Kai and Zheng, Yan},
  booktitle={The Twelfth International Conference on Learning Representations, ICLR},
  year={2024},
  url={https://openreview.net/forum?id=WesY0H9ghM},
}
```