Intro

This repo contains the model training code for @smartliuhw's thesis. Training uses the Hugging Face SFT trainer as the framework, together with DeepSpeed, so that it runs properly on the RTX 4090 GPU.

How to use

Install the dependency

The dependencies are listed in requirements.txt. Install them with the following command:

pip install -r requirements.txt

Modify data process file

The data processing code is in utils.py. All data should be stored with the Dataset class. The get_train_data function is the most important part; modify it according to your needs.
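As a rough illustration of the kind of change this step involves, here is a minimal sketch of a loader that turns raw JSON Lines into prompt/response records. The signature, the "question"/"answer" input fields, and the "prompt"/"response" output fields are all assumptions made for the example; the real get_train_data in utils.py may look quite different. It also assumes that "the Dataset class" refers to Hugging Face `datasets.Dataset`.

```python
import json


def get_train_data(jsonl_lines):
    """Parse JSON Lines into a list of {"prompt", "response"} records.

    Hypothetical signature: the real function in utils.py may differ.
    """
    records = []
    for line in jsonl_lines:
        line = line.strip()
        if not line:
            continue  # skip blank lines
        row = json.loads(line)
        # "question" and "answer" are assumed names for the raw fields
        records.append({"prompt": row["question"], "response": row["answer"]})
    return records


raw = [
    '{"question": "What is 2 + 2?", "answer": "4"}',
    "",
    '{"question": "Capital of France?", "answer": "Paris"}',
]
train_records = get_train_data(raw)
print(len(train_records))

# With the `datasets` package installed, the records could then be wrapped:
#   from datasets import Dataset
#   train_dataset = Dataset.from_list(train_records)
```

Keeping the parsing separate from the final wrapping step makes it easy to check the records by eye before they reach the trainer.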

Modify train file

The model training code is in train.py and uses the trl framework. Modify the args, templates, and special tokens according to your needs.
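To make "templates" and "special tokens" concrete, here is a minimal sketch of a prompt template that appends an end-of-sequence token. The template text, the `</s>` token, and the `format_example` helper are hypothetical and not taken from train.py; use the tokens your model's tokenizer actually defines. trl's SFTTrainer can take a formatting function of roughly this kind, but the expected return type has varied between trl versions, so check it against the version you have installed.

```python
# Hypothetical special token and template; replace with your model's own.
EOS_TOKEN = "</s>"
TEMPLATE = "### Instruction:\n{prompt}\n\n### Response:\n{response}{eos}"


def format_example(example):
    """Render one {"prompt", "response"} record as a single training string."""
    return TEMPLATE.format(
        prompt=example["prompt"],
        response=example["response"],
        eos=EOS_TOKEN,
    )


text = format_example({"prompt": "Say hi.", "response": "Hi!"})
print(text)
```

Ending each example with the end-of-sequence token is a common way to teach the model where a response stops.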

Run training

You can launch a training run by following the train example file, changing only a few settings for the model and the data.
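Since training relies on DeepSpeed, a launch usually needs a DeepSpeed configuration as well. The sketch below builds one in Python, assuming ZeRO stage 2 with the optimizer state offloaded to CPU; the stage, the offload choice, and the bf16 setting are assumptions, not this repo's actual settings. The "auto" values are meant to be filled in by the Hugging Face Trainer's DeepSpeed integration from the training arguments.

```python
import json

# Assumed settings: ZeRO stage 2 with optimizer state offloaded to CPU.
ds_config = {
    "bf16": {"enabled": "auto"},
    "zero_optimization": {
        "stage": 2,
        "offload_optimizer": {"device": "cpu", "pin_memory": True},
    },
    # "auto" defers to the values set in the Hugging Face training arguments
    "gradient_accumulation_steps": "auto",
    "gradient_clipping": "auto",
    "train_batch_size": "auto",
    "train_micro_batch_size_per_gpu": "auto",
}

config_text = json.dumps(ds_config, indent=2)
print(config_text)
```

The printed JSON can be saved to a file and passed to the trainer through the `deepspeed` field of the training arguments, which accepts either a file path or a dict.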

If you have any questions, feel free to ask me.
