This repo includes the code for the paper PaD: Program-aided Distillation Can Teach Small Models Reasoning Better than Chain-of-thought Fine-tuning (NAACL 2024 Long Paper).
torch >= 2.0
transformers
- Download the pre-trained models (CodeT5-small/base/large from Hugging Face); a minimal loading sketch is shown below.
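For reference, a minimal sketch of loading a CodeT5 checkpoint through the transformers library (the `Salesforce/codet5-*` names are the public Hugging Face checkpoints; pick the size you need):

```python
# Download/load a pre-trained CodeT5 checkpoint from Hugging Face.
from transformers import RobertaTokenizer, T5ForConditionalGeneration

checkpoint = "Salesforce/codet5-base"  # or "Salesforce/codet5-small" / "Salesforce/codet5-large"
tokenizer = RobertaTokenizer.from_pretrained(checkpoint)
model = T5ForConditionalGeneration.from_pretrained(checkpoint)
```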
├── Data
│   ├── GSM8K
│   │   ├── train-enhanced.json    # PaD-augmented GSM8K training data generated by gpt-3.5-turbo
│   │   └── test_add_code.json     # test data with PaD-augmented label code
│   ├── MultiArith                 # test data with PaD-augmented label code
│   ├── SVAMP                      # test data with PaD-augmented label code
│   └── ASDiv                      # test data with PaD-augmented label code
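For a quick look at the augmented data, a minimal sketch (assuming the files are plain JSON arrays; adapt the parsing if they turn out to be JSON Lines):

```python
# Quick inspection of the PaD-augmented data.
import json

with open("Data/GSM8K/train-enhanced.json", encoding="utf-8") as f:
    examples = json.load(f)

print(len(examples))   # number of augmented training examples
print(examples[0])     # one entry: question plus its PaD-augmented label code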
The data for the self-refinement task is available here.
Execute the following command to reproduce our models:
sh run_seq2seq.sh
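For orientation, a bare-bones sketch of the kind of seq2seq fine-tuning the script wraps, using the Hugging Face Trainer API. The field names `question`/`code`, the checkpoint, the output directory, and the hyperparameters are illustrative assumptions; run_seq2seq.sh remains the authoritative recipe.

```python
# Illustrative sketch only: minimal seq2seq fine-tuning of CodeT5.
import json
from datasets import Dataset
from transformers import (DataCollatorForSeq2Seq, RobertaTokenizer,
                          Seq2SeqTrainer, Seq2SeqTrainingArguments,
                          T5ForConditionalGeneration)

checkpoint = "Salesforce/codet5-base"
tokenizer = RobertaTokenizer.from_pretrained(checkpoint)
model = T5ForConditionalGeneration.from_pretrained(checkpoint)

with open("Data/GSM8K/train-enhanced.json", encoding="utf-8") as f:
    raw = json.load(f)  # assumption: a list of records with question/code fields

def preprocess(batch):
    # "question" and "code" are assumed field names; check the JSON keys first.
    enc = tokenizer(batch["question"], truncation=True, max_length=512)
    enc["labels"] = tokenizer(text_target=batch["code"],
                              truncation=True, max_length=256)["input_ids"]
    return enc

train_ds = Dataset.from_list(raw).map(preprocess, batched=True)

trainer = Seq2SeqTrainer(
    model=model,
    args=Seq2SeqTrainingArguments(
        output_dir="outputs/codet5-pad",   # hypothetical output directory
        per_device_train_batch_size=8,     # illustrative hyperparameters
        learning_rate=5e-5,
        num_train_epochs=10,
    ),
    train_dataset=train_ds,
    data_collator=DataCollatorForSeq2Seq(tokenizer, model=model),
)
trainer.train()
```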
Run the following script to generate your results:
sh run_seq2seq_test.sh
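A minimal generation sketch for a single question (`outputs/codet5-pad` is a hypothetical path to the fine-tuned checkpoint from the training step above):

```python
# Minimal inference sketch: generate a reasoning program for one question.
from transformers import RobertaTokenizer, T5ForConditionalGeneration

ckpt = "outputs/codet5-pad"
tokenizer = RobertaTokenizer.from_pretrained(ckpt)
model = T5ForConditionalGeneration.from_pretrained(ckpt)

question = ("Natalia sold clips to 48 of her friends in April, and then she sold "
            "half as many clips in May. How many clips did Natalia sell altogether "
            "in April and May?")
inputs = tokenizer(question, return_tensors="pt")
output_ids = model.generate(**inputs, max_new_tokens=256)
print(tokenizer.decode(output_ids[0], skip_special_tokens=True))  # predicted program
```

In PaD, the generated program is executed by a Python interpreter to obtain the final numerical answer, so evaluation scores the executed result rather than the raw generated text.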