QOL changes for generations #166

maxmatical · 2023-11-15T18:36:45Z

append task_name for multiple tasks
fix n_tasks logic from dataset index out of bounds

Next 2 changes related to #58
4. save intermediate generations with --save_every_k_samples
5. resume generation from intermediate generations with --load_generations_intermediate_paths

Tested with HumanEval with saving every 50 samples + loading from intermediate generations:

len(intermediate_generations) = 150
should be generating 14 new samples for new_generations
curr_sample_idx = 150
number of problems for this task is 14
len(dataloader)= 14
len(code_gens) = 14

len(new_generations) = 14
len(generations) after concatenating = 164

Verified:

loading form intermediate generations generates same output as with --limit_start 150 on HumanEval
Saved generations match final generations
Eval metrics unchanged

(minor) add some typing + linting

…luation-harness into max/save-intermediate-gen

RaymondLi0

Nice work, thank you!
I am wondering whether there would be an issue with several restarts (see comment)

bigcode_eval/evaluator.py

RaymondLi0 · 2024-01-04T09:59:13Z

bigcode_eval/utils.py

@@ -332,7 +334,8 @@ def complete_code(
                    gen_token_dict,
                )
                with open(intermediate_save_generations_path, "w") as fp:
-                    json.dump(code_gens, fp)
+                    intermediate_generations.extend(code_gens)


if there are multiple saving steps, I think we'll add the same generations several times into intermediate_generations.
Also this list is also extended in evaluator.py

bigcode-evaluation-harness/bigcode_eval/evaluator.py

Line 82 in 0754793

generations.extend(new_generations)

you're right, made 2 changes in the new commit

instead of extending on intermediate_generations when saving which end up duplicating new generations, extend on a deepcopy instead which prevents duplications

instead of extending at evaluator, return intermediate_generations.extend(code_gens) in complete_code, which makes a bit more sense with this new logic. since we never mutate intermediate_generations when saving, this will return the non-duplicated generations

RaymondLi0

One small suggestion. Then good to merge! Thank you

bigcode_eval/utils.py

…intermediate-gen QOL changes for generations

maxmatical added 13 commits November 9, 2023 09:54

save intermediate res

de4a56d

fix indexing inssue w/ generate code

1667834

save gen and ref per task

b35f5d7

save intermediate code generations

e17dd86

add intermediate generations to continue generating from

0bf932e

fix indexing issues

7233729

fix out of bounds with args.limit_start

cd46f9a

pass intermediate_generations as kwarg

81c7e13

Merge branch 'main' of https://github.com/bigcode-project/bigcode-eva…

4d84525

…luation-harness into max/save-intermediate-gen

add defaults to parallel_generations

f35f9a4

add args to test

7af4088

better naming convention

9f600a3

better naming convention

b661667

maxmatical marked this pull request as ready for review November 16, 2023 17:39

maxmatical requested review from loubnabnl, Muennighoff and RaymondLi0 November 16, 2023 17:39

RaymondLi0 reviewed Jan 3, 2024

View reviewed changes

bigcode_eval/evaluator.py Show resolved Hide resolved

bigcode_eval/evaluator.py Show resolved Hide resolved

fix multiple iterations of saving intermediate outputs

b54ee65

maxmatical requested a review from RaymondLi0 January 3, 2024 16:14

maxmatical added 2 commits January 3, 2024 13:22

merge

df8ffc9

minor optimization for preventing oob errors

0754793

RaymondLi0 reviewed Jan 4, 2024

View reviewed changes

maxmatical added 2 commits January 4, 2024 10:26

fix duplication issues

8cffbfd

fix return for complete_code

88fec42

maxmatical requested a review from RaymondLi0 January 4, 2024 15:57

maxmatical added 2 commits January 4, 2024 11:14

update ci yml

96eb239

clean up variable naming

9e86dd5

RaymondLi0 approved these changes Jan 8, 2024

View reviewed changes

bigcode_eval/utils.py Outdated Show resolved Hide resolved

remove deepcopy

6b18f1e

maxmatical merged commit 199eeec into main Jan 8, 2024
1 check passed

maxmatical deleted the max/save-intermediate-gen branch January 8, 2024 15:56

phuonglvh pushed a commit to phuonglvh/bigcode-evaluation-harness that referenced this pull request Nov 15, 2024

Merge pull request bigcode-project#166 from bigcode-project/max/save-…

e33e5cd

…intermediate-gen QOL changes for generations

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

QOL changes for generations #166

QOL changes for generations #166

maxmatical commented Nov 15, 2023 •

edited

Loading

RaymondLi0 left a comment

RaymondLi0 Jan 4, 2024

maxmatical Jan 4, 2024

RaymondLi0 left a comment

QOL changes for generations #166

QOL changes for generations #166

Conversation

maxmatical commented Nov 15, 2023 • edited Loading

RaymondLi0 left a comment

Choose a reason for hiding this comment

RaymondLi0 Jan 4, 2024

Choose a reason for hiding this comment

maxmatical Jan 4, 2024

Choose a reason for hiding this comment

RaymondLi0 left a comment

Choose a reason for hiding this comment

maxmatical commented Nov 15, 2023 •

edited

Loading