Commit

Merge pull request #748 from allenai/oeeval-ladder-testtrain
Add test and train sets to in-loop oe-eval (for ladder work)
liujch1998 authored Nov 19, 2024
2 parents 7e81a6c + ee99d57 commit 9c677c9
Showing 86 changed files with 544 additions and 1 deletion.
3 changes: 2 additions & 1 deletion CHANGELOG.md
@@ -17,6 +17,7 @@ and this project adheres to [Semantic Versioning](https://semver.org/spec/v2.0.0
- `torch.compile()` now only compiles each block, not the whole model.
- Support for `torch.compile()` with `dynamic=True`
- Resetting the `torch.compile()` after every evaluation, because evaluation messes with the compiled versions
+- Added more in-loop evaluation tasks to pick from, mostly for scaling law.


## [v0.5.1](https://github.com/allenai/OLMo/releases/tag/v0.5.1) - 2024-10-17
@@ -58,7 +59,7 @@ and this project adheres to [Semantic Versioning](https://semver.org/spec/v2.0.0
- Swapped in correct flan data mix.
- Fix bug where the attention norm, when applied before the attention block, was modifying the residual stream.
- Fixed `OLMo.from_checkpoint()` so that it correctly loads `olmo_core` and `torch_new` style checkpoints.
-- Fixed `preserve_rng_state` being incorrectly set to False when doing gradient checkpointing with dropout
+- Fixed `preserve_rng_state` being incorrectly set to False when doing gradient checkpointing with dropout


## [v0.4.0](https://github.com/allenai/OLMo/releases/tag/v0.4.0) - 2024-07-11