Reward timeout #97

Open: wants to merge 2 commits into main

@wensun (Collaborator) commented Jun 23, 2025

Added a timeout in rlvr_utils.py so that functions like parse_latex do not get stuck on badly formatted answers.
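For readers unfamiliar with the pattern, here is a minimal sketch of a signal.alarm-based timeout guard. It is not the code from this PR: the names RewardTimeoutError, time_limit, safe_reward, and slow_parser are hypothetical, and the sketch assumes a Unix system with the call made from the main thread.

```python
import signal
import time
from contextlib import contextmanager


class RewardTimeoutError(Exception):
    """Raised when the guarded block exceeds its time budget (hypothetical name)."""


@contextmanager
def time_limit(seconds: int):
    """Abort the enclosed block after `seconds` via SIGALRM (Unix, main thread only)."""

    def _on_alarm(signum, frame):
        raise RewardTimeoutError(f"timed out after {seconds}s")

    # Install our handler and remember the previous one so it can be restored.
    previous = signal.signal(signal.SIGALRM, _on_alarm)
    signal.alarm(seconds)  # signal.alarm accepts whole seconds only
    try:
        yield
    finally:
        signal.alarm(0)  # cancel any pending alarm
        signal.signal(signal.SIGALRM, previous)


def safe_reward(fn, answer, timeout_s: int = 1, default=0.0):
    """Call fn(answer); fall back to `default` if it does not finish in time."""
    try:
        with time_limit(timeout_s):
            return fn(answer)
    except RewardTimeoutError:
        return default


def slow_parser(answer):
    """Stand-in for a parser that hangs on a malformed answer (hypothetical)."""
    time.sleep(5)
    return 1.0
```

With this sketch, safe_reward(slow_parser, "x", timeout_s=1) gives up after about one second and returns the default instead of blocking for the full five seconds.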

@bcui-db (Collaborator) left a comment

I think this looks good overall (pending pre-commit fixes), but I would like to wait for @gupta-abhay to approve as well.

@gupta-abhay (Collaborator) left a comment

I think this looks good for now.

One general concern is that signal.alarm is not thread-safe, which is the primary reason we removed it previously. We can revisit this if we hit spurious errors (such as unexpected exceptions, or evals and verifiers disagreeing with each other because of timeouts).
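If the thread-safety concern does bite, one possible alternative is to run the call in a child process and terminate it on timeout. The sketch below is only an illustration under stated assumptions, not something proposed in this PR: run_with_timeout, _worker, and slow_parser are hypothetical names, and the "fork" start method is assumed (Unix only; forking from a heavily threaded program has its own caveats, so "spawn" with a picklable top-level function may be preferable there).

```python
import multiprocessing as mp
import threading
import time


def _worker(conn, fn, arg):
    """Child-process entry point: run fn(arg) and send back the result or error."""
    try:
        conn.send(("ok", fn(arg)))
    except Exception as exc:
        conn.send(("err", repr(exc)))
    finally:
        conn.close()


def run_with_timeout(fn, arg, timeout_s: float, default=None):
    """Run fn(arg) in a child process; return `default` on timeout or failure."""
    ctx = mp.get_context("fork")  # assumption: Unix with the fork start method
    parent_conn, child_conn = ctx.Pipe(duplex=False)
    proc = ctx.Process(target=_worker, args=(child_conn, fn, arg), daemon=True)
    proc.start()
    child_conn.close()  # the parent keeps only the read end
    try:
        if parent_conn.poll(timeout_s):
            status, value = parent_conn.recv()
            return value if status == "ok" else default
        return default  # nothing arrived within the time budget
    except EOFError:
        return default  # the child exited without sending a result
    finally:
        if proc.is_alive():
            proc.terminate()  # stop a hung call instead of waiting for it
        proc.join()
        parent_conn.close()


def slow_parser(answer):
    """Stand-in for a parser that hangs on a malformed answer (hypothetical)."""
    time.sleep(5)
    return 1.0
```

Because no signal handler is installed, this can be called from any thread, for example threading.Thread(target=run_with_timeout, args=(slow_parser, "x", 0.5)). The trade-off is the per-call cost of starting a process, and the result must be picklable to travel back through the pipe.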

3 participants