Skip to content

Actions: pytorch/rl

RLHF Tests on Linux

Actions

Loading...
Loading

Show workflow options

Create status badge

Loading
4,495 workflow runs
4,495 workflow runs

Filter by Event

Filter by Status

Filter by Branch

Filter by Actor

[Feature] Linearise reward transform
RLHF Tests on Linux #7160: Pull request #2681 synchronize by louisfaury
January 3, 2025 12:16 9m 25s louisfaury:lf/agg-rewards-transform
January 3, 2025 12:16 9m 25s
[Feature] multiagent data standardization: PPO advantages
RLHF Tests on Linux #7158: Pull request #2677 opened by matteobettini
December 26, 2024 15:24 11m 3s matteobettini:multiagent_norm
December 26, 2024 15:24 11m 3s
[Feature] Make PPO compatible with composite actions and log-probs
RLHF Tests on Linux #7157: Pull request #2665 synchronize by vmoens
December 20, 2024 13:32 12m 25s gh/vmoens/58/head
December 20, 2024 13:32 12m 25s
[CI] Fix conda on windows
RLHF Tests on Linux #7156: Pull request #2676 synchronize by vmoens
December 20, 2024 13:31 9m 33s fix-windows-ci
December 20, 2024 13:31 9m 33s
[CI] Fix conda on windows
RLHF Tests on Linux #7155: Pull request #2676 opened by vmoens
December 20, 2024 12:22 9m 30s fix-windows-ci
December 20, 2024 12:22 9m 30s
[Example] RNN-based policy example
RLHF Tests on Linux #7154: Commit d009835 pushed by vmoens
December 20, 2024 12:12 9m 44s main
December 20, 2024 12:12 9m 44s
[Example] RNN-based policy example
RLHF Tests on Linux #7153: Pull request #2675 opened by vmoens
December 20, 2024 12:12 1m 44s gh/vmoens/64/head
December 20, 2024 12:12 1m 44s
[BugFix] Fix batching envs with non tensor data
RLHF Tests on Linux #7152: Commit ab4250e pushed by vmoens
December 20, 2024 10:27 9m 38s main
December 20, 2024 10:27 9m 38s
[BugFix] Fix batching envs with non tensor data
RLHF Tests on Linux #7151: Pull request #2674 opened by vmoens
December 20, 2024 10:26 10m 3s gh/vmoens/64/head
December 20, 2024 10:26 10m 3s
[Performance] Accelerate slice sampler on GPU
RLHF Tests on Linux #7150: Pull request #2672 synchronize by vmoens
December 20, 2024 09:03 16m 9s gh/vmoens/62/head
December 20, 2024 09:03 16m 9s
[BugFix] Compatibility of tensordict primers with batched envs (specifically for LSTM and GRU)
RLHF Tests on Linux #7149: Pull request #2668 synchronize by vmoens
December 20, 2024 09:03 16m 18s gh/vmoens/59/head
December 20, 2024 09:03 16m 18s
[Performance] Avoid cloning trajs in SliceSampler
RLHF Tests on Linux #7148: Pull request #2671 synchronize by vmoens
December 20, 2024 09:03 9m 56s gh/vmoens/61/head
December 20, 2024 09:03 9m 56s
[BugFix] Avoid KeyError in slice sampler (for compile)
RLHF Tests on Linux #7147: Pull request #2670 synchronize by vmoens
December 20, 2024 09:03 9m 52s gh/vmoens/60/head
December 20, 2024 09:03 9m 52s
[Tutorial] Beam search with GPT models
RLHF Tests on Linux #7146: Pull request #2623 synchronize by vmoens
December 19, 2024 16:35 9m 43s gh/vmoens/47/head
December 19, 2024 16:35 9m 43s
[Tutorial] MCTS
RLHF Tests on Linux #7145: Pull request #2673 opened by vmoens
December 19, 2024 16:35 9m 38s gh/vmoens/63/head
December 19, 2024 16:35 9m 38s
[Performance] Accelerate slice sampler on GPU
RLHF Tests on Linux #7144: Pull request #2672 synchronize by vmoens
December 19, 2024 16:28 14m 26s gh/vmoens/62/head
December 19, 2024 16:28 14m 26s
[Performance] Accelerate slice sampler on GPU
RLHF Tests on Linux #7143: Pull request #2672 opened by vmoens
December 19, 2024 16:26 4m 57s gh/vmoens/62/head
December 19, 2024 16:26 4m 57s
[Performance] Avoid cloning trajs in SliceSampler
RLHF Tests on Linux #7142: Pull request #2671 opened by vmoens
December 19, 2024 16:19 9m 35s gh/vmoens/61/head
December 19, 2024 16:19 9m 35s
[BugFix] Avoid KeyError in slice sampler (for compile)
RLHF Tests on Linux #7141: Pull request #2670 opened by vmoens
December 19, 2024 15:53 9m 47s gh/vmoens/60/head
December 19, 2024 15:53 9m 47s
[BugFix] Compatibility of tensordict primers with batched envs (specifically for LSTM and GRU)
RLHF Tests on Linux #7140: Pull request #2668 synchronize by vmoens
December 19, 2024 15:03 9m 53s gh/vmoens/59/head
December 19, 2024 15:03 9m 53s
[CI] Fix nightly build
RLHF Tests on Linux #7137: Commit 133d709 pushed by vmoens
December 19, 2024 10:39 9m 23s main
December 19, 2024 10:39 9m 23s
[CI] Fix nightly build
RLHF Tests on Linux #7136: Pull request #2666 opened by vmoens
December 19, 2024 10:25 9m 32s gh/vmoens/59/head
December 19, 2024 10:25 9m 32s
[Feature] Make PPO compatible with composite actions and log-probs
RLHF Tests on Linux #7135: Pull request #2665 opened by vmoens
December 18, 2024 18:30 9m 40s gh/vmoens/58/head
December 18, 2024 18:30 9m 40s
[Feature] Log pbar rate in SOTA implementations
RLHF Tests on Linux #7134: Commit 1ce25f1 pushed by vmoens
December 18, 2024 15:31 12m 24s main
December 18, 2024 15:31 12m 24s