adds wandb loging of metrics #676

NathanHB · 2025-04-15T15:06:38Z

No description provided.

HuggingFaceDocBuilderDev · 2025-04-15T15:08:44Z

The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.

clefourrier

LGTM but you need to update the doc on logging

anton-l · 2025-04-17T12:53:51Z

src/lighteval/logging/evaluation_tracker.py

        if self.should_push_results_to_tensorboard:
            self.push_to_tensorboard(
                results=self.metrics_logger.metric_aggregated, details=self.details_logger.compiled_details
            )

+    def push_to_wandb(self, results_dict: dict, details_datasets: dict) -> None:
+        self.wandb_run.log(
+            {**results_dict["results"]},


Can we log some additional data here like checkpoint_num or consumed_tokens? For the custom X axis like so

We can probably pass it from nanotron_model, since these metrics are implementation-specific

not sure what you mean by that ? what's checkpoint_num and is consumed_tokens the total amount of tokens used as input ?

I mean just arbitrary step counters coming from somewhere else, e.g. we can implement consumed_tokens (consumed_train_samples*seq_len) in nanotron_model.py

anton-l · 2025-04-17T13:05:34Z

src/lighteval/logging/evaluation_tracker.py

+    def push_to_wandb(self, results_dict: dict, details_datasets: dict) -> None:
+        self.wandb_run.log(
+            {**results_dict["results"]},
+        )


nit: IIUC and the metrics are logged as custom|mmlu:astronomy|0/acc_norm , we can .replace(':', '/') to get custom|mmlu/astronomy|0/acc_norm.
Otherwise wandb creates tons of collapsible sections for every subset of the benchmark, instead of just one section for custom|mmlu with all the metrics inside.

NathanHB added 2 commits April 15, 2025 15:06

adds wandb login of metrics

a032302

adds wandb loging of metrics

39c84f5

NathanHB requested review from lewtun, eliebak and clefourrier April 15, 2025 15:09

clefourrier approved these changes Apr 15, 2025

View reviewed changes

NathanHB added 2 commits April 17, 2025 09:19

adds wandb loging of metrics

75481b5

adds wandb loging of metrics

01c28ff

anton-l reviewed Apr 17, 2025

View reviewed changes

NathanHB merged commit 989f5f5 into main Apr 23, 2025
5 checks passed

NathanHB added the feature/enhancement New feature/request label May 5, 2025

hynky1999 pushed a commit that referenced this pull request May 22, 2025

adds wandb loging of metrics (#676)

91244dc

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

adds wandb loging of metrics #676

adds wandb loging of metrics #676

Uh oh!

NathanHB commented Apr 15, 2025

Uh oh!

HuggingFaceDocBuilderDev commented Apr 15, 2025

Uh oh!

clefourrier left a comment

Uh oh!

anton-l Apr 17, 2025 •

edited

Loading

Uh oh!

anton-l Apr 17, 2025

Uh oh!

NathanHB Apr 17, 2025

Uh oh!

anton-l Apr 17, 2025

Uh oh!

anton-l Apr 17, 2025 •

edited

Loading

Uh oh!

Uh oh!

Uh oh!

adds wandb loging of metrics #676

adds wandb loging of metrics #676

Uh oh!

Conversation

NathanHB commented Apr 15, 2025

Uh oh!

HuggingFaceDocBuilderDev commented Apr 15, 2025

Uh oh!

clefourrier left a comment

Choose a reason for hiding this comment

Uh oh!

anton-l Apr 17, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

anton-l Apr 17, 2025

Choose a reason for hiding this comment

Uh oh!

NathanHB Apr 17, 2025

Choose a reason for hiding this comment

Uh oh!

anton-l Apr 17, 2025

Choose a reason for hiding this comment

Uh oh!

anton-l Apr 17, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

anton-l Apr 17, 2025 •

edited

Loading

anton-l Apr 17, 2025 •

edited

Loading