Extend functionality of Wandb Config Diff script #687

kyleclo · 2024-08-02T09:41:43Z

Add tests for flatten_dict() in utils
Extend functionality for flatten_dict() to also flatten any dicts that exist in Lists
Extend the wandb config comparison script to use the extended flatten_dict()

Motivation is, while comparing configs, the current implementation doesn't perform comparison of some key aspects of the configs, namely config keys representing dataset paths (which are all List[str]) as well as keys like config["evaluators.value"] which are List[Dict].

The current behavior looks something like this:

where we can see that the fields data.value.paths and evaluators.value aren't easily comparable.

The new behavior looks like this:

where it preserves behavior of original script under old keys, but performs side by side comparison of list elements also.

The downside, of course, is with a lot of dataset paths, these config diffs can become quite long to sift through.

…nt compare wandb config script to also flatten list dicts

dirkgr · 2024-08-02T14:23:54Z

olmo/util.py

+                    new_list.append(
+                        flatten_dict(
+                            v,
+                            parent_key=root,


The normal way of doing this is actually to treat a List as a Mapping[int, Any]. So the key becomes something like "foo.bar.0" and "foo.bar.1", etc. Then you don't need the extra root parameter either.

dirkgr · 2024-08-02T14:27:21Z

scripts/compare_wandb_configs.py

+    }
+    if len(keys_with_differences) > 0:
+        for k in sorted(keys_with_differences):
+            if isinstance(left_config[k], list) and isinstance(right_config[k], list):


You also don't need this if you treat lists as Mapping[int, Any]. And it will work right even if the list entries are complex. On the other hand, the output will look different / be less compact.

add test for flatten dict; extend flatten dict to handle lists; augme…

027bcc3

…nt compare wandb config script to also flatten list dicts

kyleclo requested a review from dirkgr August 2, 2024 09:41

dirkgr requested changes Aug 2, 2024

View reviewed changes

kyleclo added 3 commits August 7, 2024 11:14

new flatten

792de79

Merge branch 'main' into kylel/config-diff

62c4d1d

Merge branch 'main' into kylel/config-diff

f824795

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Extend functionality of Wandb Config Diff script #687

Extend functionality of Wandb Config Diff script #687

kyleclo commented Aug 2, 2024

dirkgr Aug 2, 2024

dirkgr Aug 2, 2024

Extend functionality of Wandb Config Diff script #687

Are you sure you want to change the base?

Extend functionality of Wandb Config Diff script #687

Conversation

kyleclo commented Aug 2, 2024

dirkgr Aug 2, 2024

Choose a reason for hiding this comment

dirkgr Aug 2, 2024

Choose a reason for hiding this comment