Remove Config objects #30

ejnnr · 2024-03-01T07:49:50Z

Very much WIP (e.g. I haven't adjusted tests yet, abstractions are probably broken, and I'll probably also want to get rid of ScriptConfig). But I (perhaps naively) think most of the work is done, and I tested that things still work manually a bit (see e.g. notebooks/simple_demo.ipynb)

This will probably just replace #29 but I wanted to make a separate PR in case we decide to not use these changes but still use #29.

Adversarial examples are broken, I think they might be easier to fix after some bigger changes

More appropriate now to not call it `_config.py`

I think we should let the user handle this and just have big warning flags around WaNet---making sure we always do this correctly automatically seems nearly impossible so better to be explicit about that

Abstractions and tests are still very broken

I think we haven't been using these for a while

Tests all pass now; I removed one or two that aren't applicable anymore (notably checking whether WaNet loads correctly out of the box)

I think it doesn't make much sense intuitively to have them be arguments to the detector

We also want to ignore log dirs in e.g. the notebook folder

ejnnr · 2024-03-03T02:28:53Z

Mostly done now (tests and the example notebook pass, and I've removed every single config class). Also cleaned up a few other things, so this PR ended up pretty huge. I think the main thing that would be good to look at @VRehnberg is notebooks/simple_demo.ipynb and check that the interface seems fine (we can make minor improvements later).

The other thing worth checking might be how WaNet is handled---I've removed all the logic for automagically ensuring the validation set uses the same control grid, or loading from disk. Basically, I think it's really hard for us to always handle those things correctly now that we just accept arbitrary pytorch Datasets (e.g. the user could pass in their own custom dataset, which wraps a BackdoorDataset). So I think we should let the user handle this, I've added a warning to the docstring, and tried to make it somewhat harder to forget loading the control grid by having a mandatory init argument. We can also have a notebook example that shows how to handle everything correctly. Still, this remains a bit of a footgun.

Of course you're welcome to check other things too, but I don't have concrete guesses for which parts are most likely to be buggy unfortunately.

ejnnr · 2024-03-03T02:35:53Z

Also note that configs aren't saved to disk anymore, see #32 I don't think we actually need this at the moment but would definitely be nice to add back in a much simpler version

VRehnberg

The interface is nice. Two things to note:

The test_tampering.py fails for me (type mismatch when calculating the loss I believe) but it fails on main for me as well (different error) and I don't think this PR introduces the issue. Still, perhaps double check that.
I'm a bit unsure how you've been imagining that WanetBackdoor instances best be used right now (see comment).

VRehnberg · 2024-03-04T10:45:50Z

src/cupbearer/data/backdoors.py

+    Within a single process, just make sure you only initialize WanetBackdoor once
+    and then use that everywhere.


Suggested change

Within a single process, just make sure you only initialize WanetBackdoor once

and then use that everywhere.

Within a single process, just make sure you only initialize a fresh WanetBackdoor once

and then reuse its warping pattern everywhere.

Surely you'll want to have different instances here to control p_backdoor and p_noise individually for different validation sets? Or am I missing something?

I've added #34 for an example of how I'd imagine you'd be using these backdoors.

Yeah you're right of course, thanks for #34!

ejnnr · 2024-03-04T20:08:23Z

Hm, the tampering test passes for me both here and on main. Since it already fails for you on main anyway, I'll merge this PR now, but we should figure out what's going on probably. Can you paste the stack trace you're getting?

You're totally right about the need for multiple WaNet backdoor instances, I'll merge #34 or some version of it first and then merge into main.

Add convenience method to clone WanetBackdoor instance

VRehnberg · 2024-03-05T08:08:33Z

Here's the stacktrace https://gist.github.com/VRehnberg/f3e5089fc03083808d1923b3df4bbc11

VRehnberg · 2024-03-05T09:43:00Z

It's the torch.nn.functional.binary_cross_entropy that doesn't accept that the targets of data.TamperingDataset("diamonds") are boolean. MWE:

>>> import torch
>>> torch.nn.functional.binary_cross_entropy(torch.rand(10), (torch.rand(10) > 0.5).to(torch.bool))
*** RuntimeError: Found dtype Bool but expected Float
>>> torch.nn.functional.binary_cross_entropy(torch.rand(10), (torch.rand(10) > 0.5).to(torch.float32))
tensor(1.2316)

VRehnberg · 2024-03-05T10:01:05Z

The labels are boolean from the source and an additional all statement in tampering.py.

>>> datasets.load_dataset("redwoodresearch/diamonds-seed0", split="validation")["measurements"][0]
[True, True, True]

ejnnr · 2024-03-05T10:13:02Z

Oh I think I know what's going on. Your MWE errors out on CPU for me but passes on MPS. Since pytorch lightning picks MPS by default over CPU if available, the tests pass for me. Guessing that Oliver was also on MPS (or maybe it also works on CUDA).

VRehnberg · 2024-03-05T10:37:59Z

As long as it also works with floats on MPS (CUDA errors out on booleans same as CPU). I'll create a PR that adds a typecast somewhere.

ejnnr added 7 commits February 28, 2024 14:28

Export AnomalyDetector

460ff9b

Make tasks more flexible

dbae3bf

Iterating on tasks

f16b9ca

Mostly fix tests

9073a85

Adversarial examples are broken, I think they might be easier to fix after some bigger changes

[WIP] Remove configs

54c34a6

Remove unused DatasetConfigs

51e6a25

Rename task file

48f8292

More appropriate now to not call it `_config.py`

VRehnberg mentioned this pull request Mar 1, 2024

Make tasks more flexible #29

Closed

ejnnr added 14 commits March 1, 2024 20:50

WIP on removing ScriptConfig and TrainConfig

79b51ec

Remove backdoor loading/storing logic

bdd56fb

I think we should let the user handle this and just have big warning flags around WaNet---making sure we always do this correctly automatically seems nearly impossible so better to be explicit about that

Remove TrainConfig

62e618a

Abstractions and tests are still very broken

Adjust abstractions

94c54ed

Remove loggers

4c7e0c2

I think we haven't been using these for a while

Fix bugs and tests

6809a7e

Tests all pass now; I removed one or two that aren't applicable anymore (notably checking whether WaNet loads correctly out of the box)

Move save_path and max_batch_size arguments

6f0e472

I think it doesn't make much sense intuitively to have them be arguments to the detector

Remove another unused file

ae98812

Remove more unused code

31a7993

Minor improvements and remove TODOs

f0dacc5

Fix demo notebook

0267bd1

Add WaNet warning

975289e

Update gitignore

1b82635

We also want to ignore log dirs in e.g. the notebook folder

Update documentation somewhat

35220aa

ejnnr marked this pull request as ready for review March 3, 2024 02:22

ejnnr requested a review from VRehnberg March 3, 2024 02:22

Remove simple_parsing dependency

f9ab02b

ejnnr mentioned this pull request Mar 3, 2024

Re-introduce some way of auto-saving configs #32

Open

ejnnr mentioned this pull request Mar 4, 2024

Measurement tampering #33

Merged

Merge remote-tracking branch 'origin/main' into no-configs

80463e2

ejnnr and others added 2 commits March 3, 2024 18:09

Adjust tampering/LM code to no-config style

d61c676

Add convenience method to clone WanetBackdoor instance

565f456

VRehnberg requested changes Mar 4, 2024

View reviewed changes

VRehnberg mentioned this pull request Mar 4, 2024

Add convenience method to clone WanetBackdoor instance #34

Merged

ejnnr added 2 commits March 4, 2024 12:18

Minor changes to WaNet cloning

2c1b38c

Merge pull request #34 from VRehnberg/wanet-partial-clone-method

f7e9300

Add convenience method to clone WanetBackdoor instance

ejnnr merged commit 7af54ed into main Mar 4, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Remove Config objects #30

Remove Config objects #30

ejnnr commented Mar 1, 2024

ejnnr commented Mar 3, 2024

ejnnr commented Mar 3, 2024

VRehnberg left a comment

VRehnberg Mar 4, 2024

VRehnberg Mar 4, 2024

ejnnr Mar 4, 2024

ejnnr commented Mar 4, 2024

VRehnberg commented Mar 5, 2024

VRehnberg commented Mar 5, 2024

VRehnberg commented Mar 5, 2024

ejnnr commented Mar 5, 2024

VRehnberg commented Mar 5, 2024

		Within a single process, just make sure you only initialize WanetBackdoor once
		and then use that everywhere.

Remove Config objects #30

Remove Config objects #30

Conversation

ejnnr commented Mar 1, 2024

ejnnr commented Mar 3, 2024

ejnnr commented Mar 3, 2024

VRehnberg left a comment

Choose a reason for hiding this comment

VRehnberg Mar 4, 2024

Choose a reason for hiding this comment

VRehnberg Mar 4, 2024

Choose a reason for hiding this comment

ejnnr Mar 4, 2024

Choose a reason for hiding this comment

ejnnr commented Mar 4, 2024

VRehnberg commented Mar 5, 2024

VRehnberg commented Mar 5, 2024

VRehnberg commented Mar 5, 2024

ejnnr commented Mar 5, 2024

VRehnberg commented Mar 5, 2024