Possible bug in AdaptivePool #641
Not sure if I'm missing something here; I just stumbled upon this while trying to convert an Equinox model to PyTorch. I tried going over AdaptivePool but couldn't really figure out the problem. The same thing happens when using 2d pooling, by the way. Happy to make a PR fixing the issue with some guidance. The following code seems to fail:
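(A minimal reconstruction of that kind of failing comparison; the original snippet isn't shown here, so the shapes, values, and target size below are illustrative assumptions.)

```python
import numpy as np
import torch
import jax.numpy as jnp
import equinox as eqx

# One channel, ten elements: easy to eyeball the pooled averages.
x = np.arange(10.0, dtype=np.float32).reshape(1, 10)  # (channels, dim)

eqx_pool = eqx.nn.AdaptiveAvgPool1d(4)      # Equinox adaptive average pooling
torch_pool = torch.nn.AdaptiveAvgPool1d(4)  # PyTorch equivalent

eqx_out = np.asarray(eqx_pool(jnp.asarray(x)))
torch_out = torch_pool(torch.from_numpy(x)).numpy()

print(eqx_out)
print(torch_out)  # PyTorch gives [[1., 3., 6., 8.]] here
# The outputs differ because the two libraries choose different pooling
# regions, as discussed in the comments below.
```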
I think this is expected: the adaptive part of our pooling works in a slightly different way, as it was more computationally efficient for us to do it that way. (Adaptive pooling was added back in #129, although I don't see any discussion of PyTorch consistency there.) Tagging @paganpasta in case they recall any other information. Perhaps we should simply document this discrepancy?
Going through the corresponding issue (#121), it looks like we implemented adaptive pooling differently from PyTorch in favor of a (potential) speed-up. The main difference is how the pooling regions are carved out of the original tensor. Hope this helps. FWIW, I did not see a "significant" difference in the performance of (classification) models converted from PyTorch.
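For reference, below is the rule I believe PyTorch uses to pick its adaptive pooling regions (region i spans [floor(i*L/out), ceil((i+1)*L/out))), shown next to one possible fixed-size carving. The second function is purely illustrative; it is not Equinox's actual implementation:

```python
def torch_style_regions(length: int, out: int):
    # PyTorch (I believe): region i covers
    # [floor(i * L / out), ceil((i + 1) * L / out)).
    return [(i * length // out, -(-((i + 1) * length) // out)) for i in range(out)]

def uniform_regions(length: int, out: int):
    # One way to carve fixed-size regions instead: a single kernel size and
    # stride shared by every region. Illustrative only.
    kernel = -(-length // out)  # ceil(L / out)
    stride = max(1, (length - kernel) // max(1, out - 1))
    return [(i * stride, i * stride + kernel) for i in range(out)]

print(torch_style_regions(10, 4))  # [(0, 3), (2, 5), (5, 8), (7, 10)]: variable sizes
print(uniform_regions(10, 4))      # [(0, 3), (2, 5), (4, 7), (6, 9)]: one size, one stride
```

Averaging over differently chosen regions is what produces the mismatched outputs, even though both are reasonable "adaptive" pools.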
Right! Thank you @paganpasta :) Regarding performance: I think we achieved performance similar to PyTorch's precisely because we implemented it slightly differently; if we'd matched PyTorch exactly (i.e. chosen different pooling regions), then we'd have ended up slightly slower. If it does turn out to be possible to change things to match PyTorch without a significant speed difference, then I'd be happy to take a PR on that. Failing that, I'd suggest we document this difference and accept that we do things slightly differently.
@paganpasta yeah, I got similar performance for both models (the PyTorch one vs the Equinox one). I just tried porting my model to PyTorch (collaborators still use torch) and a sanity test of getting the same output failed. IMHO a sentence in the docs would save lost souls a couple of debugging hours, and should be more than enough.
Any idea how to get the same output as the PyTorch model, though? Can you confirm whether this issue is present in all the different pool layers as well? I just tried MaxPool1d (without the adaptive part) and noticed different outputs for PyTorch and Equinox. Also, which other layers might behave differently from PyTorch due to implementation differences?
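For what it's worth, a quick way to check the non-adaptive case directly is sketched below. Note that the stride is set explicitly on both sides, since I believe the defaults differ (PyTorch's MaxPool1d defaults stride to kernel_size, while Equinox's pooling layers default it to 1; treat that as an assumption to verify):

```python
import numpy as np
import torch
import jax.numpy as jnp
import equinox as eqx

x = np.random.default_rng(0).standard_normal((1, 12)).astype(np.float32)  # (channels, dim)

# Set kernel_size and stride explicitly in both libraries so the windows match.
eqx_pool = eqx.nn.MaxPool1d(kernel_size=3, stride=3)
torch_pool = torch.nn.MaxPool1d(kernel_size=3, stride=3)

eqx_out = np.asarray(eqx_pool(jnp.asarray(x)))
torch_out = torch_pool(torch.from_numpy(x)).numpy()

# Max pooling over identical windows should agree up to floating point.
print(np.allclose(eqx_out, torch_out, atol=1e-6))
```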
I'm afraid getting precise compatibility between the frameworks is pretty hard. Even for the same algorithm, one sometimes gets meaningful differences in output just due to differences in the underlying compilers etc. I think adaptive pooling and batch norm are the two places where we do something algorithmically different, although we do have a PR out to change the latter; see #675. (Which I really need to review...) For something like max pool I thought we were the same; there's only one way to compute a maximum, after all. In general we don't tend to diverge from PyTorch without a good reason, so I think most other layers should be compatible. FWIW, I translate models PyTorch->Equinox (in my non-open-source, actually-paid work!) and generally observe the same behaviour between the two libraries. When this is important, I make sure to include that comparison as part of the tests for my Equinox implementation.
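A minimal sketch of that kind of parity test, copying parameters from a PyTorch layer into an Equinox one and comparing outputs within a tolerance; the Linear layer and the tolerances here are illustrative choices:

```python
import numpy as np
import torch
import jax.numpy as jnp
import jax.random as jr
import equinox as eqx

torch_lin = torch.nn.Linear(4, 3)
eqx_lin = eqx.nn.Linear(4, 3, key=jr.PRNGKey(0))

# Overwrite the Equinox parameters with the PyTorch ones so the two layers
# represent the same function.
eqx_lin = eqx.tree_at(
    lambda m: (m.weight, m.bias),
    eqx_lin,
    (
        jnp.asarray(torch_lin.weight.detach().numpy()),
        jnp.asarray(torch_lin.bias.detach().numpy()),
    ),
)

x = np.random.default_rng(0).standard_normal(4).astype(np.float32)

# Compare within a tolerance rather than expecting bit-identical outputs;
# compiler differences alone can produce small discrepancies.
np.testing.assert_allclose(
    np.asarray(eqx_lin(jnp.asarray(x))),
    torch_lin(torch.from_numpy(x)).detach().numpy(),
    rtol=1e-5,
    atol=1e-6,
)
```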