Optimize CDF Calculation and Convert NumPy Arrays to Tensors in Benchmark #399

yalsaffar · 2024-10-09T20:55:07Z

PR Description

This PR addresses the first step in making AEPsych's functions consistently return PyTorch tensors and expect tensors as input, improving compatibility with GPUs and reducing redundant conversions between NumPy arrays and PyTorch tensors(partially solving #365).

Key changes include:

Conversion of np.arrays to tensors in the following files:
- aepsych/models/base.py:
  - Refactored the p_below_threshold method to operate fully with PyTorch tensors.
  - Replaced norm.cdf() with torch.distributions.Normal(0, 1).cdf() for better GPU compatibility.
- aepsych/benchmark/problem.py:
  - Significant changes made to ensure consistent use of tensors across the pipeline.
  - The result of f_threshold() now directly returns a PyTorch tensor, ensuring consistency.
  - Additionally, used detach().cpu().numpy() in places where the super().evaluate() method returns float values, ensuring compatibility.
Updates in aepsych/tests/test_benchmark.py:
- Migrated all operations from NumPy to PyTorch.
- This includes calculations for Brier score and misclassification error, now utilizing torch.mean(), torch.square(), torch.isclose(), and torch.all() to fully align with tensor operations.

Stability:

All test cases have passed successfully in the workflow.

…Tensor

facebook-github-bot · 2024-10-11T15:57:22Z

@JasonKChow has imported this pull request. If you are a Meta employee, you can view this diff on Phabricator.

JasonKChow

There's still some functions that return numpy arrays (despite the documentation specifically saying otherwise). Example: in f_true method of Problem class.

Some typehints are wrong, example f_true method again.

The evaluate method can be rewritten entirely to use torch. Replace pearsonr with corrcoef for torch, you'll need to reformat the data.

aepsych/models/base.py

yalsaffar · 2024-10-14T13:00:54Z

Thank you, @JasonKChow, for the valuable feedback!

I’ve made the following updates:

Updated the evaluate method to fully utilize PyTorch, replacing pearsonr with corrcoef and reformatted the data.
Fixed the f_true method to ensure it works with tensors and corrected the type hints accordingly.
The only method still using np.array in Problem.py is sample_y(). Changing this would require significant modifications to the Strategy class, which I plan to address in the next PR!

Please let me know if there is anything I need to change!

facebook-github-bot · 2024-10-14T21:28:36Z

@JasonKChow has imported this pull request. If you are a Meta employee, you can view this diff on Phabricator.

JasonKChow · 2024-10-14T22:31:01Z

aepsych/benchmark/problem.py

@@ -233,12 +234,13 @@ def f_threshold(self, model=None) -> torch.Tensor:
            inverse_torch = model.likelihood.objective.inverse

            def inverse_link(x):
-                return inverse_torch(torch.tensor(x))
+                return inverse_torch(torch.tensor(x).clone().detach())


Why is this cloned and detached? Also why is it converted to tensor if we already assume that self.thresholds will be a tensor?

You're right, there's no need to convert it again, and cloning and detaching are also unnecessary. I'll remove those, test it, and commit the changes.

facebook-github-bot · 2024-10-14T23:28:55Z

@JasonKChow has imported this pull request. If you are a Meta employee, you can view this diff on Phabricator.

facebook-github-bot · 2024-10-15T17:35:49Z

@JasonKChow merged this pull request in 45d8e2d.

yalsaffar added 14 commits October 8, 2024 17:17

modifiy cdf/other calculations to work with torch in APEsychMixin class

49c4a92

update ModelProtocol class with torch.Tensor

3155a8f

update MultipleLSETestCase.unvectorized_p_below_threshold with torch.…

efa3d0c

…Tensor

adding workflow for this branch

1b2609f

matching the evaluate as it returns floats for now

27721c0

matching the evaluate as it returns floats for now 2

50abc99

remove workflow for this branch

cf73773

updating f_threshold to work with icdf instead of norm.ppf

c5f2d5d

remove workflow for this branch

dcfb0e8

update f_threshold with icdf

45db03b

update p() in problem and remove casting tensors in test_benchmark.py

3bb4e8b

fix smaller bugs related to tensor opreations

d3eaeba

fix smaller bugs related to tensor opreations in sample_y method

fcd719f

remove workflow for the branch

b3c5088

facebook-github-bot added the CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed. label Oct 9, 2024

JasonKChow requested changes Oct 11, 2024

View reviewed changes

aepsych/models/base.py Outdated Show resolved Hide resolved

refactoring evaluate and f_true in Problem class to work with Tensors

5d0521e

fix linter issue in evaluate method

220fbb2

JasonKChow reviewed Oct 14, 2024

View reviewed changes

fix redundant use of torch.tensor() in f_threshold method

0dc76fb

JasonKChow approved these changes Oct 14, 2024

View reviewed changes

facebook-github-bot closed this in 45d8e2d Oct 15, 2024

facebook-github-bot added the Merged label Oct 15, 2024

yalsaffar deleted the optimize-torch-cdf branch October 16, 2024 14:44

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Optimize CDF Calculation and Convert NumPy Arrays to Tensors in Benchmark #399

Optimize CDF Calculation and Convert NumPy Arrays to Tensors in Benchmark #399

yalsaffar commented Oct 9, 2024

facebook-github-bot commented Oct 11, 2024

JasonKChow left a comment

yalsaffar commented Oct 14, 2024

facebook-github-bot commented Oct 14, 2024

JasonKChow Oct 14, 2024

yalsaffar Oct 14, 2024

facebook-github-bot commented Oct 14, 2024

facebook-github-bot commented Oct 15, 2024

Optimize CDF Calculation and Convert NumPy Arrays to Tensors in Benchmark #399

Optimize CDF Calculation and Convert NumPy Arrays to Tensors in Benchmark #399

Conversation

yalsaffar commented Oct 9, 2024

PR Description

Key changes include:

Stability:

facebook-github-bot commented Oct 11, 2024

JasonKChow left a comment

Choose a reason for hiding this comment

yalsaffar commented Oct 14, 2024

facebook-github-bot commented Oct 14, 2024

JasonKChow Oct 14, 2024

Choose a reason for hiding this comment

yalsaffar Oct 14, 2024

Choose a reason for hiding this comment

facebook-github-bot commented Oct 14, 2024

facebook-github-bot commented Oct 15, 2024