`DataInterpolations` support #178

SouthEndMusic · 2024-08-20T19:32:00Z

No description provided.

SouthEndMusic · 2024-08-20T19:33:10Z

Is this the way to go with dispatching on callable structs? Or should it be done per interpolation type separately?

adrhill · 2024-08-20T21:35:01Z

Wow, this PR came impressively quickly after merging #177!

Is this the way to go with dispatching on callable structs?

I'm afraid this might just overload the constructors.
You can inspect the generated code by omitting the eval statement, e.g. calling

SCT.overload_gradient_1_to_1(:DataInterpolations, AbstractInterpolation)

which will most likely return a function similar to

function DataInterpolations.AbstractInterpolation(t::SparseConnectivityTracer.GradientTracer)
    return SparseConnectivityTracer.gradient_tracer_1_to_1(t, false)
end

which unfortunately is not what you want.

I'll try to figure out how to work around it. The simplest solution would be to add a new specialized overload_gradient_callable_1_to_1 code generation utility.

SouthEndMusic · 2024-08-21T10:13:32Z

There must be an easier way to get all the interpolation types. Also, for some reason the tests cannot find DataInterpolations.

Edit: nvm, I'll try some different things.

adrhill · 2024-08-21T11:50:55Z

I think your initial attempt was the intuitive way to do it, it just needs some work from our side.

SouthEndMusic · 2024-08-21T12:31:01Z

I think my latest approach is the best, this is also the level on which other DataInterpolation extensions operate. This still doesn't work, and I think I know why looking at the generated methods:

[6] _interpolate(tx::T, ty::T) where T<:SparseConnectivityTracer.GradientTracer
     @ SparseConnectivityTracerDataInterpolationsExt C:\Users\konin_bt\SciML\Other\SparseConnectivityTracer.jl\src\overloads\gradient_tracer.jl:119
  [7] _interpolate(tx::T, ty::T) where T<:SparseConnectivityTracer.HessianTracer
     @ SparseConnectivityTracerDataInterpolationsExt C:\Users\konin_bt\SciML\Other\SparseConnectivityTracer.jl\src\overloads\hessian_tracer.jl:188
  [8] _interpolate(tx::SparseConnectivityTracer.GradientTracer, ::Real)
     @ SparseConnectivityTracerDataInterpolationsExt C:\Users\konin_bt\SciML\Other\SparseConnectivityTracer.jl\src\overloads\gradient_tracer.jl:125
  [9] _interpolate(::Real, ty::SparseConnectivityTracer.GradientTracer)
     @ SparseConnectivityTracerDataInterpolationsExt C:\Users\konin_bt\SciML\Other\SparseConnectivityTracer.jl\src\overloads\gradient_tracer.jl:129
 [10] _interpolate(dx::D, y::Real) where {P, T<:SparseConnectivityTracer.GradientTracer, D<:SparseConnectivityTracer.Dual{P, T}}
     @ SparseConnectivityTracerDataInterpolationsExt C:\Users\konin_bt\SciML\Other\SparseConnectivityTracer.jl\src\overloads\gradient_tracer.jl:163
 [11] _interpolate(dx::D, dy::D) where {P, T<:SparseConnectivityTracer.GradientTracer, D<:SparseConnectivityTracer.Dual{P, T}}
     @ SparseConnectivityTracerDataInterpolationsExt C:\Users\konin_bt\SciML\Other\SparseConnectivityTracer.jl\src\overloads\gradient_tracer.jl:145
 [12] _interpolate(dx::D, dy::D) where {P, T<:SparseConnectivityTracer.HessianTracer, D<:SparseConnectivityTracer.Dual{P, T}}
     @ SparseConnectivityTracerDataInterpolationsExt C:\Users\konin_bt\SciML\Other\SparseConnectivityTracer.jl\src\overloads\hessian_tracer.jl:227
 [13] _interpolate(x::Real, dy::D) where {P, T<:SparseConnectivityTracer.GradientTracer, D<:SparseConnectivityTracer.Dual{P, T}}
     @ SparseConnectivityTracerDataInterpolationsExt C:\Users\konin_bt\SciML\Other\SparseConnectivityTracer.jl\src\overloads\gradient_tracer.jl:196
 [14] _interpolate(dx::D, y::Real) where {P, T<:SparseConnectivityTracer.HessianTracer, D<:SparseConnectivityTracer.Dual{P, T}}
     @ SparseConnectivityTracerDataInterpolationsExt C:\Users\konin_bt\SciML\Other\SparseConnectivityTracer.jl\src\overloads\hessian_tracer.jl:256
 [15] _interpolate(x::Real, ty::SparseConnectivityTracer.HessianTracer)
     @ SparseConnectivityTracerDataInterpolationsExt C:\Users\konin_bt\SciML\Other\SparseConnectivityTracer.jl\src\overloads\hessian_tracer.jl:204
 [16] _interpolate(tx::SparseConnectivityTracer.HessianTracer, y::Real)
     @ SparseConnectivityTracerDataInterpolationsExt C:\Users\konin_bt\SciML\Other\SparseConnectivityTracer.jl\src\overloads\hessian_tracer.jl:200
 [17] _interpolate(x::Real, dy::D) where {P, T<:SparseConnectivityTracer.HessianTracer, D<:SparseConnectivityTracer.Dual{P, T}}
     @ SparseConnectivityTracerDataInterpolationsExt C:\Users\konin_bt\SciML\Other\SparseConnectivityTracer.jl\src\overloads\hessian_tracer.jl:284

The original methods are e.g. _interpolate(A::LinearInterpolation, t), so the first argument is not a real number at all. However, this is assumed by the generated methods. I think this is a quite general issue you have to adress.

adrhill · 2024-08-21T13:33:41Z

Yes, this kind of function requires writing manual overloads. I should add to the documentation that our "N-to-M operators" assume Real inputs.

Judging by its name, _interpolate seems to be an internal function. We should avoid touching these, as they can break without SemVer notice.

Overloading on callable AbstractInterpolation structs has the additional advantage of them being "1-to-1 operators" on Reals.

SouthEndMusic · 2024-08-21T14:48:57Z

For my application I only need scalar to scalar, but for many interpolation types the output can also be a vector. Or is that still considered 1 to 1?

adrhill · 2024-08-21T15:04:45Z

Are interpolations generic functions $f: \mathbb{R}^n \rightarrow \mathbb{R}^m$?
If so, we'll have to think about what the sparsity patterns should look like and implement the overloads manually.

An overly conservative estimate could be obtained by using tools from src/overloads/arrays.jl:

if all scalar entries in the input vector interact with each other, the union of their index sets can be computed using first_order_or or second_order_or
we can then return a Fill vector from FillArrays.jl that has the combined tracer on each entry.

(This is how we handle matrix inversion for example.)

SouthEndMusic · 2024-08-21T15:17:58Z

DataInterpolations input is always 1D, but the output can be in any vector space. But most support is for scalar/vector output.

adrhill · 2024-08-21T15:24:03Z

Could you rebase this PR on main and move the test cases to the new test/ext folder?

I’ll add the needed overloads for you tomorrow. Any tests you add will help me out greatly.

SouthEndMusic · 2024-08-21T16:35:14Z

In general what I think should be supported (maybe each can be a separate issue/PR):

scalar to scalar interpolation: that's just calling the interpolation object as a callable struct
scalar to array interpolation: idem but with array output (vectors, matrices are also supported somewhat)
code optimization: all derivatives of constant interpolation are zero, second derivative of linear interpolation is zero
derivatives and integrals: computed as e.g. derivative(A::LinearInterpolation, t), integral(A::LinearInterpolation, t) or integral(A::LinearInterpolation, t1, t2)
local: in certain intervals certain derivatives of certain interpolation types are 0. Might not be worth the effort though
derivatives w.r.t. constructor input: quite niche, but sometimes people want to compute derivatives w.r.t. input data (for e.g. optimization). Here the sparsity pattern depends on the interpolation type.

adrhill · 2024-08-21T17:07:00Z

Thanks for the list! I think that should all be doable in here.

SouthEndMusic · 2024-08-22T07:40:06Z

By the way, what's the philosophy of where extensions are located? DataInterpolations has its own extensions for Symbolics, Zygote etc. For those extensions it's also not that bad if they use DataInterpolations internals

adrhill · 2024-08-22T11:38:10Z

I'd say we should have them in here for now:

SCT is less stable and less established than DataInterpolations
the overloads won't use internals from DataInterpolations but (in this case) will have to use internals from SCT

codecov-commenter · 2024-08-22T16:18:29Z

⚠️ Please install the to ensure uploads and comments are reliably processed by Codecov.

Codecov Report

Attention: Patch coverage is 87.12871% with 26 lines in your changes missing coverage. Please review.

Project coverage is 90.42%. Comparing base (c7355d3) to head (23ee8ab).
Report is 1 commits behind head on main.

Files with missing lines	Patch %	Lines
test/ext/test_DataInterpolations.jl	86.01%	20 Missing ⚠️
test/runtests.jl	69.23%	4 Missing ⚠️
src/overloads/ambiguities.jl	0.00%	2 Missing ⚠️

Additional details and impacted files

@@            Coverage Diff             @@
##             main     #178      +/-   ##
==========================================
- Coverage   91.36%   90.42%   -0.94%     
==========================================
  Files          41       44       +3     
  Lines        1772     2037     +265     
==========================================
+ Hits         1619     1842     +223     
- Misses        153      195      +42

☔ View full report in Codecov by Sentry.
📢 Have feedback on the report? Share it here.

adrhill · 2024-08-22T16:51:49Z

1D interpolations should be working now.
Do you have an example for N-dimensional interpolations? I couldn't find any in the DataInterpolations docs.

for other package extensions to copy.

adrhill · 2024-08-23T15:45:45Z

Urgh, I wasn't aware DataInterpolations had separate methods for AbstractArray inputs...

adrhill · 2024-08-28T15:30:15Z

What about things like this:

import SparseConnectivityTracer as SCT
using DataInterpolations

method = SCT.TracerLocalSparsityDetector()

t = [0.0, 1.0, 3.0]
f(u) = ConstantInterpolation(u, t)(2.0)

u = [1.0, 2.0, 5.0]

SCT.jacobian_sparsity(f, u, method)
# 1×3 SparseArrays.SparseMatrixCSC{Bool, Int64} with 1 stored entry:
# ⋅  1  ⋅

As you can see it already works for some methods, but for other methods it doesn't yet (e.g. LinearInterpolation).

Wait, this is an entirely different use-case than what we are currently implementing. You don't want to differentiate through the interpolation, you want to differentiate through the creation of the interpolant?

adrhill · 2024-08-28T15:49:46Z

As you can see it already works for some methods

I would say that with the additional overloads on tracers, this is unintended behavior, as this PR only returns the input indices of the "interpolation query tracers t", not the potential "interpolant constructor tracers" u. Pushing tracers through interpolants constructed from tracers would require additional methods.
We should overload the constructors to throw a NotImplementedError on Tracers to not bloat this PR.

adrhill · 2024-08-28T15:52:12Z

The differentiation of the result of an interpolation w.r.t. to the input arguments of its constructor is also a bit more nuanced than what we are currently doing.

gdalle · 2024-08-28T16:05:16Z

You don't want to differentiate through the interpolation, you want to differentiate through the creation of the interpolant?

As usual, I assume that this works out of the box for local tracers, and that only global tracers are an issue?

The differentiation of the result of an interpolation w.r.t. to the input arguments of its constructor is also a bit more nuanced than what we are currently doing.

IIUC this would require constructing each field of the corresponding Interpolation type (some of which may not even be public API) and filling them with the appropriate tracers in a mathematically sound way. So there's no hope of doing this in a batched way. I think this is out of scope for the current PR, although I understand the appeal.

Of course there's also the crazy option: make LinearInterpolation(u::Vector{Tracer}, t::Vector{Tracer}) return a new TracerLinearInterpolation object without all the caches and SciML shenanigans.

adrhill · 2024-08-28T16:37:17Z

As usual, I assume that this works out of the box for local tracers, and that only global tracers are an issue?

Maybe before this PR, but with the new overloads we possibly return the wrong patterns.

IIUC this would require constructing each field of the corresponding Interpolation type (some of which may not even be public API) and filling them with the appropriate tracers in a mathematically sound way.

I‘m not even sure whether there is a way for Global tracers to return any kind of sparsity. Let’s take LinearInterpolation as an example. For a given query t, the output depends on the closest lower and higher datapoints. Global tracers don’t have the required primal value for such an ordering.

adrhill · 2024-08-28T16:42:23Z

If a dense pattern is the correct solution for global tracers, the solution might not be too complicated:

Collect all tracers from the data (if there are any) and take a union over all of them. Then take a union with the „query tracers“ and return a Fill array of the correct size.

SouthEndMusic · 2024-08-28T16:44:28Z

I‘m not even sure whether there is a way for Global tracers to return any kind of sparsity

You're right about that, sparsity of an interpolation call w.r.t. the input u is only meaningful locally, for global you cannot say anything or you would have to assume dense dependency on the whole of u.

We should overload the constructors to throw a NotImplementedError on Tracers to not bloat this PR

That is fine by me, this is quite a niche usecase. This was also implemented for DataInterpolations + Enzyme only partially and relatively recently because someone specifically asked for it. I think the most important use cases left are derivative and integral.

gdalle · 2024-08-28T16:50:01Z

Global tracers don’t have the required primal value for such an ordering.

That depends, if t is assumed to be ordered then we have that neighborness information

adrhill · 2024-08-28T16:53:19Z

That depends, if t is assumed to be ordered then we have that neighborness information

No, the query is entirely independent from the data. We don’t know which data points are closest to the query without primals.

gdalle · 2024-08-28T18:26:24Z

In that case, my option of a dummy TracerInterpolation containing the merged tracers of all u and all t may not be too dumb?

gdalle · 2024-08-29T13:15:01Z

On second thought, I don't think you can systematically throw error on constructors even if u and t have Tracer elements, because the method that gets called will depend primarily on the vector type, not on the element type.
If DataInterpolations only implements something like LinearInterpolation(u::AbstractVector, t::AbstractVector) then you're in luck and you can be more specific, but if they have special cases then you're screwed because Vector{<:Real} is more specific than AbstractVector{<:Tracer}.

adrhill · 2024-08-29T14:32:13Z

@SouthEndMusic do you have an idea why the 1.6 tests fail and others don't?

ext/SparseConnectivityTracerDataInterpolationsExt.jl

test/ext/test_DataInterpolations.jl

adrhill · 2024-09-02T13:46:16Z

I spent a bit too much time writing tests for interpolants containing tracers in c993787 just to figure out we can't add overloads on them for the same reason @gdalle mentioned we can't overload the constructors:

[...] The method that gets called will depend primarily on the vector type, not on the element type.

If DataInterpolations only implements something like LinearInterpolation(u::AbstractVector, t::AbstractVector) then you're in luck and you can be more specific, but if they have special cases then you're screwed because Vector{<:Real} is more specific than AbstractVector{<:Tracer}.

The same applies here: we can easily overload (interp::QuadraticInterpolation)(t::HessianTracer), but overloads like (interp::QuadraticInterpolation{T})(x) where {T<:AbstractArray{<:AbstractTracer}} might get missed unless we generate a lot of code over a big loop of common array types.

That is fine by me, this is quite a niche usecase.

I afraid we'll have to leave it at supporting tracer inputs, at least for now.

SouthEndMusic marked this pull request as draft August 20, 2024 19:32

adrhill mentioned this pull request Aug 22, 2024

Document limitations of operator overloading utils #180

Merged

SouthEndMusic added 3 commits August 22, 2024 18:10

Add draft of DataInterpolations extension

8aaaabd

Next attempt

3155a07

The next attempt

598fe2e

adrhill force-pushed the DataInterpolations_support branch from fe318ef to 598fe2e Compare August 22, 2024 16:14

adrhill added 3 commits August 22, 2024 18:25

Move tests to test/ext folder

1c282d9

Add package extrension testset

d3eff69

Add overloads

e348a86

adrhill added 4 commits August 22, 2024 18:54

Fix for Julia 1.6

3d4c73d

Specify DataInterp ext. isn't an ideal template

4c659d0

for other package extensions to copy.

Test on more types of interpolation

78052bd

Fix path

1e73da1

adrhill marked this pull request as ready for review August 22, 2024 17:08

adrhill added 2 commits August 28, 2024 16:54

Reorganize code and add TODOs

7524a02

Add test cases on other input types

457a1cc

adrhill added 5 commits August 29, 2024 14:27

Fix Jacobian tests

abe0189

Fix Hessian tests

f603ff4

Remove TODOs

c1bde9e

Remove TODOs v2

a98b0d6

Remove overloads on Dual when possible

894da06

adrhill reviewed Aug 29, 2024

View reviewed changes

ext/SparseConnectivityTracerDataInterpolationsExt.jl Outdated Show resolved Hide resolved

adrhill reviewed Aug 29, 2024

View reviewed changes

test/ext/test_DataInterpolations.jl Show resolved Hide resolved

adrhill added 4 commits September 2, 2024 11:37

Remove support for Julia <1.10

99ac119

Fix ambiguity

814a042

Test generation of tests

c993787

Remove tests on interpolants containing tracers

23ee8ab

adrhill merged commit cb273a1 into adrhill:main Sep 2, 2024
6 checks passed

adrhill mentioned this pull request Sep 2, 2024

Scalar operators to support #74

Closed

adrhill referenced this pull request Sep 2, 2024

SparseConnectivityTracer v0.6.3

116f027

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

`DataInterpolations` support #178

`DataInterpolations` support #178

SouthEndMusic commented Aug 20, 2024

SouthEndMusic commented Aug 20, 2024

adrhill commented Aug 20, 2024 •

edited

Loading

SouthEndMusic commented Aug 21, 2024 •

edited

Loading

adrhill commented Aug 21, 2024

SouthEndMusic commented Aug 21, 2024 •

edited

Loading

adrhill commented Aug 21, 2024

SouthEndMusic commented Aug 21, 2024

adrhill commented Aug 21, 2024 •

edited

Loading

SouthEndMusic commented Aug 21, 2024

adrhill commented Aug 21, 2024 •

edited

Loading

SouthEndMusic commented Aug 21, 2024

adrhill commented Aug 21, 2024

SouthEndMusic commented Aug 22, 2024 •

edited

Loading

adrhill commented Aug 22, 2024

codecov-commenter commented Aug 22, 2024 •

edited

Loading

adrhill commented Aug 22, 2024

adrhill commented Aug 23, 2024

adrhill commented Aug 28, 2024

adrhill commented Aug 28, 2024

adrhill commented Aug 28, 2024

gdalle commented Aug 28, 2024

adrhill commented Aug 28, 2024

adrhill commented Aug 28, 2024

SouthEndMusic commented Aug 28, 2024

gdalle commented Aug 28, 2024

adrhill commented Aug 28, 2024

gdalle commented Aug 28, 2024

gdalle commented Aug 29, 2024

adrhill commented Aug 29, 2024

adrhill commented Sep 2, 2024

DataInterpolations support #178

DataInterpolations support #178

Conversation

SouthEndMusic commented Aug 20, 2024

SouthEndMusic commented Aug 20, 2024

adrhill commented Aug 20, 2024 • edited Loading

SouthEndMusic commented Aug 21, 2024 • edited Loading

adrhill commented Aug 21, 2024

SouthEndMusic commented Aug 21, 2024 • edited Loading

adrhill commented Aug 21, 2024

SouthEndMusic commented Aug 21, 2024

adrhill commented Aug 21, 2024 • edited Loading

SouthEndMusic commented Aug 21, 2024

adrhill commented Aug 21, 2024 • edited Loading

SouthEndMusic commented Aug 21, 2024

adrhill commented Aug 21, 2024

SouthEndMusic commented Aug 22, 2024 • edited Loading

adrhill commented Aug 22, 2024

codecov-commenter commented Aug 22, 2024 • edited Loading

Codecov Report

adrhill commented Aug 22, 2024

adrhill commented Aug 23, 2024

adrhill commented Aug 28, 2024

adrhill commented Aug 28, 2024

adrhill commented Aug 28, 2024

gdalle commented Aug 28, 2024

adrhill commented Aug 28, 2024

adrhill commented Aug 28, 2024

SouthEndMusic commented Aug 28, 2024

gdalle commented Aug 28, 2024

adrhill commented Aug 28, 2024

gdalle commented Aug 28, 2024

gdalle commented Aug 29, 2024

adrhill commented Aug 29, 2024

adrhill commented Sep 2, 2024

`DataInterpolations` support #178

`DataInterpolations` support #178

adrhill commented Aug 20, 2024 •

edited

Loading

SouthEndMusic commented Aug 21, 2024 •

edited

Loading

SouthEndMusic commented Aug 21, 2024 •

edited

Loading

adrhill commented Aug 21, 2024 •

edited

Loading

adrhill commented Aug 21, 2024 •

edited

Loading

SouthEndMusic commented Aug 22, 2024 •

edited

Loading

codecov-commenter commented Aug 22, 2024 •

edited

Loading