Interpolation functions #1814

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

Sign up for GitHub

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Jump to bottom

Closed

pgree wants to merge 79 commits into stan-dev:develop from pgree:feature/58-interp

Contributor

pgree commented Mar 31, 2020 •

edited by syclik

Loading

Summary

I added a function that does interpolation of a set of function values, (x_i, y_i), specified by the user. The algorithm works by first doing a linear interpolation between points and then smoothing that linear interpolation by convolution with a Gaussian.

In order to enforce that the interpolated function values coincide with the user-inputted function values, the algorithm at each step convolves a Gaussian with a new piecewise linear function in which the function values are shifted by the difference between the most recent interpolation and the desired, original value.

The interpolated function can be evaluated analytically aside from an evaluation of the error function. The derivative of the interpolated function can be evaluated analytically.

This is a work-in-progress. I have discussed some of the design choices with @bbbales2 but that's it.

Tests

There are tests for the "accuracy" of the interpolation and tests for the derivative of the interpolation that compare autodiff to finite differences.

Side Effects

The variance of the Gaussian in the convolution is currently 1/10 the minimum distance between the x-values specified by the user. This has the potentially to lead to undesired behavior.

There is a tolerance hardcoded for when to stop the iterative algorithm in terms of the maximum distance between the interpolated values at the original points and the user-specified values at those points.

Checklist

Math issue Interpolation function(s) #58
Copyright holder: Columbia University
the basic tests are passing
the code is written in idiomatic C++ and changes are documented in the doxygen
the new changes are tested

pgree and others added 4 commits

March 31, 2020 15:56


          added interpolation functions

f340d6f


          moved prim tests to new files

63172f1


          Merge branch 'develop' into feature/58-interp

389f42b


          [Jenkins] auto-formatting by clang-format version 6.0.0 (tags/google/…

98fdf8b

…stable/2017-11-14)

pgree mentioned this pull request

new interpolation scheme stan-dev/design-docs#18

Open

pgree and others added 4 commits

April 1, 2020 11:29


          fixing messed up branch

efde025


          fixing messed up branch again

2169b76


          trying merge

b36451b


          [Jenkins] auto-formatting by clang-format version 5.0.0-3~16.04.1 (ta…

6c8726d

…gs/RELEASE_500/final)

SteveBronder requested changes

View reviewed changes

Collaborator

SteveBronder left a comment •

edited

Loading

Thanks for submitting!

Couple quick review comments. In general I think you need to take the stuff in prim and flesh it out to work for fvar and var types for any of the parameters since you only have a rev specialization for the 5th argument to conv_gaus_line

I also think the names for these should be fleshed out. conv_gaus_line could be something like gauss_line_convolution

stan/math/prim/fun/conv_gaus_line.hpp Outdated

Comment on lines 15 to 16

		double der_conv_gaus_line(double t0, double t1, double a, double b, double x0,
		double sig2) {

Collaborator

SteveBronder Apr 2, 2020

Generally we want prim functions to be templated generically to work with any type, then the specialization in rev will be for analytic calculations.

Contributor Author

pgree Aug 4, 2020

moved this function to the rev version of this file

stan/math/prim/fun/conv_gaus_line.hpp Outdated

Comment on lines 12 to 14

+              /*
+              evaluate the derivative of conv_gaus_line with respect to x
+              */

Collaborator

SteveBronder Apr 2, 2020

Need full docs for external facing functions

stan/math/prim/fun/conv_gaus_line.hpp Outdated

Comment on lines 17 to 21

+                using stan::math::normal_cdf;
+                double pi = stan::math::pi();
+                using std::exp;
+                using std::pow;
+                using std::sqrt;

Collaborator

SteveBronder Apr 2, 2020

Put using aliases together at the top of the function

stan/math/prim/fun/conv_gaus_line.hpp Outdated

+                using std::exp;
+                using std::pow;
+                using std::sqrt;
+                double sig = sqrt(sig2);

Collaborator

SteveBronder Apr 2, 2020

Suggested change

      
              double sig = sqrt(sig2);
          
              const double sig = sqrt(sig2);

stan/math/prim/fun/conv_gaus_line.hpp Outdated

+                double sig = sqrt(sig2);
+                double y;
+                double alpha = sqrt(2 * pi * sig2);

Collaborator

SteveBronder Apr 2, 2020

Suggested change

      
              double alpha = sqrt(2 * pi * sig2);
          
              const double alpha = sqrt(2 * pi * sig2);

stan/math/prim/fun/gaus_interp.hpp Outdated

Comment on lines 43 to 46

+                if (x <= xs[0])
+                  return ys[0];
+                if (x >= xs[n - 1])
+                  return ys[n - 1];

Collaborator

SteveBronder Apr 2, 2020

Suggested change

      
              if (x <= xs[0])
          
                return ys[0];
          
              if (x >= xs[n - 1])
          
                return ys[n - 1];
          
              if (x <= xs[0]) {
          
                return ys[0];
          
              }
          
              if (x >= xs[n - 1]) {
          
                return ys[n - 1];
          
              }

stan/math/prim/fun/gaus_interp.hpp Outdated

Comment on lines 40 to 41

		double lin_interp_pt(int n, vector<double> xs, vector<double> ys,
		vector<double> as, vector<double> bs, double x) {

Collaborator

SteveBronder Apr 2, 2020

These should all be const& parameters so copies are not made

Collaborator

SteveBronder Apr 2, 2020

This is true for everywhere you are using std::vector, arithmetic types can be passed by value

Contributor Author

pgree Aug 4, 2020

I made these changes generally and also I created a new file for lin_interp. It seemed to make sense to separate the linear interpolation from this smooth interpolation.

stan/math/prim/fun/gaus_interp.hpp Outdated

+                lin_interp_coefs(n, xs, ys, as, bs);
+                // evaluate at new points
+                vector<double> ys_new;

Collaborator

SteveBronder Apr 2, 2020

Reserve the vector size ahead of the loop to avoid reallocation

Suggested change

      
              vector<double> ys_new;
          
              vector<double> ys_new;
          
              ys_new.reserve(n_new);

stan/math/prim/fun/gaus_interp.hpp Outdated

Comment on lines 110 to 111

		template <typename Tx>
		double min_diff(int n, std::vector<Tx> xs) {

Collaborator

SteveBronder Apr 2, 2020

If this is only used in this file as an internal function then it should be in the internal namespace

stan/math/prim/fun/gaus_interp.hpp Outdated

+              */
+              template <typename Tx>
+              double min_diff(int n, std::vector<Tx> xs) {
+                double dmin = value_of(xs[1]) - value_of(xs[0]);

Collaborator

SteveBronder Apr 2, 2020

idt this will work for fvar<var>

Contributor Author

pgree Aug 4, 2020

I think this is no longer relevant, because now the xs (and ys) will only be inputted as data

Contributor Author

pgree commented Apr 6, 2020

@SteveBronder thanks for reviewing and for these comments. There is some discussion -- stan-dev/design-docs#18 -- about how and if this should be implemented so I will wait until there is some agreement before iterating further on this.

syclik changed the title ~~interpolation functions, WIP with design doc~~ [WIP] interpolation functions, WIP with design doc

Member

syclik commented Apr 28, 2020

Just changed the title to reflect this is WIP.

Collaborator

SteveBronder commented Jul 16, 2020

@pgree are there any blockers for this?

Contributor Author

pgree commented Jul 16, 2020

@SteveBronder sorry for the delay on this, I got distracted and had some Catalina-related problems. I will get to this asap

Collaborator

SteveBronder commented Jul 16, 2020

Cool no worries just wanted to check in!

pgree and others added 14 commits

August 4, 2020 17:20


          changes to reflect design doc discussion

ee28a65


          fixing tests

7582c50


          reflects steve's changes

e91a3fc


          adding docs

eb0a0d9


          Merge branch 'feature/58-interp' of https://github.com/pgree/math int…

fc4abb5

…o feature/58-interp


          Merge commit '5b30e403bb1b2276dee2dd3d6736e52e67960651' into HEAD

774dccb


          [Jenkins] auto-formatting by clang-format version 6.0.0-1ubuntu2~16.0…

c4582be

…4.1 (tags/RELEASE_600/final)


          fixing formatting

74b8419


          doc typo fix

0ce6663


          another doc typo fix

c4dd75d


          another doc typo fix

be423da


          another doc typo fix

05737f4


          adding checks and tests

149f0d7


          [Jenkins] auto-formatting by clang-format version 6.0.0-1ubuntu2~16.0…

8594a47

…4.1 (tags/RELEASE_600/final)

stan-buildbot and others added 3 commits

October 27, 2020 22:14


          [Jenkins] auto-formatting by clang-format version 6.0.0-1ubuntu2~16.0…

62ea083

…4.1 (tags/RELEASE_600/final)


          merge conflicts

b5f69eb


          merge conf

d0d75a2

SteveBronder mentioned this pull request

Including .hpp files and namespaces stan-dev/stanc3#712

Closed

pgree and others added 3 commits

November 11, 2020 13:51


          rerun tests

a992cad


          Merge commit '5291fc8cf03c901201243014957843ffb61c4705' into HEAD

226ee16


          [Jenkins] auto-formatting by clang-format version 6.0.0-1ubuntu2~16.0…

57c2f8b

…4.1 (tags/RELEASE_600/final)

Contributor

stan-buildbot commented Nov 12, 2020

Name	Old Result	New Result	Ratio	Performance change( 1 - new / old )
gp_pois_regr/gp_pois_regr.stan	3.02	3.02	1.0	-0.11% slower
low_dim_corr_gauss/low_dim_corr_gauss.stan	0.02	0.02	1.0	0.39% faster
eight_schools/eight_schools.stan	0.12	0.11	1.02	2.18% faster
gp_regr/gp_regr.stan	0.17	0.17	1.01	0.67% faster
irt_2pl/irt_2pl.stan	5.69	5.67	1.0	0.4% faster
performance.compilation	88.16	85.49	1.03	3.03% faster
low_dim_gauss_mix_collapse/low_dim_gauss_mix_collapse.stan	8.47	8.49	1.0	-0.24% slower
pkpd/one_comp_mm_elim_abs.stan	30.86	29.95	1.03	2.94% faster
sir/sir.stan	136.36	135.2	1.01	0.85% faster
gp_regr/gen_gp_data.stan	0.04	0.04	0.99	-1.35% slower
low_dim_gauss_mix/low_dim_gauss_mix.stan	2.96	2.96	1.0	0.17% faster
pkpd/sim_one_comp_mm_elim_abs.stan	0.38	0.38	1.01	1.44% faster
arK/arK.stan	1.76	1.78	0.99	-1.09% slower
arma/arma.stan	0.59	0.6	1.0	-0.35% slower
garch/garch.stan	0.75	0.74	1.01	0.92% faster
Mean result: 1.00676635704

Jenkins Console Log
Blue Ocean
Commit hash: 57c2f8b

Machine information

ProductName: Mac OS X ProductVersion: 10.11.6 BuildVersion: 15G22010

CPU:
Intel(R) Xeon(R) CPU E5-1680 v2 @ 3.00GHz

G++:
Configured with: --prefix=/Applications/Xcode.app/Contents/Developer/usr --with-gxx-include-dir=/usr/include/c++/4.2.1
Apple LLVM version 7.0.2 (clang-700.1.81)
Target: x86_64-apple-darwin15.6.0
Thread model: posix

Clang:
Apple LLVM version 7.0.2 (clang-700.1.81)
Target: x86_64-apple-darwin15.6.0
Thread model: posix

pgree changed the title ~~[WIP] interpolation functions, WIP with design doc~~ Interpolation functions

Member

rok-cesnovar commented Jun 21, 2021

Bump. What is the status of this PR? Waiting on a re-review or something else?

Contributor Author

pgree commented Jun 21, 2021

@rok-cesnovar I think all the requested changes were made so just waiting for a re-review.

Contributor

wds15 commented Jun 21, 2021

Didn't we also have a design doc for this. Not sure what the status is there?

Contributor Author

pgree commented Jun 23, 2021

Didn't we also have a design doc for this. Not sure what the status is there?

Yes, exactly. This PR reflects what was discussed there, including your suggestions @wds15

Contributor Author

pgree commented Sep 14, 2021

@bgoodri - per our discussion today, here's the PR

Contributor

bgoodri commented Sep 14, 2021

Sorry, didn't realize this existed until today. It sounds like it is already been reviewed, in which case we should merge it once the conflict with fun.hpp is fixed. But what were the pros and cons of this interpolation scheme as compared to any of the ones in Boost that we could wrap?

Contributor Author

pgree commented Sep 15, 2021

But what were the pros and cons of this interpolation scheme as compared to any of the ones in Boost that we could wrap?

This interpolation schemes has a few of advantages

the interpolated function is smooth
the interpolated function can be evaluated with a formula
the derivative of the interpolated function can be evaluated with a formula
the interpolated function coincides with the inputted points

At the time we decided on this interpolation scheme, Boost didn't have anything with these features

Member

bob-carpenter commented Sep 15, 2021

It'd be great to add an interpolation function to Stan. Andrew and others have been asking for something like the interpolation function in BUGS for ages.

Is the approach unlike any of the Boost implementations? https://www.boost.org/doc/libs/1_77_0/libs/math/doc/html/interpolation.html

Contributor

bgoodri commented Sep 15, 2021

At the time we decided on this interpolation scheme, Boost didn't have anything with these features

Fair enough, although since we will probably include pchip at some point, we should think about having some sort of a common interface for them, at least at the Stan Language level. It would be great it we could soon have callable f = interpolation(x, y); in the transformed data block.

Contributor Author

pgree commented Sep 15, 2021

Is the approach unlike any of the Boost implementations? https://www.boost.org/doc/libs/1_77_0/libs/math/doc/html/interpolation.html

From above:

This interpolation scheme has a few of advantages

the interpolated function is smooth
the interpolated function can be evaluated with a formula
the derivative of the interpolated function can be evaluated with a formula
the interpolated function coincides with the inputted points

Also, you end up with non-oscillatory behavior between reference points. At the time we decided on this interpolation scheme, Boost didn't have anything with these features

The plots for the Akima spline look nice, but I'm not sure how they would do in the environments where people will use them. I don't know if one of the Boost interpolation schemes would work well enough for users' needs. Maybe they would.

we should think about having some sort of a common interface for them, at least at the Stan Language level. It would be great it we could soon have callable f = interpolation(x, y); in the transformed data block.

That's pretty much what we did. I discussed this with (I think) @SteveBronder a while back and we ended up settling on

lin_interp(xs, ys, x)
gaus_interp(xs, ys, x)

which are both in this PR

Member

bob-carpenter commented Sep 15, 2021

Thanks.

lin_interp(xs, ys, x)
gaus_interp(xs, ys, x)

We've used "normal" rather than "gauss" everywhere in our math lib. If you want to call out Gauss, it should at least be spelled out all the way to "gauss".
The BUGS function is interp.lin. It's documented on p. 13 of http://www.mrc-bsu.cam.ac.uk/wp-content/uploads/manual14.pdf. The advantage of interp_lin over lin_interp is that it's better for autocomplete and indexing.

I probably won't be reviewing this and we don't double review, but I strongly urge you not to use gaus_ whatever other decisions are made.

Contributor Author

pgree commented Sep 16, 2021

great, will change to interp_lin. Since interp_normal doesn't sound like a useful name, how about interp_smooth?

Member

bob-carpenter commented Sep 16, 2021

I like the more descriptive name. I'm OK with interp_gauss if you think interp_normal is too vague. I just really don't like just gaus.

Contributor

bgoodri commented Sep 18, 2021

Since all of the interpolation schemes we might implement in Stan in the future will presumably be smooth to some extent, I don't think interp_smooth would disambiguate. Maybe interp_convolve?

Member

bob-carpenter commented Sep 21, 2021

If there aren't likely to be other normal or Gaussian interpolations, I'd prefer

interp_normal, or
interp_gauss.

If there might be other Gaussian interpolations that don't involve convolution, then I'd prefer

interp_normal_conv,
interp_gauss_conv,
interp_normal_convolve, or
interp_gauss_convolve.

pgree and others added 4 commits

September 21, 2021 19:58


          renaming

2cbbad1


          Merge branch 'feature/58-interp' of https://github.com/pgree/math int…

a9f71a2

…o feature/58-interp


          Merge commit '57f05b7354af652031e10f7e278f82bdbedb1a40' into HEAD

7c8ab8b


          [Jenkins] auto-formatting by clang-format version 6.0.0-1ubuntu2~16.0…

c19300c

…4.1 (tags/RELEASE_600/final)

Member

syclik commented Apr 29, 2022

@pgree, sorry about the delay. I'm closing the PR for the moment. Please reopen if it's still active.

When that happens, I'll clear out the old review and give it a fresh one and turn that around quickly.

Right now, the tests fail, so we'll need to trigger the build again anyway.

syclik closed this

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet