Optimizing a control signal #2905

cstjean · 2024-07-29T06:26:12Z

(from https://discourse.julialang.org/t/sciml-optimizing-a-control-signal/117504 and https://discourse.julialang.org/t/solving-ode-parameters-using-experimental-data-with-control-inputs/66614/20; apologies if this isn't using quite the right MTK jargon, I'm new at this)

Suppose I’ve got a differential equation that takes some forcing / control signal as input defined as a DataInterpolation.LinearInterpolation(u, t) (similar to the tutorial example). I’d like to find the u vector which minimizes some loss function. How should I do that with ModelingToolkit?

I can pass the individual u1, u2, u3 as parameters, then call LinearInterpolation([u1,u2,u3], t). That doesn’t look very practical, but maybe I can create many parameters in a comprehension?
I believe that I can pass non-numerical values (like Vector, or LinearInterpolation objects) as parameters, by specifying their type somehow. I can’t find the documentation anymore, but is that how it’s done?
DiffEqFlux.jl?

On top of that, we would like to be able to specify a control signal that depends on the value of the dependent variables. I.e. as temperature rises above 80°C, reduce the heat source. I believe this kind of thing can be done with optimal control / model-predictive-control but for our problem we are interested in a reinforcement learning solution.

In theory we could run it as

Solve ODEProblem from initial state S0, from t=0 to t=1 with parameter HEAT=H0, yielding final temperature T1
Decide new heat level H1 based on temperature T1
Solve ODEProblem from initial state S1, from t=1 to t=2 with parameter HEAT=H1, yielding final temperature T2
...

But I was wondering if there was a nicer interface for this kind of setup. I'd really just like to be able to pass an arbitrary function / functor f in the ODEProblem constructor.

Related: SciML/ModelingToolkitStandardLibrary.jl#123

The text was updated successfully, but these errors were encountered:

ChrisRackauckas · 2024-09-02T14:28:25Z

@baggepinnen has some things to point to. You wouldn't want to just fit the interpolation since you have a lot of sparsity in time to exploit.

baggepinnen · 2024-09-02T15:01:31Z

We don't have much that is open source unfortunately.

If you stick an interpolation object in there and optimize over the arrays in the interpolation using single shooting you will not have any sparsity, but you will typically have a difficult optimization problem that may take a long time to optimize and/or converge to a poor local minimum. If the problem is easy enough for this to work, this is the easiest method to implement yourself.

If your system is simple enough, it can be automatically converted to JuMP equations relatively easily after which you can use a package like InfiniteOpt.jl to perform a direct-collocation transcription of the problem. This typically gives you very good performance, but requires a number of manual steps that haven't been documented well yet.

You can of course implement multiple shooting transcription yourself as well, or modify https://docs.sciml.ai/DiffEqFlux/stable/examples/multiple_shooting/ appropriately. For optimal control, using a penalty to the loss like is done in the tutorial is inadequate, you have to formulate the transcription constraints as hard constraints.

cstjean · 2024-09-03T02:27:58Z

We don't have much that is open source unfortunately.

If you're referring to JuliaSim: I think it could eventually be considered on our side. For now, I was given the OK to try to build a simulator POC in Julia, so it's early days.

If you stick an interpolation object in there and optimize over the arrays in the interpolation using single shooting you will not have any sparsity, but you will typically have a difficult optimization problem that may take a long time to optimize and/or converge to a poor local minimum. If the problem is easy enough for this to work, this is the easiest method to implement yourself.

Our current optimization problem is almost certainly single-optimum/convex, and I believe it's fairly modest in its current state. I'm going to give it a try. #2646 (comment) looks promising for the ForwardDiff part.

Thank you for the detailed information.

cstjean mentioned this issue Aug 5, 2024

add an example to IO docs #2926

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Optimizing a control signal #2905

Optimizing a control signal #2905

cstjean commented Jul 29, 2024 •

edited

Loading

ChrisRackauckas commented Sep 2, 2024

baggepinnen commented Sep 2, 2024

cstjean commented Sep 3, 2024

Optimizing a control signal #2905

Optimizing a control signal #2905

Comments

cstjean commented Jul 29, 2024 • edited Loading

ChrisRackauckas commented Sep 2, 2024

baggepinnen commented Sep 2, 2024

cstjean commented Sep 3, 2024

cstjean commented Jul 29, 2024 •

edited

Loading