Fitting Parameters for an Oscillatory System

In this example we will use Optimization.jl to fit the parameters of an oscillatory system (the Brusselator) to data. Special care is taken to avoid getting stuck in a local minimum. Instead of fitting the entire time series directly, we start by fitting parameter values on a short initial interval, use those as the initial guess for a fit on a longer interval, and repeat this until the full time series is covered. This procedure is advantageous for oscillatory systems and helps us reach the global optimum. For more information on fitting ODE parameters to data, please see the main documentation page on this topic.

First, we fetch the required packages.

using Catalyst
using OrdinaryDiffEqRosenbrock
using Optimization
using OptimizationOptimisers # Required for the ADAM optimizer.
using SciMLSensitivity # Required for `Optimization.AutoZygote()` automatic differentiation option.

Next, we declare our model, the Brusselator oscillator.

brusselator = @reaction_network begin
    A, ∅ --> X
    1, 2X + Y --> 3X
    B, X --> Y
    1, X --> ∅
end
p_real = [:A => 1.0, :B => 2.0]

We simulate our model, and from the simulation generate sampled data points (to which we add noise). We will use this data to fit the parameters of our model.

u0 = [:X => 1.0, :Y => 1.0]
tend = 30.0

sample_times = range(0.0; stop = tend, length = 100)
prob = ODEProblem(brusselator, u0, tend, p_real)
sol_real = solve(prob, Rosenbrock23(); tstops = sample_times)
sample_vals = Array(sol_real(sample_times))
sample_vals .*= (1 .+ .1 * rand(Float64, size(sample_vals)) .- .05)
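
To make the noise model explicit, here is a minimal sketch that applies the same multiplicative noise to a small hand-written matrix. The matrix clean_vals and the seeded random number generator rng are assumptions introduced for illustration only; they are not part of the tutorial's workflow.

```julia
using Random

# Hypothetical stand-in for the sampled trajectory: 2 species × 5 time points.
clean_vals = [1.0 2.0 3.0 2.0 1.0;
              1.0 0.5 0.2 0.5 1.0]

# Fixed seed (an assumption added here so that the sketch is reproducible).
rng = MersenneTwister(1234)

# Same form as above: every value is multiplied by a factor drawn
# uniformly from the interval [0.95, 1.05).
noise_factor = 1 .+ 0.1 .* rand(rng, Float64, size(clean_vals)) .- 0.05
noisy_vals = clean_vals .* noise_factor

# Largest relative deviation from the clean data (at most 5 %).
max_rel_dev = maximum(abs.(noisy_vals .- clean_vals) ./ clean_vals)
```

Since the noise above is drawn without a fixed seed, the fitted parameter values you obtain when running this example will differ slightly from the ones printed on this page.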

We can plot the real solution, as well as the noisy samples.

using Plots
default(; lw = 3, framestyle = :box, size = (800, 400))

plot(sol_real; legend = nothing, color = [:darkblue :darkred])
scatter!(sample_times, sample_vals'; color = [:blue :red], legend = nothing)

Next, we create a function to fit the parameters using the ADAM optimizer. Given an initial estimate of the parameter values, pinit, this function fits parameter values, p, to our data samples. The argument tend sets the end of the time interval over which we fit the model, so only samples taken up to tend contribute to the loss. We use an out-of-place set_p function to update the parameter set in each iteration. We also pass set_p, prob, sample_times, and sample_vals as parameters to our optimization problem, which makes them available inside the loss function.

set_p = ModelingToolkit.setp_oop(prob, [:A, :B])
function optimize_p(pinit, tend,
        set_p = set_p, prob = prob, sample_times = sample_times, sample_vals = sample_vals)
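    function loss(p, (set_p, prob, sample_times, sample_vals))
        # Out-of-place update: build a new parameter object with A and B set to the candidate values.
        p = set_p(prob, p)
        # Only compare against samples inside the fitting window [0, tend].
        newtimes = filter(<=(tend), sample_times)
        newprob = remake(prob; p)
        sol = Array(solve(newprob, Rosenbrock23(); saveat = newtimes, verbose = false, maxiters = 10000))
        # Sum of squared errors between the simulation and the matching data columns.
        return sum(abs2, sol .- sample_vals[:, 1:size(sol,2)])
    end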

    # optimize for the parameters that minimize the loss
    optf = OptimizationFunction(loss, Optimization.AutoZygote())
    optprob = OptimizationProblem(optf, pinit, (set_p, prob, sample_times, sample_vals))
    sol = solve(optprob, ADAM(0.1); maxiters = 100)

    # return the parameters we found
    return sol.u
end
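
The way the loss restricts the comparison to the window up to tend can be sketched without any solver. In the snippet below, the data matrix and the sim_vals matrix (standing in for the simulated trajectory) are made-up values chosen for illustration; only the filter call and the sum of squared errors mirror the function above.

```julia
# Hypothetical data: 2 species sampled at 6 time points (0, 2, 4, 6, 8, 10).
sample_times = range(0.0; stop = 10.0, length = 6)
sample_vals = [1.0 2.0 3.0 4.0 5.0 6.0;
               6.0 5.0 4.0 3.0 2.0 1.0]

tend = 4.0
# Keep only the sample times inside the fitting window [0, tend].
newtimes = filter(<=(tend), sample_times)

# Stand-in for the simulated trajectory saved at `newtimes`
# (in the tutorial this comes from `solve(...; saveat = newtimes)`).
sim_vals = [1.0 2.5 3.0;
            6.0 5.0 3.0]

# Compare against the matching leading columns of the data.
window_vals = sample_vals[:, 1:size(sim_vals, 2)]
loss_value = sum(abs2, sim_vals .- window_vals)
```

Here three of the six sample times fall inside the window, so only the first three data columns enter the loss.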

Next, we will fit a parameter set to the data on the interval (0, 10).

p_estimate = optimize_p([5.0, 5.0], 10.0)

2-element Vector{Float64}:
 0.993393478390206
 1.9400561451626188

We can compare this to the real solution, as well as to the sample data:

function plot_opt_fit(p, tend)
    p = set_p(prob, p)
    newprob = remake(prob; tspan = tend, p)
    sol_estimate = solve(newprob, Rosenbrock23())
    plot(sol_real; color = [:blue :red], label = ["X real" "Y real"], linealpha = 0.2)
    scatter!(sample_times, sample_vals'; color = [:blue :red],
        label = ["Samples of X" "Samples of Y"], alpha = 0.4)
    plot!(sol_estimate; color = [:darkblue :darkred], linestyle = :dash,
        label = ["X estimated" "Y estimated"], xlimit = (0.0, tend))
end
plot_opt_fit(p_estimate, 10.0)

Next, we use this parameter estimate as the input to the next iteration of our fitting process, this time on the interval (0, 20).

p_estimate = optimize_p(p_estimate, 20.0)
plot_opt_fit(p_estimate, 20.0)

Finally, we use this estimate as the input to fit a parameter set on the full time interval of the sampled data.

p_estimate = optimize_p(p_estimate, 30.0)
plot_opt_fit(p_estimate, 30.0)

The final parameter estimate is then

p_estimate

2-element Vector{Float64}:
 1.0007866727555992
 1.9999345178718981

which is close to the actual parameter set of [1.0, 2.0].
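
The three fitting steps above follow one pattern: each fit is warm-started from the previous estimate while the fitting window grows. A minimal sketch of that pattern as a loop is shown below. The helper fit_incrementally and the stand-in optimizer toy_fit are hypothetical names introduced here so that the snippet runs on its own; they are not part of Catalyst or Optimization.jl.

```julia
# Hypothetical helper: `fit` is any function (pinit, tend) -> fitted parameters,
# for example the `optimize_p` function defined earlier on this page.
function fit_incrementally(fit, pinit, tends)
    p = pinit
    for tend in tends
        # Warm-start each fit from the previous estimate.
        p = fit(p, tend)
    end
    return p
end

# Toy stand-in for a real optimizer so that the sketch is self-contained:
# it moves the estimate halfway towards a fixed target and ignores `tend`.
toy_target = [1.0, 2.0]
toy_fit(p, tend) = p .+ 0.5 .* (toy_target .- p)

p_toy = fit_incrementally(toy_fit, [5.0, 5.0], (10.0, 20.0, 30.0))
```

With the definitions from this page, the corresponding call would be fit_incrementally(optimize_p, [5.0, 5.0], (10.0, 20.0, 30.0)).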

Why we fit the parameters in iterations

As previously mentioned, the reason we chose to fit the model on a smaller interval to begin with, and then extend the interval, is to avoid getting stuck in a local minimum. Here specifically, we chose our initial interval to be smaller than a full cycle of the oscillation. If we had chosen to fit a parameter set on the full interval immediately, we would have obtained a poor fit and an inaccurate estimate for the parameters.

p_estimate = optimize_p([5.0,5.0], 30.0)
plot_opt_fit(p_estimate, 30.0)
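
The same effect can be illustrated on a much simpler toy problem. The sketch below does not use the Brusselator: it assumes a signal sin(ω t) with true frequency ω = 1 and scans the sum-of-squares loss over a grid of candidate frequencies, once for a window shorter than one period and once for a window covering several periods. The window lengths, the grid, and the helper count_local_minima are arbitrary choices made for this illustration.

```julia
# Toy problem (not the Brusselator): recover the frequency of sin(ω t), true ω = 1.
true_ω = 1.0
toy_loss(ω, times) = sum(abs2, sin.(ω .* times) .- sin.(true_ω .* times))

# Count strict interior local minima of a sampled one-dimensional loss curve.
count_local_minima(v) = count(i -> v[i] < v[i - 1] && v[i] < v[i + 1], 2:(length(v) - 1))

ω_grid = 0.2:0.01:5.0
short_times = range(0.0; stop = 3.0, length = 31)    # shorter than one period (2π ≈ 6.28)
long_times = range(0.0; stop = 30.0, length = 301)   # covers several periods

n_short = count_local_minima([toy_loss(ω, short_times) for ω in ω_grid])
n_long = count_local_minima([toy_loss(ω, long_times) for ω in ω_grid])
```

On the long window the loss curve has more local minima than on the short one (n_long > n_short), because a wrong frequency can still line up with some of the peaks. A gradient-based optimizer started far from the true value is therefore more likely to get trapped when it is given the full time series from the start.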