Uncertainty Quantified Deep Bayesian Model Discovery
In this tutorial, we show how the SciML differential equation solvers compose seamlessly with Bayesian estimation libraries like AdvancedHMC.jl and Turing.jl. This lets us convert a Neural ODE into a Bayesian Neural ODE, which quantifies the uncertainty in the Neural ODE's estimation and forecasting. We work through a full example of a Bayesian Neural ODE sampled with NUTS.
Step 1: Import Libraries
For this example, we will need the following libraries:
# SciML Libraries
import SciMLSensitivity as SMS
import DifferentialEquations as DE
# ML Tools
import Lux
import Zygote
# External Tools
import Random
import Plots
import AdvancedHMC
import MCMCChains
import StatsPlots
import ComponentArrays
Setup: Get the data from the Spiral ODE example
We will also need data to fit against. As a demonstration, we generate our data from a simple cubic ODE, u' = A*u^3, as follows:
u0 = [2.0; 0.0]
datasize = 40
tspan = (0.0, 1.0)
tsteps = range(tspan[1], tspan[2], length = datasize)
function trueODEfunc(du, u, p, t)
true_A = [-0.1 2.0; -2.0 -0.1]
du .= ((u .^ 3)'true_A)'
end
prob_trueode = DE.ODEProblem(trueODEfunc, u0, tspan)
ode_data = Array(DE.solve(prob_trueode, DE.Tsit5(), saveat = tsteps))
2×40 Matrix{Float64}:
2.0 1.97895 1.94728 1.87998 1.74775 … 0.353996 0.53937 0.718119
0.0 0.403905 0.79233 1.15176 1.45561 -1.54217 -1.52816 -1.50614
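The adjoint trick in trueODEfunc may look cryptic: for a vector u, the row-vector product `((u .^ 3)' * A)'` is equivalent to multiplying `u .^ 3` by the transpose of A. A quick stdlib-only sanity check of that identity (the names A and u here are illustrative, reusing the matrix from trueODEfunc):

```julia
A = [-0.1 2.0; -2.0 -0.1]      # same matrix as true_A above
u = [2.0, 1.0]                 # an arbitrary state vector
lhs = ((u .^ 3)' * A)'         # the expression used in trueODEfunc
rhs = transpose(A) * (u .^ 3)  # equivalent matrix-vector product
@assert lhs ≈ rhs
```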
We will want to train a neural network to capture the dynamics that fit ode_data.
Step 2: Define the Neural ODE architecture.
Note that this step offers a lot of flexibility in the number of layers and the number of units per layer. A 100-unit architecture is not necessarily better at prediction/forecasting than a 50-unit one, while a more complicated architecture can take far more computation time without improving performance.
dudt2 = Lux.Chain(x -> x .^ 3,
Lux.Dense(2, 50, tanh),
Lux.Dense(50, 2))
rng = Random.default_rng()
p, st = Lux.setup(rng, dudt2)
const _st = st
function neuralodefunc(u, p, t)
dudt2(u, p, _st)[1]
end
function prob_neuralode(u0, p)
prob = DE.ODEProblem(neuralodefunc, u0, tspan, p)
sol = DE.solve(prob, DE.Tsit5(), saveat = tsteps)
end
p = ComponentArrays.ComponentArray{Float64}(p)
const _p = p
ComponentVector{Float64}(layer_1 = Float64[], layer_2 = (weight = [0.43389758467674255 1.3316842317581177; -0.07050459831953049 1.0818043947219849; … ; 1.6888415813446045 0.23415780067443848; -0.24656350910663605 0.9033612608909607], bias = [-0.393450528383255, -0.32592958211898804, -0.2671773135662079, 0.3475736081600189, -0.692634642124176, 0.04984201863408089, 0.3450500965118408, 0.039619892835617065, 0.37911510467529297, 0.4979235529899597 … -0.6054325699806213, -0.21339572966098785, -0.1932043582201004, -0.694136917591095, -0.22318889200687408, -0.4025947153568268, 0.5187579989433289, 0.36455145478248596, -0.3977013826370239, 0.27677851915359497]), layer_3 = (weight = [-0.15822584927082062 0.012607942335307598 … 0.20132644474506378 -0.21170659363269806; -0.13463136553764343 0.017602255567908287 … -0.2327677309513092 -0.08211147040128708], bias = [0.12374095618724823, -0.07740290462970734]))
Note that the ComponentArray{Float64} conversion is required to put the Lux neural network parameters into Float64 precision.
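For intuition, here is the plain-array analogue of that conversion (p32 and p64 are illustrative names, not part of the tutorial): Lux initializes parameters as Float32, while the HMC machinery below works in Float64.

```julia
p32 = Float32[0.1, -0.2, 0.3]   # Lux-style Float32 initialization
p64 = Float64.(p32)             # element-wise promotion to Float64
@assert eltype(p64) == Float64
@assert p64 ≈ p32               # values unchanged up to Float32 precision
```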
Step 3: Define the loss function for the Neural ODE.
function predict_neuralode(p)
p = p isa ComponentArrays.ComponentArray ? p : convert(typeof(_p), p)
Array(prob_neuralode(u0, p))
end
function loss_neuralode(p)
pred = predict_neuralode(p)
loss = sum(abs2, ode_data .- pred)
return loss, pred
end
loss_neuralode (generic function with 1 method)
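The loss is a sum of squared errors: `sum(abs2, x)` computes Σᵢ xᵢ² without allocating an intermediate array, equivalent to `sum(x .^ 2)`. A tiny check (x is an illustrative vector):

```julia
x = [1.0, -2.0, 2.0]
@assert sum(abs2, x) == sum(x .^ 2) == 9.0   # 1 + 4 + 4
```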
Step 4: Now we integrate the Bayesian estimation workflow prescribed by the AdvancedHMC interface with the Neural ODE defined above
The AdvancedHMC interface requires us to specify (a) the Hamiltonian log density and its gradient, (b) the sampler, and (c) the step size adaptor function.
For the Hamiltonian log density, we use the loss function. The θ*θ term corresponds to Gaussian priors on the parameters.
The user can make several modifications to Step 4: different acceptance ratios, numbers of warmup samples, and numbers of posterior samples can all be tried. One can also use the Variational Inference (ADVI) framework, which doesn't work quite as well as NUTS, whereas the SGLD (Stochastic Gradient Langevin Dynamics) sampler has been observed to outperform NUTS. Have a look at https://sebastiancallh.github.io/post/langevin/ for a brief introduction to SGLD.
l(θ) = -sum(abs2, ode_data .- predict_neuralode(θ)) - sum(θ .* θ)
function dldθ(θ)
x, lambda = Zygote.pullback(l, θ)
grad = first(lambda(1))
return x, grad
end
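To see why the -sum(θ .* θ) term is a Gaussian prior: the log-density of Normal(0, σ) at θ is -θ²/(2σ²) minus a normalizing constant, so -θ² matches σ = 1/√2 up to that additive constant, which HMC ignores. A small stdlib-only check (logprior is an illustrative helper, not part of the tutorial):

```julia
σ = 1 / sqrt(2)
logprior(x) = -x^2 / (2σ^2) - log(σ * sqrt(2π))   # Normal(0, σ) log-pdf
θ = [0.3, -1.2, 0.7]
const_term = -length(θ) * log(σ * sqrt(2π))       # the additive constant
@assert sum(logprior, θ) - const_term ≈ -sum(θ .* θ)
```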
metric = AdvancedHMC.DiagEuclideanMetric(oneunit.(p))
h = AdvancedHMC.Hamiltonian(metric, l, dldθ)
Hamiltonian(metric=DiagEuclideanMetric((layer_1 = Float64[], layer ...]), kinetic=AdvancedHMC.GaussianKinetic())
We use the NUTS sampler with an acceptance ratio of δ = 0.45 in this example. In addition, we use Nesterov dual averaging for the step size adaptation.
We sample using 500 warmup samples and 500 posterior samples.
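To build intuition for the Leapfrog integrator configured below, here is a minimal sketch of a single leapfrog step on a 1D standard-normal target (everything here — the leapfrog and H functions, the numbers — is illustrative, not the AdvancedHMC implementation). The key property is approximate conservation of the Hamiltonian H(θ, r) = -logπ(θ) + r²/2:

```julia
# Target: logπ(θ) = -θ^2/2, so ∇logπ(θ) = -θ.
function leapfrog(θ, r, ϵ)
    r += ϵ / 2 * (-θ)   # half step on the momentum
    θ += ϵ * r          # full step on the position
    r += ϵ / 2 * (-θ)   # half step on the momentum
    return θ, r
end
H(θ, r) = θ^2 / 2 + r^2 / 2      # potential + kinetic energy
θ, r = 1.0, 0.5
θ′, r′ = leapfrog(θ, r, 0.1)
@assert abs(H(θ′, r′) - H(θ, r)) < 1e-3   # energy nearly conserved
```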
integrator = AdvancedHMC.Leapfrog(AdvancedHMC.find_good_stepsize(h, p))
kernel = AdvancedHMC.HMCKernel(AdvancedHMC.Trajectory{AdvancedHMC.MultinomialTS}(integrator, AdvancedHMC.GeneralisedNoUTurn()))
adaptor = AdvancedHMC.StanHMCAdaptor(AdvancedHMC.MassMatrixAdaptor(metric), AdvancedHMC.StepSizeAdaptor(0.45, integrator))
samples, stats = AdvancedHMC.sample(h, kernel, p, 500, adaptor, 500; progress = true)
(ComponentArrays.ComponentVector{Float64, Vector{Float64}, …}[(layer_1 = Float64[], layer_2 = (weight = [0.427173829766288 1.3792771679814837; … ], bias = […]), layer_3 = (weight = […], bias = [0.08175067535455613, -0.09713525542260891])), …], NamedTuple[(n_steps = 7, is_accept = true, acceptance_rate = 1.0, log_density = -403.8361174890106, hamiltonian_energy = 716.2907100110124, hamiltonian_energy_error = -69.01995025759959, max_hamiltonian_energy_error = -69.01995025759959, tree_depth = 3, numerical_error = false, step_size = 0.025, nom_step_size = 0.025, is_adapt = true), …])
Step 5: Plot diagnostics
Now let's make sure the fit is good. This can be done by looking at the chain mixing plot and the autocorrelation plot. First, let's create the chain mixing plot using the plot recipes from MCMCChains.jl (which hook into Plots via StatsPlots):
samples = hcat(samples...)
samples_reduced = samples[1:5, :]
samples_reshape = reshape(permutedims(samples_reduced), (500, 5, 1)) # draws × params × chains
Chain_Spiral = MCMCChains.Chains(samples_reshape)
Plots.plot(Chain_Spiral)
Now we check the autocorrelation plot:
MCMCChains.autocorplot(Chain_Spiral)
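The quantity autocorplot displays is the lag-k sample autocorrelation of each parameter's chain; well-mixed chains decay toward zero quickly as k grows. A stdlib-only sketch of the computation (autocor_lag is an illustrative helper, not the MCMCChains implementation):

```julia
function autocor_lag(x, k)
    n = length(x)
    μ = sum(x) / n
    c0 = sum((x .- μ) .^ 2) / n                                # lag-0 autocovariance
    ck = sum((x[1:(n - k)] .- μ) .* (x[(1 + k):n] .- μ)) / n   # lag-k autocovariance
    return ck / c0
end
x = [1.0, 2.0, 3.0, 4.0, 5.0]
@assert autocor_lag(x, 0) == 1.0   # any series has autocorrelation 1 at lag 0
@assert autocor_lag(x, 1) > 0      # a trending series stays positively correlated
```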
As another diagnostic, let's check the result on retrodicted data. To do this, we generate solutions of the Neural ODE on samples of the neural network parameters, and check the results of the predictions against the data. Let's start by looking at the time series:
pl = Plots.scatter(tsteps, ode_data[1, :], color = :red, label = "Data: Var1", xlabel = "t",
title = "Spiral Neural ODE")
Plots.scatter!(tsteps, ode_data[2, :], color = :blue, label = "Data: Var2")
for k in 1:300
resol = predict_neuralode(samples[:, 100:end][:, rand(1:400)])
Plots.plot!(tsteps, resol[1, :], alpha = 0.04, color = :red, label = "")
Plots.plot!(tsteps, resol[2, :], alpha = 0.04, color = :blue, label = "")
end
losses = map(x -> loss_neuralode(x)[1], eachcol(samples))
idx = findmin(losses)[2]
prediction = predict_neuralode(samples[:, idx])
Plots.plot!(tsteps, prediction[1, :], color = :black, w = 2, label = "")
Plots.plot!(tsteps, prediction[2, :], color = :black, w = 2, label = "Best fit prediction",
ylims = (-2.5, 3.5))
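The best-fit curve above is selected with findmin, which returns the minimum value together with its index, so we retrodict with the parameter sample attaining the lowest loss. For reference (losses here is a toy vector, not the tutorial's):

```julia
losses = [3.2, 1.1, 2.4]
val, idx = findmin(losses)   # returns (minimum, index of minimum)
@assert (val, idx) == (1.1, 2)
```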
That showed the time series form. We can similarly do a phase-space plot:
pl = Plots.scatter(ode_data[1, :], ode_data[2, :], color = :red, label = "Data", xlabel = "Var1",
ylabel = "Var2", title = "Spiral Neural ODE")
for k in 1:300
resol = predict_neuralode(samples[:, 100:end][:, rand(1:400)])
Plots.plot!(resol[1, :], resol[2, :], alpha = 0.04, color = :red, label = "")
end
Plots.plot!(prediction[1, :], prediction[2, :], color = :black, w = 2,
label = "Best fit prediction", ylims = (-2.5, 3))