神經網路層 $f$ 就可以被我們拿來計算微分 $\frac{dy}{dt}$!
$\frac{d \theta}{dt} = 0$
$\frac{dt}{dt} = 1$
For computation efficiency, let $$\begin{bmatrix} h \\ \theta \\ t \end{bmatrix}$$ be a augmented state
The solution of an initial value problem exists and is unique, if the differential equation is uniformly Lipschitz continuous in z and continuous in t.
Ajoint method is not reversible.
converges