神經網路層 f 就可以被我們拿來計算微分 dydt!
dθdt=0
dtdt=1
For computation efficiency, let [hθt] be a augmented state
The solution of an initial value problem exists and is unique, if the differential equation is uniformly Lipschitz continuous in z and continuous in t.
Ajoint method is not reversible.
converges