Project Heads
Tobias Breiten, Carsten Hartmann
Project Members
Siragan Gailus, Omar Kebiri
Project Duration
01.01.2021 − 31.12.2022
Located at
TU Berlin
Using sampling methods to analyse and improve the parametrisation of neural networks is a relatively new idea. This project is devoted to the systematic development of stochastic differential equation (SDE) approximations for momentum enriched stochastic gradient schemes for deep neural networks and corresponding numerical algorithms. Specifically, we want to study the underdamped Langevin model that can be understood as the SDE counterpart of momentum enriched optimisation schemes. The underdamped Langevin model has a long tradition in statistical mechanics, and, from a control perspective, it offers more flexibility in altering the dynamics while preserving the invariant measure than the overdamped Langevin model that has been a standard tool in computational statistics or statistical learning for more than 20 years. The starting point for our analysis will be a controlled underdamped Langevin equation where the control is adapted to the filtration generated by the Brownian motion. Here the role of the control is twofold: it should accelerate the convergence to equilibrium at low temperature, while preserving the stationary distribution of the dynamics.
Stochastic modified equations with momentum
External Website
Related Publications
[1] T. Breiten, C. Hartmann, L. Neureither and U. Sharma. Stochastic gradient descent and fast relaxation to thermodynamic equilibrium: a stochastic control approach. Journal of Mathematical Physics 62, 123302, 2021.
[2] T. Breiten and K. Kunisch. Improving the convergence rates for the kinetic Fokker-Planck equation by optimal control. SIAM Journal on Control and Optimization, 2023. in press.
[3] H. B. Gherbal, A. Redjil, and O. Kebiri. The relaxed maximum principle for G-stochastic control systems with controlled jumps. Advances in Mathematics: Scientific Journal, 11(12):1313–1343, 2022.
[4] C. Hartmann, L. Neureither, M. Strehlau. Reachability Analysis of Randomly Perturbed Hamiltonian Systems. 7th IFAC Workshop on Lagrangian and Hamiltonian Methods for Nonlinear Control (LHMNC21), IFAC-PapersOnLine, 54(19), 307-314, 2021.
[5] O. Kebiri and N. Elgroud. Relaxed optimal control problem for a finite horizon G-SDE with delay and its application in economics, 2023. arXiv:2303.17427.
[6] Z. Mezdoud, C. Hartmann, M. R. Remita, and O. Kebiri. α-hypergeometric uncertain volatility models and their connection to 2BSDEs. Bull. Inst. Math., Acad. Sin., 16(3):263–288, 2021.
[7] A. Redjil, H. Gherbal, and O. Kebiri. Existence of relaxed stochastic optimal control for G-SDEs with controlled jumps. Stochastic Analysis and Applications, 41(1):115–133, 2023.88
[8] A. Saci, A. Redjil, H. Boutabia, and O. Kebiri. Fractional stochastic differential equations driven by G-Brownian motion with delays. Probab. Math. Stat., 2023 (in print).
[9] C. Schütte, S. Klus, and C. Hartmann. Overcoming the timescale barrier in molecular dynamics: Transfer operators, variational principles, and machine learning. Acta Numerica, 2023 (in print).
[10] U. Sharma and W. Zhang. Non-reversible sampling schemes on submanifolds. SIAM J. Numer. Anal., 59(6):2989–3031, 2021.
[11] R.D. Skeel, C. Hartmann. Choice of Damping Coefficient in Langevin Dynamics. EPJ B Topical Issue “Recent progress and emerging trends in Molecular Dynamics” 94, 178, 2021.
[12] N. Agram, M. Grid, O. Kebiri, and B. Øksendal. Deep learning for solving initial path optimization of mean-field systems with memory. Available at SSRN, 2022.
[13] H. Bouanani, C. Hartmann, and O. Kebiri. Model reduction and uncertainty quantification of multiscale diffusions with parameter uncertainties using nonlinear expectations, 2021. arXiv:
2102.04908.
[14] K. Bouguetof, Z. Mezdoud, O. Kebiri, and C. Hartmann. On the existence and uniqueness of the solution to multifractional stochastic delay differential equation, 2023 (submitted).
Related Pictures