Yufei Zhang

Cited by

	All	Since 2021
Citations	699	660
h-index	15	15
i10-index	23	22

280

140

210

201920202021202220232024202520265 27 46 48 83 198 269 10

Public access

View all

3 articles

0 articles

available

not available

Based on funding mandates

Co-authors

Christoph ReisingerProfessor of Applied Mathematics, University of OxfordVerified email at maths.ox.ac.uk
Xin GuoUC Berkeley, Cornell Univeristy, IBMVerified email at berkeley.edu
Lukasz SzpruchUniversity of Edinburgh and The Alan Turing InstituteVerified email at ed.ac.uk
Anran HuColumbia UniversityVerified email at columbia.edu
David SiskaSchool of Mathematics, University of EdinburghVerified email at ed.ac.uk
Kazufumi ItoNorth Carolina State UniversityVerified email at math.ncsu.edu
Matteo BaseiQuant researcherVerified email at edf.fr
Xinyu LiUC Berkeley, Oxford UniversityVerified email at berkeley.edu
Eyal NeumanImperial College LondonVerified email at imperial.ac.uk
Xinshi ChenGeorgia Institution of TechnologyVerified email at bytedance.com
Le SongCTO, GenBio AI; Professor, MBZUAIVerified email at mbzuai.ac.ae
James-Michael LeahyPhysicsX and Imperial College LondonVerified email at imperial.ac.uk
Roxana DumitrescuProfessor, ENSAE-CREST, Institut Polytechnique de ParisVerified email at ensae.fr
Cristopher SalviImperial College LondonVerified email at ic.ac.uk
Leandro Sánchez-BetancourtMathematical Institute, and Oxford-Man Institute, University of OxfordVerified email at maths.ox.ac.uk
Camilo HernándezUniversity of Southern CaliforniaVerified email at usc.edu
Du OuyangTsinghua UniversityVerified email at mails.tsinghua.edu.cn
Yanwei JiaThe Chinese University of Hong KongVerified email at cuhk.edu.hk
Henrietta RidleyVerified email at uam.es
Jun Zou, SIAM Fellow, AMS FellowChoh-Ming Li Chair Professor of Mathematics, The Chinese University of Hong KongVerified email at math.cuhk.edu.hk

Yufei Zhang

Imperial College London

Verified email at imperial.ac.uk - Homepage

Stochastic Control Reinforcement Learning Mathematical Finance


Title Sort by citations Sort by year Sort by title	Cited by Cited by	Year
Rectified deep neural networks overcome the curse of dimensionality for nonsmooth value functions in zero-sum games of nonlinear stiff systems C Reisinger, Y Zhang Analysis and Applications 18 (06), 951-999, 2020	96	2020
Logarithmic regret for episodic continuous-time linear-quadratic reinforcement learning over a finite-time horizon M Basei, X Guo, A Hu, Y Zhang Journal of Machine Learning Research 23 (178), 1-34, 2022	62*	2022
A Neural Network-Based Policy Iteration Algorithm with Global -Superlinear Convergence for Stochastic Games on Domains K Ito, C Reisinger, Y Zhang Foundations of Computational Mathematics 21 (2), 331-374, 2021	62	2021
Convergence of policy gradient methods for finite-horizon exploratory linear-quadratic control problems M Giegrich, C Reisinger, Y Zhang SIAM Journal on Control and Optimization 62 (2), 1060-1092, 2024	41*	2024
Exploration-exploitation trade-off for continuous-time episodic reinforcement learning with linear-convex models L Szpruch, T Treetanthiploet, Y Zhang arXiv preprint arXiv:2112.10264, 2021	39	2021
Optimal scheduling of entropy regularizer for continuous-time linear-quadratic reinforcement learning L Szpruch, T Treetanthiploet, Y Zhang SIAM Journal on Control and Optimization 62 (1), 135-166, 2024	37	2024
Regularity and stability of feedback relaxed controls C Reisinger, Y Zhang SIAM Journal on Control and Optimization 59 (5), 3118-3151, 2021	37	2021
Reinforcement learning for linear-convex models with jumps via stability analysis of feedback controls X Guo, A Hu, Y Zhang SIAM Journal on Control and Optimization 61 (2), 755-787, 2023	35	2023
Understanding deep architecture with reasoning layer X Chen, Y Zhang, C Reisinger, L Song Advances in Neural Information Processing Systems 33, 1240-1252, 2020	29	2020
A fast iterative PDE-based algorithm for feedback controls of nonsmooth mean-field control problems C Reisinger, W Stockinger, Y Zhang SIAM Journal on Scientific Computing 46 (4), A2737-A2773, 2024	25	2024
A posteriori error estimates for fully coupled McKean-Vlasov forward-backward SDEs C Reisinger, W Stockinger, Y Zhang IMA Journal of Numerical Analysis 44 (4), 2323–2369, 2023	23	2023
A Fisher–Rao Gradient Flow for Entropy-Regularised Markov Decision Processes in Polish Spaces B Kerimkulov, JM Leahy, D Siska, L Szpruch, Y Zhang Foundations of Computational Mathematics, 1-75, 2025	20	2025
Approximation schemes for mixed optimal stopping and control problems with nonlinear expectations and jumps R Dumitrescu, C Reisinger, Y Zhang Applied Mathematics & Optimization 83 (3), 1387-1429, 2021	19	2021
Entropy annealing for policy mirror descent in continuous time and space D Sethi, D Šiška, Y Zhang SIAM Journal on Control and Optimization 63 (4), 3006-3041, 2025	17	2025
A Neural RDE approach for continuous-time non-Markovian stochastic control problems M Hoglund, E Ferrucci, C Hernandez, AM Gonzalez, C Salvi, ... International Conference on Machine Learning (ICML 23), New Frontiers in …, 2023	17	2023
Linear convergence of a policy gradient method for some finite horizon continuous time control problems C Reisinger, W Stockinger, Y Zhang SIAM Journal on Control and Optimization 61 (6), 3526-3558, 2023	15	2023
Error estimates of penalty schemes for quasi-variational inequalities arising from impulse control problems C Reisinger, Y Zhang SIAM Journal on Control and Optimization 58 (1), 243-276, 2020	15	2020
An -Potential Game Framework for -Player Dynamic Games X Guo, X Li, Y Zhang SIAM Journal on Control and Optimization 63 (4), 2964-3005, 2025	14*	2025
Towards an analytical framework for dynamic potential games X Guo, Y Zhang SIAM Journal on Control and Optimization 63 (2), 1213-1242, 2025	12*	2025
Statistical learning with sublinear regret of propagator models E Neuman, Y Zhang arXiv preprint arXiv:2301.05157, 2023	11	2023

The system can't perform the operation now. Try again later.

Articles 1–20

Citations per year

Duplicate citations

Merged citations

Add co-authorsCo-authors

Follow

Cited by

Co-authors