Photo of Jinchao Xu

Xu is an advocate of the idea that practical applications and theoretical rigor and elegance can go hand in hand. He studies numerical methods for partial differential equations (PDEs) and for big data, especially finite element methods, multigrid methods, and deep neural networks, spanning theoretical analysis, algorithmic development, and practical application. He is well known for groundbreaking work on the design and analysis of fast methods for finite element discretization and for the solution of the resulting large-scale systems of equations, including several fundamental theories and algorithms that bear his name: the Bramble-Pasciak-Xu (BPX) preconditioner, the Hiptmair-Xu (HX) preconditioner, the Xu-Zikatanov (XZ) identity, and the Morley-Wang-Xu (MWX) element. The BPX preconditioner is one of the two fundamental multigrid algorithms for solving large-scale discretized partial differential equations. The HX preconditioner, featured by the U.S. Department of Energy in 2008 as one of the top ten breakthroughs in computational science in recent years, is one of the most efficient solvers for the numerical simulation of electromagnetic problems. The XZ identity is a basic technical tool for the design and analysis of iterative methods such as the multigrid method and the method of alternating projections. The MWX element is the only known class of finite elements constructed uniformly for elliptic partial differential equations of any order in any spatial dimension. Xu has published nearly 200 research papers, including his famous SIAM Review paper 'Iterative Methods by Space Decomposition and Subspace Correction' (1992) and, more recently, his Acta Numerica paper 'Algebraic Multigrid Methods' (2017).
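To give a flavor of the kind of algorithm the BPX preconditioner represents, here is a minimal sketch of an additive multilevel preconditioner of BPX type for a 1D model problem, using Jacobi (diagonal) scaling on each level, a variant sometimes called multilevel diagonal scaling. The 1D setting, the function names, and the omission of mesh-size scalings are illustrative simplifications, not Xu's original formulation.

```python
import numpy as np

def laplacian_1d(n):
    """1D linear finite element stiffness matrix for -u'' with n interior
    nodes (mesh-size scaling omitted; it only rescales the preconditioner)."""
    return 2.0 * np.eye(n) - np.eye(n, k=1) - np.eye(n, k=-1)

def prolongation(n_coarse):
    """Linear interpolation from n_coarse interior nodes to 2*n_coarse + 1."""
    n_fine = 2 * n_coarse + 1
    P = np.zeros((n_fine, n_coarse))
    for j in range(n_coarse):
        i = 2 * j + 1        # coarse node j coincides with fine node i
        P[i, j] = 1.0
        P[i - 1, j] += 0.5   # midpoint neighbors get half weight
        P[i + 1, j] += 0.5
    return P

def bpx_apply(r, levels):
    """Apply a BPX-type additive preconditioner B r = sum_k P_k D_k^{-1} P_k^T r,
    where P_k interpolates level k to the finest grid and D_k is the diagonal
    of the level-k stiffness matrix (one Jacobi sweep per level)."""
    z = np.zeros_like(r)
    for P_k, d_k in levels:
        z += P_k @ ((P_k.T @ r) / d_k)
    return z

# Three-level hierarchy with 3, 7, 15 interior nodes.
P01, P12 = prolongation(3), prolongation(7)       # level-to-next-level maps
P0, P1, P2 = P12 @ P01, P12, np.eye(15)           # prolongations to finest grid
levels = [(P, np.diag(laplacian_1d(P.shape[1]))) for P in (P0, P1, P2)]

r = np.random.rand(15)
z = bpx_apply(r, levels)   # used as the preconditioning step inside, e.g., CG
```

Because all levels are applied additively (in parallel) rather than sequentially, this construction is naturally parallel, which is one reason BPX-type methods are attractive for large-scale computation.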

In recent years, Xu has focused on deep learning models, training algorithms, and their applications. With his collaborators, he has obtained a number of results on both the mathematical and the practical side of deep learning. He showed that the class of functions represented by the most popular deep neural networks (DNNs), those using the ReLU activation function, is identical to the class of linear finite element functions that have been used in numerical PDEs for decades. He also showed that the most popular DNN architecture for image classification, the convolutional neural network (CNN), is directly related to the geometric multigrid method of numerical PDEs; in particular, he demonstrated that a minor modification of the multigrid algorithm leads to a special CNN, known as MgNet, that can compete with and sometimes even outperform standard CNNs on various applications, including image classification, detection, segmentation, and forecasting. He has written a sequence of papers that solve several open problems and establish the best results to date on the approximation properties of neural network functions, especially those using ReLU and its powers as activation functions. His most recent theoretical results, supported by numerical experiments, demonstrate that SGD algorithms and their variants are simply not appropriate training algorithms when deep learning is used for the numerical solution of PDEs. He has further studied, improved, and applied a class of greedy algorithms to the optimization problems that arise when neural networks are used to solve PDEs numerically. Most recently, he has been leading a collaborative effort to develop an Arabic large language model (LLM), known as AceGPT, which has demonstrated highly competitive performance among open-source LLMs for the Arabic language.
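The ReLU-finite element correspondence can be made concrete in one dimension: the nodal "hat" basis function of the linear finite element method is exactly a one-hidden-layer ReLU network with three neurons, so every continuous piecewise linear function on a mesh is representable by a ReLU network. The following is a minimal sketch of that observation; the function names and grid points are illustrative.

```python
import numpy as np

def relu(x):
    return np.maximum(x, 0.0)

def hat(x, a, b, c):
    """Linear finite element 'hat' basis function on [a, c] with peak 1 at b,
    written as a one-hidden-layer ReLU network with three neurons: the
    coefficients encode the slope changes at the three kinks a, b, c."""
    return (relu(x - a) / (b - a)
            - relu(x - b) * (c - a) / ((b - a) * (c - b))
            + relu(x - c) / (c - b))

x = np.linspace(0.0, 1.0, 101)
phi = hat(x, 0.2, 0.5, 0.8)                    # one hat function
assert np.isclose(hat(np.array([0.5]), 0.2, 0.5, 0.8), 1.0).all()
# Any linear finite element function is a sum of such hats weighted by its
# nodal values, hence is itself a (wider) one-hidden-layer ReLU network.
```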
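The multigrid-CNN connection behind MgNet can likewise be sketched: a multigrid smoothing iteration u <- u + B(f - Au), with the operators A and B realized as learned convolutions and interleaved with nonlinearities, becomes a CNN block. Below is a hedged PyTorch sketch of a single-level MgNet-style smoothing block following the published MgNet iteration; the channel count, kernel size, and class name are illustrative choices, and the full MgNet additionally moves between grids with restriction and prolongation.

```python
import torch
import torch.nn as nn

class MgNetBlock(nn.Module):
    """One MgNet-style smoothing block: repeat u <- u + sigma(B(sigma(f - A(u)))),
    where A and B are learned 3x3 convolutions and sigma is ReLU."""
    def __init__(self, channels=64, smoothings=2):
        super().__init__()
        self.A = nn.Conv2d(channels, channels, 3, padding=1)
        self.B = nn.ModuleList([nn.Conv2d(channels, channels, 3, padding=1)
                                for _ in range(smoothings)])
        self.act = nn.ReLU(inplace=True)

    def forward(self, u, f):
        for B in self.B:
            u = u + self.act(B(self.act(f - self.A(u))))  # smoothing step
        return u, f    # both the iterate u and the data f are carried forward

f = torch.randn(1, 64, 32, 32)       # 'right-hand side' feature map
u = torch.zeros_like(f)              # initial iterate
u, f = MgNetBlock()(u, f)
```

Carrying both u and f through the network, rather than a single feature tensor, is the structural fingerprint that multigrid lends to this architecture.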
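The greedy algorithms mentioned above can be illustrated in their generic dictionary form. The sketch below implements an orthogonal greedy algorithm: each step selects the dictionary column most correlated with the current residual, then re-fits all selected columns by least squares. In the neural network setting the dictionary would consist of sampled neuron activations such as relu(w*x + b); all names and parameters here are illustrative, not a specific published implementation.

```python
import numpy as np

def orthogonal_greedy(target, dictionary, n_terms):
    """Greedily select n_terms dictionary columns; after each selection,
    recompute all coefficients by orthogonal projection (least squares)."""
    chosen = []
    residual = target.copy()
    for _ in range(n_terms):
        scores = np.abs(dictionary.T @ residual)
        scores[chosen] = -np.inf              # never pick a column twice
        chosen.append(int(np.argmax(scores)))
        G = dictionary[:, chosen]
        coef, *_ = np.linalg.lstsq(G, target, rcond=None)
        residual = target - G @ coef
    return chosen, coef

# Dictionary of ReLU neurons relu(w*x + b) sampled on a grid (illustrative).
x = np.linspace(-1.0, 1.0, 200)
ws, bs = np.meshgrid(np.linspace(-3, 3, 25), np.linspace(-1, 1, 25))
D = np.maximum(ws.ravel()[None, :] * x[:, None] + bs.ravel()[None, :], 0.0)
D /= np.linalg.norm(D, axis=0) + 1e-12        # normalize columns
idx, c = orthogonal_greedy(np.sin(np.pi * x), D, n_terms=8)
```

Unlike gradient-based training, each greedy step solves a small, well-posed subproblem exactly, which is part of the appeal of such methods for neural network approximation of PDE solutions.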