Communications on Applied Mathematics and Computation ›› 2024, Vol. 6 ›› Issue (2): 1490-1517.doi: 10.1007/s42967-024-00389-8

• ORIGINAL PAPERS • 上一篇    

Optimization of Random Feature Method in the High-Precision Regime

Jingrun Chen1,2, Weinan E3,4, Yifei Sun5   

  1. 1. School of Mathematical Sciences and Suzhou Institute for Advanced Research, Suzhou 215006, Jiangsu, China;
    2. University of Science and Technology of China, Hefei 230026, Anhui, China;
    3. Center for Machine Learning Research and School of Mathematical Sciences, Peking University, Beijing 100871, China;
    4. AI for Science Institute, Beijing 100084, China;
    5. School of Mathematical Sciences, Soochow University, Suzhou 215006, Jiangsu, China
  • 收稿日期:2023-03-01 修回日期:2024-02-12 接受日期:2024-02-13 出版日期:2024-03-30 发布日期:2024-03-30
  • 通讯作者: Yifei Sun,E-mail:yfsun99@stu.suda.edu.cn;Jingrun Chen,E-mail:jingrunchen@ustc.edu.cn;Weinan E,E-mail:weinan@math.pku.edu.cn E-mail:yfsun99@stu.suda.edu.cn;jingrunchen@ustc.edu.cn;weinan@math.pku.edu.cn
  • 基金资助:
    The work is supported by the NSFC Major Research Plan—Interpretable and General-purpose Next-generation Artificial Intelligence (No. 92370205).

Optimization of Random Feature Method in the High-Precision Regime

Jingrun Chen1,2, Weinan E3,4, Yifei Sun5   

  1. 1. School of Mathematical Sciences and Suzhou Institute for Advanced Research, Suzhou 215006, Jiangsu, China;
    2. University of Science and Technology of China, Hefei 230026, Anhui, China;
    3. Center for Machine Learning Research and School of Mathematical Sciences, Peking University, Beijing 100871, China;
    4. AI for Science Institute, Beijing 100084, China;
    5. School of Mathematical Sciences, Soochow University, Suzhou 215006, Jiangsu, China
  • Received:2023-03-01 Revised:2024-02-12 Accepted:2024-02-13 Online:2024-03-30 Published:2024-03-30
  • Contact: Yifei Sun,E-mail:yfsun99@stu.suda.edu.cn;Jingrun Chen,E-mail:jingrunchen@ustc.edu.cn;Weinan E,E-mail:weinan@math.pku.edu.cn E-mail:yfsun99@stu.suda.edu.cn;jingrunchen@ustc.edu.cn;weinan@math.pku.edu.cn
  • Supported by:
    The work is supported by the NSFC Major Research Plan—Interpretable and General-purpose Next-generation Artificial Intelligence (No. 92370205).

摘要: Machine learning has been widely used for solving partial differential equations (PDEs) in recent years, among which the random feature method (RFM) exhibits spectral accuracy and can compete with traditional solvers in terms of both accuracy and efficiency. Potentially, the optimization problem in the RFM is more difficult to solve than those that arise in traditional methods. Unlike the broader machine-learning research, which frequently targets tasks within the low-precision regime, our study focuses on the high-precision regime crucial for solving PDEs. In this work, we study this problem from the following aspects: (i) we analyze the coefficient matrix that arises in the RFM by studying the distribution of singular values; (ii) we investigate whether the continuous training causes the overfitting issue; (iii) we test direct and iterative methods as well as randomized methods for solving the optimization problem. Based on these results, we find that direct methods are superior to other methods if memory is not an issue, while iterative methods typically have low accuracy and can be improved by preconditioning to some extent.

关键词: Random feature method (RFM), Partial differential equation (PDE), Least-squares problem, Direct method, Iterative method

Abstract: Machine learning has been widely used for solving partial differential equations (PDEs) in recent years, among which the random feature method (RFM) exhibits spectral accuracy and can compete with traditional solvers in terms of both accuracy and efficiency. Potentially, the optimization problem in the RFM is more difficult to solve than those that arise in traditional methods. Unlike the broader machine-learning research, which frequently targets tasks within the low-precision regime, our study focuses on the high-precision regime crucial for solving PDEs. In this work, we study this problem from the following aspects: (i) we analyze the coefficient matrix that arises in the RFM by studying the distribution of singular values; (ii) we investigate whether the continuous training causes the overfitting issue; (iii) we test direct and iterative methods as well as randomized methods for solving the optimization problem. Based on these results, we find that direct methods are superior to other methods if memory is not an issue, while iterative methods typically have low accuracy and can be improved by preconditioning to some extent.

Key words: Random feature method (RFM), Partial differential equation (PDE), Least-squares problem, Direct method, Iterative method