Communications on Applied Mathematics and Computation ›› 2024, Vol. 6 ›› Issue (2): 1270-1298.doi: 10.1007/s42967-023-00328-z

• ORIGINAL PAPERS • 上一篇    下一篇

Model Change Active Learning in Graph-Based Semi-supervised Learning

Kevin S. Miller, Andrea L. Bertozzi   

  1. Department of Mathematics, University of California, Los Angeles, 520 Portola Plaza, Los Angeles, CA 90095, USA
  • 收稿日期:2022-10-24 修回日期:2023-09-18 接受日期:2023-09-21 出版日期:2024-02-17 发布日期:2024-02-17
  • 通讯作者: Kevin S. Miller,E-mail:millerk22@g.ucla.edu;Andrea L. Bertozzi,E-mail:bertozzi@g.ucla.edu E-mail:millerk22@g.ucla.edu;bertozzi@g.ucla.edu
  • 基金资助:
    This work was supported by the DOD National Defense Science and Engineering Graduate (NDSEG) Research Fellowship and the NGA under Contract No. HM04762110003.

Model Change Active Learning in Graph-Based Semi-supervised Learning

Kevin S. Miller, Andrea L. Bertozzi   

  1. Department of Mathematics, University of California, Los Angeles, 520 Portola Plaza, Los Angeles, CA 90095, USA
  • Received:2022-10-24 Revised:2023-09-18 Accepted:2023-09-21 Online:2024-02-17 Published:2024-02-17
  • Contact: Kevin S. Miller,E-mail:millerk22@g.ucla.edu;Andrea L. Bertozzi,E-mail:bertozzi@g.ucla.edu E-mail:millerk22@g.ucla.edu;bertozzi@g.ucla.edu
  • Supported by:
    This work was supported by the DOD National Defense Science and Engineering Graduate (NDSEG) Research Fellowship and the NGA under Contract No. HM04762110003.

摘要: Active learning in semi-supervised classification involves introducing additional labels for unlabelled data to improve the accuracy of the underlying classifier. A challenge is to identify which points to label to best improve performance while limiting the number of new labels. “Model Change” active learning quantifies the resulting change incurred in the classifier by introducing the additional label(s). We pair this idea with graph-based semi-supervised learning (SSL) methods, that use the spectrum of the graph Laplacian matrix, which can be truncated to avoid prohibitively large computational and storage costs. We consider a family of convex loss functions for which the acquisition function can be efficiently approximated using the Laplace approximation of the posterior distribution. We show a variety of multiclass examples that illustrate improved performance over prior state-of-art.

关键词: Active learning, Graph-based methods, Semi-supervised learning (SSL), Graph Laplacian

Abstract: Active learning in semi-supervised classification involves introducing additional labels for unlabelled data to improve the accuracy of the underlying classifier. A challenge is to identify which points to label to best improve performance while limiting the number of new labels. “Model Change” active learning quantifies the resulting change incurred in the classifier by introducing the additional label(s). We pair this idea with graph-based semi-supervised learning (SSL) methods, that use the spectrum of the graph Laplacian matrix, which can be truncated to avoid prohibitively large computational and storage costs. We consider a family of convex loss functions for which the acquisition function can be efficiently approximated using the Laplace approximation of the posterior distribution. We show a variety of multiclass examples that illustrate improved performance over prior state-of-art.

Key words: Active learning, Graph-based methods, Semi-supervised learning (SSL), Graph Laplacian