Communications on Applied Mathematics and Computation ›› 2025, Vol. 7 ›› Issue (2): 718-732.doi: 10.1007/s42967-024-00377-y

Previous Articles     Next Articles

Immunity-Based Orthogonal Weights Modification Algorithm

Xiaotao Liang1, Shuo Yin1, Xing Zhao1,2, XuYing Zhao1   

  1. 1 School of Mathematical Sciences, Capital Normal University, Beijing 100048, China;
    2 Beijing Advanced Innovation Center for Imaging Technology, Capital Normal University, Beijing 100048, China
  • Received:2023-10-22 Revised:2024-01-24 Accepted:2024-01-26 Online:2025-06-20 Published:2025-04-21
  • Supported by:
    The research is supported by the Beijing Natural Science Foundation, China (Grant/ Award Number: Z210003), the National Nature Science Foundation of China (NSFC) (Grant/Award Numbers: 12071313, 61827809), the key research project of the Academy for Multidisciplinary Studies, Capital Normal University, China, the National Key Research and Development Program of China (Grant/ Award Number: 2020YFA0712200), and the Major Technologies R & D Program of Shenzhen, China (JSGGZD20220822095600001).

Abstract: Recently, the catastrophic forgetting problem of neural networks in the process of continual learning (CL) has attracted more and more attention with the development of deep learning. The orthogonal weight modification (OWM) algorithm to some extent overcomes the catastrophic forgetting problem in CL. It is well-known that the mapping rule learned by the network is usually not accurate in the early stage of neural network training. Our main idea is to establish an immune mechanism in CL, which rejects unreliable mapping rules at the beginning of the training until those are reliable enough. Our algorithm showed a very good competitive advantage in the permuted and disjoint MNIST tasks and disjoint CIFAR-10 tasks. As for the more challenging task of Chinese handwriting character recognition, our algorithm showed a notable improvement compared with the OWM algorithm. In view of the context-dependent processing (CDP) module in [37], we revealed that the module may result in a loss of information and we proposed a modified CDP module to overcome this weakness. The performance of the system with the modified CDP module outperforms the original one in the CelebFaces attributed recognition task. Besides continual multi-task, we also considered a single task, where the immunity-based OWM (IOWM) algorithm was designed as an optimization solver of neural networks for low-dose computed tomography (CT) denoising task.

Key words: Image processing, Neural network, Continual learning (CL), Computed tomography (CT)

CLC Number: