Selected article for: "deep layer and neural network"

Author: Guo, Ping; Huang, Kaizhu; Xu, Zenglin
Title: Partial Differential Equations is All You Need for Generating Neural Architectures -- A Theory for Physical Artificial Intelligence Systems
  • Cord-id: c9m0cvsu
  • Document date: 2021_3_10
  • ID: c9m0cvsu
    Snippet: In this work, we generalize the reaction-diffusion equation in statistical physics, Schr\"odinger equation in quantum mechanics, Helmholtz equation in paraxial optics into the neural partial differential equations (NPDE), which can be considered as the fundamental equations in the field of artificial intelligence research. We take finite difference method to discretize NPDE for finding numerical solution, and the basic building blocks of deep neural network architecture, including multi-layer pe
    Document: In this work, we generalize the reaction-diffusion equation in statistical physics, Schr\"odinger equation in quantum mechanics, Helmholtz equation in paraxial optics into the neural partial differential equations (NPDE), which can be considered as the fundamental equations in the field of artificial intelligence research. We take finite difference method to discretize NPDE for finding numerical solution, and the basic building blocks of deep neural network architecture, including multi-layer perceptron, convolutional neural network and recurrent neural networks, are generated. The learning strategies, such as Adaptive moment estimation, L-BFGS, pseudoinverse learning algorithms and partial differential equation constrained optimization, are also presented. We believe it is of significance that presented clear physical image of interpretable deep neural networks, which makes it be possible for applying to analog computing device design, and pave the road to physical artificial intelligence.

    Search related documents:
    Co phrase search for related documents
    • activation function and loss function: 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12
    • adam adaptive moment estimation and adaptive moment: 1, 2
    • adam adaptive moment estimation and adaptive moment estimation: 1, 2
    • adam adaptive moment estimation and loss function: 1
    • adaptive moment and loss function: 1
    • adaptive moment estimation and loss function: 1
    • long lstm short term memory model and loss function: 1, 2