Reconfigurable optical neural networks with Plug-and-Play metasurfaces
  • Abstract

    In a very recent study, Prof. Lingling Huang and co-workers proposed and demonstrated reconfigurable optical neural networks based on cascaded metasurfaces. By fixing one metasurface and switching the other pluggable metasurfaces, the neural networks, which operate at near-infrared wavelengths, can perform distinct recognition tasks for handwritten digits and fashion products. This innovative device opens up an avenue for all-optical, high-speed, low-power, and multi-functional artificial intelligence systems.

  • The P-DNNs were designed and optimized in a similar way in training machine learning models, encompassing the introduction of a loss function and the employment of the stochastic gradient descent and error backpropagation algorithms. The objective of the training process was to maximize the light intensity at the corresponding detection region for a given class of objects, and minimize the total signal in other regions. In order to reduce the training time for different tasks, the authors used transfer learning. First, the MNIST dataset was utilized to train and optimize the phase parameters of the shared layer (the first metasurface) and the handwritten digits classification layer (the second metasurface) with 10 training epochs. Then, the parameters of the shared layer were fixed, and the Fashion-MNIST dataset was used to train the fashion classification layer (the third metasurface). This process only took 5 training epochs to reach high classification accuracy thanks to transfer learning. The phase profiles essential for the three metasurfaces were achieved by employing Pancharatnam–Berry phase modulation for circularly polarized light. This was realized by rotating individual silicon nanofins to specific angles. The silicon metasurfaces, each 500×500 μm2 in size, were fabricated by standard electron beam lithography. The period was set as 500 nm, and the length, width and height of nanofin were 210, 135 and 600 nm, respectively.

    Over the past years, researchers have demonstrated ONNs using distinct platforms, including photonic integrated circuits, diffractive optical elements, and optical metasurfaces. However, most of the demonstrated ONNs lack reconfigurability. Their functionalities are fixed once they are trained and fabricated. In a very work published in Opto-Electronic Advances, Lingling Huang et al. conceptually proposed and experimentally demonstrated pluggable diffractive neural networks (P-DNNs) operating in the near-infrared region. P-DNNs can switch between different tasks, such as classifying handwritten digits and fashion products, by simply interchanging the pluggable components of the networks. As a result, we can significantly enhance the flexibility and adaptation of ONNs while effectively reducing computing resources and training time in the network design.

    Figure 1(b) shows the representative results and performances of the P-DNNs. The left column presents the inputs of P-DNNs, which are handwritten digits and fashion images. In the experiment, circularly polarized light carrying the input information was generated by a digital micromirror device. It passed through the P-DNNs, in which the second layer could be readily switched depending on the classification tasks. The light intensity on the output plane was captured by a camera, as shown in the middle column. For specific input images, the region with the highest energy corresponds to the result of classification. The right column in Fig. 1(b) shows energy distribution on the detection plane, both the experimental and simulation results, for handwritten digits and fashion. The authors introduced ΔE = Emax (maximum energy) - Esmax (second maximum energy) as an indicator to verify the classification accuracy and effectiveness. The results confirm that the P-DNNs can correctly perform classification, although the maximum energy distribution in the experimental was a bit lower than the simulation results due to the inevitable fabrication errors and misalignment of the two metasurface layers in optical characterization. Overall, the accuracy for handwritten digit classification was 90% in experiment and 91.8% in simulation. The fashion classification accuracy reached 90% in experiment and 90.2% in simulation.

    Drawing inspiration from signal processing in the nervous system, artificial neural networks (ANNs) have demonstrated their indispensability in a variety of tasks, such as computer vision, natural language processing, new materials discovery, and medical diagnosis. However, the prevailing von Neumann architectures in modern computers hinder the efficient utilization of ANNs, prompting extensive efforts toward enhancing the computation speed and accuracy. One particularly promising solution is optical neural networks (ONNs), which shift from real-number matrix operations to complex-number logical computing. ONNs offer inherent advantages in power efficiency, speed, parallelism, bandwidth, and scalability, in comparison to their conventional digital and electronic counterparts. A notable feature of ONNs is their passive functionality, relying solely on input light energy, thus eliminating the need for additional power consumption during the computation processes. Moreover, ONNs could seamlessly integrate diverse functionalities across different wavelengths and polarizations, which can operate independently and avoid cross-talk. This inherent feature enhances the capacity of ONNs for both extensive bandwidth utilization and parallel computing capabilities.

    In summary, the proposed P-DNNs can classify diverse patterns through the seamless switching of the second plugin layer. They help to effectively overcome one major limitation of previously demonstrated ONNs, that is, inability to adapt to multiple tasks once ONNs are designed and fabricated. This new approach allows for the architecture reconfigurability of ONNs, akin to tunable and programmable metasurfaces. Moreover, the utilization of metasurfaces technology facilitates the realization of optical intelligent computing chips, enabling intelligent functions such as real-time object detection in autonomous driving systems. By leveraging wavelength and polarization multiplexing techniques, we can further advance ONNs for more versatile, energy-efficient, and adaptive optical computing technologies.

    Figure 1(a) schematically illustrates the proposed P-DNNs, which consist of two layers of metasurfaces with a separation distance of 500 μm. The first layer is a shared component to preprocess input information, and the second layer is a pluggable task-specific classification layer for different types of objects, including handwritten digits (from 0 to 5) and fashions (T-shirts, trousers, coats, sneakers, bags, and ankle boots). When light illuminates distinct objects and passes through the P-DNNs, it converges on different regions of the detection plane, and hence classification is achieved. The metasurfaces consist of rectangular silicon nanofins with deliberately designed geometries and rotation angles, as shown in the inset of Fig. 1(a). Individual nanofins function as optical neurons, and the neurons on the adjacent metasurface layers are connected through diffraction.

    Figure 1. (a) Schematic illustration of P-DNNs, which can perform recongnization of handwritten digits and fashion by switching the pluggable classification layer. The inset at the bottom right corner shows the scanning electron micrograph of the fabricated metasurface. (b) Characterization and performance of P-DNNs. Left column: handwritten digits and fashion input images. Middle column: experimentally detected energy distribution maps for handwritten digits and fashion. Right column: experimental and simulation results of energy distribution for handwritten digits and fashion. ΔE represents the difference between the percentage of maximum and second maximum energy. The figures are adapted from ref.8 with modification.
    Full-Size Img PowerPoint

    (a) Schematic illustration of P-DNNs, which can perform recongnization of handwritten digits and fashion by switching the pluggable classification layer. The inset at the bottom right corner shows the scanning electron micrograph of the fabricated metasurface. (b) Characterization and performance of P-DNNs. Left column: handwritten digits and fashion input images. Middle column: experimentally detected energy distribution maps for handwritten digits and fashion. Right column: experimental and simulation results of energy distribution for handwritten digits and fashion. ΔE represents the difference between the percentage of maximum and second maximum energy. The figures are adapted from ref. with modification.

    Y. M. Liu acknowledges the financial support of the National Science Foundation (ECCS-1916839 and DMR-2202268).

  • References

    [1]

    Aggarwal CC. Neural Networks and Deep Learning (Springer, Cham, 2018).

    Google Scholar

    [2]

    Wetzstein G, Ozcan A, Gigan S et al. Inference in artificial intelligence with deep optics and photonics. Nature 588, 39–47 (2020).

    DOI: 10.1038/s41586-020-2973-6

    CrossRef Google Scholar

    [3]

    Shastri BJ, Tait AN, de Lima TF et al. Photonics for artificial intelligence and neuromorphic computing. Nat Photonics 15, 102–114 (2021).

    DOI: 10.1038/s41566-020-00754-y

    CrossRef Google Scholar

    [4]

    Shen YC, Harris NC, Skirlo S et al. Deep learning with coherent nanophotonic circuits. Nat Photonics 11, 441 (2017).

    DOI: 10.1038/nphoton.2017.93

    CrossRef Google Scholar

    [5]

    Ashtiani F, Geers AJ, Aflatouni F. An on-chip photonic deep neural network for image classification. Nature 606, 501–506 (2022).

    DOI: 10.1038/s41586-022-04714-0

    CrossRef Google Scholar

    [6]

    Lin X, Rivenson Y, Yardimci NT et al. All-optical machine learning using diffractive deep neural networks. Science 361, 1004–1008 (2018).

    DOI: 10.1126/science.aat8084

    CrossRef Google Scholar

    View full references list
  • Cited by

    Periodical cited type(2)

    1. Yan, H., Sun, Y., Yu, Y. et al. Multi-Functional Optical Neural Network in a Disordered Medium. Laser and Photonics Reviews, 2025. DOI:10.1002/lpor.202501407
    2. Deng, L., Cai, Z., Liu, Y. Functionality Expansion of Guided Mode Radiation via On-Chip Metasurfaces. Nano Letters, 2024, 24(29): 9042-9049. DOI:10.1021/acs.nanolett.4c02231

    Other cited types(0)

  • Author Information

  • Copyright

    Open Access. © The Author(s). This article is licensed under a Creative Commons Attribution 4.0 International License. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.
  • About this Article

    DOI: 10.29026/oea.2024.240057
    Cite this Article
    Yongmin Liu, Yuxiao Li. Reconfigurable optical neural networks with Plug-and-Play metasurfaces. Opto-Electronic Advances 7, 240057 (2024). DOI: 10.29026/oea.2024.240057
    Download Citation
    Article History
    • Received Date April 10, 2024
    • Accepted Date April 12, 2024
    • Available Online June 03, 2024
    • Published Date July 15, 2024
    Article Metrics
    Article Views(1813) PDF Downloads(603)
    Share:
  • Related Articles

[1]

Aggarwal CC. Neural Networks and Deep Learning (Springer, Cham, 2018).

Google Scholar

[2]

Wetzstein G, Ozcan A, Gigan S et al. Inference in artificial intelligence with deep optics and photonics. Nature 588, 39–47 (2020).

DOI: 10.1038/s41586-020-2973-6

CrossRef Google Scholar

[3]

Shastri BJ, Tait AN, de Lima TF et al. Photonics for artificial intelligence and neuromorphic computing. Nat Photonics 15, 102–114 (2021).

DOI: 10.1038/s41566-020-00754-y

CrossRef Google Scholar

[4]

Shen YC, Harris NC, Skirlo S et al. Deep learning with coherent nanophotonic circuits. Nat Photonics 11, 441 (2017).

DOI: 10.1038/nphoton.2017.93

CrossRef Google Scholar

[5]

Ashtiani F, Geers AJ, Aflatouni F. An on-chip photonic deep neural network for image classification. Nature 606, 501–506 (2022).

DOI: 10.1038/s41586-022-04714-0

CrossRef Google Scholar

[6]

Lin X, Rivenson Y, Yardimci NT et al. All-optical machine learning using diffractive deep neural networks. Science 361, 1004–1008 (2018).

DOI: 10.1126/science.aat8084

CrossRef Google Scholar

[7]

Luo Y, Mengu D, Yardimci NT et al. Design of task-specific optical systems using broadband diffractive neural networks. Light Sci Appl 8, 112 (2019).

DOI: 10.1038/s41377-019-0223-1

CrossRef Google Scholar

[8]

Luo XH, Hu YQ, Ou XN et al. Metasurface-enabled on-chip multiplexed diffractive neural networks in the visible. Light Sci Appl 11, 158 (2022).

DOI: 10.1038/s41377-022-00844-2

CrossRef Google Scholar

[9]

He C, Zhao D, Fan F et al. Pluggable multitask diffractive neural networks based on cascaded metasurfaces. Opto-Electron Adv 7, 230005 (2024).

DOI: 10.29026/oea.2024.230005

CrossRef Google Scholar

[10]

Ma W, Xu YH, Xiong B et al. Pushing the limits of functionality-multiplexing capability in metasurface design based on statistical machine learning. Adv Mater 34, 2110022 (2022).

DOI: 10.1002/adma.202110022

CrossRef Google Scholar

[11]

Xiong B, Liu Y, Xu YH et al. Breaking the limitation of polarization multiplexing in optical metasurfaces with engineered noise. Science 379, 294–299 (2023).

DOI: 10.1126/science.ade5140

CrossRef Google Scholar

Related Articles
Show full outline

Catalog

    Yuxiao Li

    1. On this Site
    2. On Google Scholar
    3. On PubMed
    Reconfigurable optical neural networks with Plug-and-Play metasurfaces
    • Figure  1

      (a) Schematic illustration of P-DNNs, which can perform recongnization of handwritten digits and fashion by switching the pluggable classification layer. The inset at the bottom right corner shows the scanning electron micrograph of the fabricated metasurface. (b) Characterization and performance of P-DNNs. Left column: handwritten digits and fashion input images. Middle column: experimentally detected energy distribution maps for handwritten digits and fashion. Right column: experimental and simulation results of energy distribution for handwritten digits and fashion. ΔE represents the difference between the percentage of maximum and second maximum energy. The figures are adapted from ref.8 with modification.

    • Figure  1