Integrated photonic convolution acceleration core for wearable devices

Baiheng Zhao; Junwei Cheng; Bo Wu; Dingshan Gao; Hailong Zhou; Jianji Dong

doi:10.29026/oes.2023.230017

Abstract

With the advancement of deep learning and neural networks, the computational demands for applications in wearable devices have grown exponentially. However, wearable devices also have strict requirements for long battery life, low power consumption, and compact size. In this work, we propose a scalable optoelectronic computing system based on an integrated optical convolution acceleration core. This system enables high-precision computation at the speed of light, achieving 7-bit accuracy while maintaining extremely low power consumption. It also demonstrates peak throughput of 3.2 TOPS (tera operations per second) in parallel processing. We have successfully demonstrated image convolution and the typical application of an interactive first-person perspective gesture recognition application based on depth information. The system achieves a comparable recognition accuracy to traditional electronic computation in all blind tests.

Keywords

FullText(HTML)

Introduction

Combined with field programmable gate array (FPGA) control, we utilized this system to conduct parallel convolution for edge detection experiments. Subsequently, we shifted our focus to a typical application in the wearable device domain: first-person perspective gesture recognition tasks. This system enables high-speed computation with 7-bit precision when PCAC chip loading weights, and achieves the same accuracy as traditional electronic computation in blind testing for gesture recognition. It provides an effective approach for wearable devices to achieve complex computational tasks accurately and efficiently while ensuring low power consumption and miniaturization.

In this work, a viable solution has been proposed to address the power consumption and computational speed limitations in wearable devices. The solution is based on an integrated photonic convolution acceleration core (PCAC) with a reconfigurable MRR array that has self-calibration functionality²⁷.

Wearable devices, characterized by their portability and strong human interaction capabilities, have long represented the future of technology and innovation¹. Within the realm of wearable devices, numerous recognition tasks rely on machine vision, such as vehicle detection², human pose recognition^3-6, and facial recognition^{2, 7-9}. These applications primarily rely on the forward propagation of deep learning algorithms to accomplish classification and recognition tasks. However, as the complexity of these applications increases¹⁰, the demand for computational power, low power consumption, low heat generation, and high efficiency in wearable devices becomes increasingly challenging to traditional electronic computing because Moore's Law is reaching its limits¹¹. As a result, alternative solutions are imperative.

In recent years, research on optical neural networks (ONNs) has emerged as a potential breakthrough solution to address the bottlenecks of electronic computing^{12, 13}. By mapping the mathematical models of neural networks to analog optical devices, ONNs can achieve computational capabilities superior to electronic computing because optical transmission networks offer the potential for ultra-low power consumption and minimal heat generation¹⁴. This makes them well-suited for meeting the energy consumption and heat dissipation requirements of wearable devices. Several ONN architectures have been reported in current researches, including diffractive spatial light networks (DNNs)^15-17, wavelength division multiplexing (WDM) based on fiber dispersion^18,19, and array modulation using Mach-Zehnder interferometers (MZIs)^20-23. While diffractive optical network elements have a large scale of neurons, they are typically bulky and not suitable for integration, and the refresh rate is low. Fiber dispersion-based wavelength division multiplexing schemes also face challenges in the miniaturization of long fiber and precise control of delay dispersion in large-scale networks. Although MZI devices can be implemented for on-chip integration, their relatively large footprint does not provide a significant advantage for large-scale expansion. None of these methods offer substantial advantages in meeting the requirements of future wearable devices. In contrast, the array-based approach using micro-ring resonator (MRR) devices exhibits several advantages that are well-aligned with the breakthrough requirements in wearable device research. MRR arrays are compact and easily integrated, allowing for high-precision and complex calculations through one-to-one assignment during parameter configuration^24-26. This makes them suitable for small-size and large-scale applications, meeting the demands of current wearable device research.

Results

The principle

Figure 1. Full-Size Img PowerPoint

Schematic of a computing system based on the integrated convolution acceleration core (PCAC) chip.

Figure 1 illustrates the principle of the convolutional acceleration system. The proposed convolution acceleration system is capable of performing the multiplication and addition operations of a matrix A of M×N and a vector B of N×1. The vector B is composed of N channels of light signals with different wavelengths. These signals are encoded using an intensity modulator array, where each channel is loaded with a different intensity of light signal. Specifically, for convolving an image with a 4×4 convolutional kernel, we take the image and arrange its elements in groups of four at each row, transposing them into column vectors. These column vectors serve as the encoded information input to the modulators, which are then fed into the PCAC chip. Within the PCAC chip, each column vector is multiplied and summed with the corresponding four MRRs in each row, producing the convolution operation results. The input data then slides down by one stride step, and the next set of four elements is extracted and transposed as the next input signal, continuing the operation with the PCAC chip. We repeat this process and encode all the extracted data into four data streams, which serve as the input for the intensity modulators. The multiplexed signals are then coupled into the PCAC chip through optical fiber. In the PCAC chip, the M×N MRR array is utilized, where each element of matrix A corresponds to an MRR operating at a different resonant wavelength. Under the operation of our developed self-calibrated MRR array, the final computation result is obtained by weighted summation using balanced photodetectors (BPDs), yielding the difference of optical power as the output vector C. The convolution result can be obtained by recovering the data with the assistance of FPGA. During the data recovery and reconstruction process, since each input column vector in the PCAC chip undergoes a simultaneous multiplication and accumulation operation with all rows of the convolutional kernel, the results of the computational operations need to be summed along the diagonal to obtain a single element of the actual convolution result, and this represents the completion of one convolution operation. It is worth noting that due to the one-to-one correspondence between the MRR array and the matrix elements, it is theoretically possible to simultaneously configure multiple convolution kernels and perform convolution operations on data streams representing multiple images. This scalability provides excellent support for large-scale parallelism in optoelectronic computation. Further details of the experiments will be discussed in subsequent sections.

The fabrication and characterization of PCAC chip

Figure 2(a) shows the PCAC chip, which is fabricated using a typical 220 nm silicon-on-insulator (SOI) integration process, a standard technique in chip manufacturing. This proof-of-concept chip has a compact size of 2.6 mm × 2.0 mm, comprising a 4×4 array of MRR synapses, forming the core of the computing system. These synapses play a crucial role in the chip's computational power. Additionally, a thermally tunable MRR with TiN (titanium nitride) heaters acts as the computational control module of the PCAC chip. This tunable MRR enables precise manipulation of the resonance wavelength, a critical aspect for accurate calculations. To facilitate accurate voltage control of the MRR synapses, meticulous design considerations have been incorporated. We implement specifically tailored FPGA circuit for the chip's requirements, along with a high-resolution digital-to-analog converter circuit capable of programmable voltage outputs at a remarkable 16-bit resolution, enables fine-grained control of the MRR synapses. In order to ensure stability and reliability, the chip incorporates a thermo-electric cooler (TEC) module at its base. This TEC module plays a pivotal role in maintaining a stable temperature environment for the chip, further enhancing the accuracy of its computations. On the left side of the chip, an optical signal output module is meticulously designed, featuring fiber optic packaging for seamless integration with external systems.

Figure 2. (a) Detailed photos of the packaged layout chip show the MRR array in the center, with the photonics chip on the right combined with the leads of the customized printed circuit board (PCB) for computation and control. On the left, there is an optical input/output port using a fiber V-groove, and the entire assembly is mounted on a TEC for heat dissipation. (b) The micrograph of the MRR array and detailed photo of a single MRR. (c) The transmission spectra of the MRR array. Different voltages (800–1800 mV, 100 mV/step) are applied to the third MRR. Similar results can be obtained when the voltage is applied to other MRRs. (d) The transmission rate of a single IM on the chip under voltage tuning. These curves represent the normalized W-V mapping. (e) The transmission rate of a single MRR on the chip under voltage tuning. These curves represent the normalized W-V mapping.

Figure 2. Full-Size Img PowerPoint

(a) Detailed photos of the packaged layout chip show the MRR array in the center, with the photonics chip on the right combined with the leads of the customized printed circuit board (PCB) for computation and control. On the left, there is an optical input/output port using a fiber V-groove, and the entire assembly is mounted on a TEC for heat dissipation. (b) The micrograph of the MRR array and detailed photo of a single MRR. (c) The transmission spectra of the MRR array. Different voltages (800–1800 mV, 100 mV/step) are applied to the third MRR. Similar results can be obtained when the voltage is applied to other MRRs. (d) The transmission rate of a single IM on the chip under voltage tuning. These curves represent the normalized W-V mapping. (e) The transmission rate of a single MRR on the chip under voltage tuning. These curves represent the normalized W-V mapping.

Based on this method, we have established a look-up table for the weight-voltage mapping of the modulator and MRR array. For modulator calibration, the laser operating wavelength is chosen away from the MRR resonance peak for one path of the MRR array. The reference voltage of the MRR array is fixed, and the voltage applied to the modulator is incrementally adjusted in a step of 0.1 V. The optical power of the pass-through end (THRU) is detected using a balanced photodetector (BPD), allowing the construction of a P-V curve that represents the relationship between power and modulator voltage. After differential and normalization operations, a weight-voltage (W-V) curve is established that describes the relationship between input data weights and modulator voltage. Figure 2(d) displays the W-V curve obtained from the calibration of one path of the PCAC chip's modulator. For MRR array calibration, the laser operating wavelength is adjusted to a region close to the resonance wavelength of each MRR. The modulator input voltage is kept constant while the MRR tuning voltage is adjusted, causing each MRR to redshift with the laser wavelength. Throughout this process, the optical power of the pass-through end is continuously detected, enabling the construction of a P-V curve that represents the relationship between power and MRR tuning voltage. After differential and normalization operations, a W-V curve is established that describes the relationship between convolutional kernel weights and modulator voltage. Figure 2(e) illustrates the W-V curve obtained from the calibration of one MRR in the PCAC chip.

Moving to the microscopic level, Fig. 2(b) offers an up-close view of the MRR synapses within the array. Additionally, an enlarged microscopic photograph showcases the intricate details of a single MRR. To facilitate efficient electrical and optical input/output (I/O) connections, the chip's design incorporates advanced packaging techniques. Both wire bonding and fiber array have been thoughtfully integrated, ensuring reliable and high-performance I/O connections for both electrical and optical signals. Figure 2(c) illustrates the tuning curve of the pass-through end of an MRR as a function of applied voltage. Increasing the voltage on the MRR leads to a redshift in the resonance wavelength. It can also be observed that when changing the resonance peak of one MRR, the transmission spectra of the other MRRs remain almost unchanged. This indicates that the crosstalk between the MRRs in the array during precise tuning is negligible. To ensure the computational precision of the PCAC chip, we have developed a self-calibration procedure that works in conjunction with the circuit hardware to monitor and calibrate the weights of the on-chip MRRs²⁷. This calibration process enables us to achieve a precision of 7 bits during the actual loading process (specific evaluation criteria can be found in ref.²⁸).

Operation for convolution and edge detection

In order to verify the convolutional computing capability of the PCAC chip within our system, we conducted a series of experiments using the widely recognized "cameraman" image as a standard test case. Figure 3(a) provides a comprehensive overview of the experimental setup, illustrating the key components involved in this proof-of-concept study. During the experiment, we employed a 3×3 MRR array as a convolutional kernel weight loading device, perfectly matching the size of the 3×3 convolutional kernel used. The input image, a grayscale image with dimensions of 256×256 pixels, was initially flattened into a one-dimensional vector. To achieve high-speed processing, we adopted an intensive parallel processing approach, where every three elements of the vector were grouped together and loaded onto the intensity modulator (IM). This allowed us to stream the data into the system in a synchronized manner. Once the data was serialized, it was channeled into the PCAC chip, which served as the core processing unit. Within the PCAC chip, each ring was dedicated to a specific convolutional kernel. The input values were fed through the pass-through end (THRU) and underwent multiplication and addition (MAC) operations along each row. Ultimately, the results of the convolutional operations were transmitted to a balanced photodetector via the drop port (DROP), where optical power was acquired for further analysis. Figure 3(b) shows the original image used in the edge detection test, which is a 256×256 pixels image of a cameraman. To better understand the impact of the convolutional kernels, Fig. 3(c) shows three specific types used for edge detection: Bottom sobel, Top sobel, and Left sobel. These kernels were designed to detect vertical and horizontal edges within the image. Figure 3(d) visually presents the outcome of applying these three edge detection operations, representing the result of a single convolutional operation. The experimental results provided substantial evidence to support the effectiveness of utilizing the PCAC chip within an optoelectronic system for parallel convolutional computing.

Figure 3. (a) Experimental setup of the PCAC chip for performing convolutional operations. (b) Original image used for demonstrating the convolution effect. (c) Convolution kernels used: Bottom sobel, Top sobel, Left sobel. (d) Corresponding convolution image results.

Figure 3. Full-Size Img PowerPoint

(a) Experimental setup of the PCAC chip for performing convolutional operations. (b) Original image used for demonstrating the convolution effect. (c) Convolution kernels used: Bottom sobel, Top sobel, Left sobel. (d) Corresponding convolution image results.

Application of first-person depth-based gesture recognition using PCAC chip

Figure 4. (a) Schematic diagram of the convolutional neural network (CNN) architecture suitable for first-person digit gesture recognition with depth information. (b) Probability of recognition for the 10 gestures after performing the convolutional layer computation using the PCAC chip as a replacement for the computer.

Figure 4. Full-Size Img PowerPoint

(a) Schematic diagram of the convolutional neural network (CNN) architecture suitable for first-person digit gesture recognition with depth information. (b) Probability of recognition for the 10 gestures after performing the convolutional layer computation using the PCAC chip as a replacement for the computer.

Figure 5. (a) Scatter plot comparing measured results with calculated results for Gesture 2. (b) Probability distribution of the error offset in the experimental results, resembling a Gaussian curve. (c) Offset of each point during the computation process. (d) Results of the first layer convolution computation obtained through electronic computation. (e) Results of the first layer convolution computation obtained through optical-electronic computation using the PCAC chip.

Figure 5. Full-Size Img PowerPoint

(a) Scatter plot comparing measured results with calculated results for Gesture 2. (b) Probability distribution of the error offset in the experimental results, resembling a Gaussian curve. (c) Offset of each point during the computation process. (d) Results of the first layer convolution computation obtained through electronic computation. (e) Results of the first layer convolution computation obtained through optical-electronic computation using the PCAC chip.

In this part, we further explore its performance in practical applications for devices. First-person perspective gesture recognition is one of the most widespread applications for wearable devices, such as virtual reality (VR) and augmented reality (AR) glasses, Remote Healthcare Monitoring devices²⁹. Taking this into account, we have developed a digital gesture recognition application that incorporates depth information, specifically designed for wearable devices. This application is capable of recognizing hand gestures representing digits from 0 to 9. We utilized the EgoGesture dataset³⁰, released by the Institute of Automation, Chinese Academy of Sciences in 2017. Each gesture was represented by 1500 training images and 300 testing images, resulting in a total of 18000 images as our dataset. We trained the artificial intelligence model on a computer. Figure 4(a) illustrates the main structure of the convolutional neural network (CNN) used in our application. Depth images captured by the SR300 depth camera were used as input data, with a gesture image size of 32×32×1. The first layer consisted of 16 convolutional kernels, each with a size of 3×3. The convolutional operations were performed entirely by the PCAC chip. Similar to the previous experiments, input images were reshaped into three rows of data and streamed into the PCAC chip, where they were convoluted with the loaded kernels. After one convolutional layer, the output size became 30×30×16. With the assistance of a computer system, the output data were processed by the activation function (ReLU) and then injected into a pooling layer for downsampling. Subsequently, two more convolutional layers, maximum pooling layers, and fully connected layers were applied, resulting in the final recognition of the 0–9 numeral gestures. Figure 4(b) displays a bar graph showing the recognition results of the ten gestures calculated by the PCAC chip. The horizontal axis represents the ten gestures, while the vertical axis represents the probability of recognizing each numeral. In the 10 recognition samples for digits 0–9, except for digits 2, 3, and 8, where there are probability distributions with both main and secondary peaks, the rest of the digits show single peak recognition. This indicates that the PCAC chip enables accurate recognition tasks. It is worth noting that for electronic computation, the model achieves a recognition accuracy of 91.14% in blind testing. Similarly, when using the PCAC chip for optoelectronic computation, all the blind test images yield the same recognition accuracy as those obtained through electronic computation. The graph demonstrates that the PCAC chip successfully implemented convolutional operations and achieved accurate recognition of depth-based numeral gestures.

To further investigate the performance of the PCAC chip in computational tasks, we conducted a more detailed analysis of the experimental results. Figure 5(a) compares the experimental results obtained by performing convolutional calculations using PCAC and the theoretical results obtained using digital computers for the recognition of Gesture 2. The scatter points exhibit a tight distribution along the diagonal line, which corresponds to the theoretical expectations. Figure 5(b) displays a histogram showing the probability distribution of the offsets (experimental values minus theoretical values) for all data points. The histogram exhibits a distribution similar to a Gaussian distribution, with the highest probability of offset near zero. Figure 5(c) shows the recorded offsets for each calculation sample during the computation process. The offsets are mostly distributed around zero and exhibit a stable and uniform distribution without significant fluctuations. Figure 5(d) and 5(e) provide visual comparisons between the theoretical results (computed by a computer) and the experimental results (obtained using the PCAC chip) after the first-layer convolutional operation specifically for the gesture representing the numeral 2. Apart from some variations in background color caused by experimental noise, the results obtained by the PCAC chip for convolutional computations are nearly identical to those obtained by the computer. In summary, the analysis reveals that the PCAC chip demonstrates high accuracy and stability in computational tasks when compared to theoretical calculations. The visual comparisons also confirm the consistency between the results obtained by the PCAC chip and those obtained by a conventional computer. These findings underscore the potential of the PCAC chip as a viable alternative for accelerating and improving recognition and classification tasks.

Discussion

Energy efficiency estimation

Estimated power consumption of the proof-of-concept system.

Components	Voltage(V)	Current(A)	Power(W)
Lasers			~10×10^-3×4=0.04
On ring heaters	$\bar{V}\approx 2$	~5×10^-3	~0.01×16=0.16
Intensity modulator1	$\bar{V}\approx 7$	0.266	1.862
Intensity modulator2	$\bar{V}\approx 6.5$	0.278	1.807
Intensity modulator3	$\bar{V}\approx 7$	0.272	1.904
Intensity modulator4	$\bar{V}\approx 7.5$	0.259	1.943
System’s power consumption			7.716
TEC for PCAC chip	6.145	0.53	3.257
CPU			~30
Total power consumption			40.973

CSV Show Table

Benefiting from the compact size of MRR resonators, the PCAC chip achieves high integration density within a footprint of just 0.2 mm². In the meantime, it enables basic multiplication and addition operations with same recognition results as electronic computation. For a 4×4 scale PCAC chip with four parallel channels, the footprint increases to approximately 5 square millimeters, allowing for parallel convolution operations and efficient processing of more complex computational recognition tasks. However, despite these advantages, the PCAC chip design still has limitations and potential areas for improvement.

Using phase-change materials as thermal shifters can further optimize the energy efficiency of the system. With the development of tunable optical frequency combs^31-33, replacing lasers with microcombs as light sources can significantly reduce power consumption. This will unlock the full potential of the optoelectronic computing system, offering higher scalability, higher integration, and lower power consumption. It is important to note that with the development of hybrid integration and monolithic integration techniques, advancements in light sources, silicon photonic circuits, and related electronic components (including modulators, drivers, trans-impedance amplifiers (TIA), digital-to-analog converters (DAC), and analog-to-digital converters (ADC)) can be integrated onto the same motherboard or even onto a single chip. This integration trend has the potential to significantly reduce power consumption. Therefore, the power and integration performance demonstrated in this work have the potential for further enhancement, although there is still a long road ahead.

Firstly, the eternal pursuit of photonic computation lies in processing data with high speed and low power consumption. In our concept verification setup, the power consumption is primarily attributed to the laser, silicon photonic chip, modulator, TEC, and digital backend. Based on the components utilized in our measurement setup, the estimated power consumption in the computation system is approximately 7.716 W, resulting in a total power consumption of around 40.973 W. Consequently, 80% of the power is attributed to these benchtop instruments. Table 1 shows the details of power consumption.

Throughput estimation

$T({\rm{OPS}})=2m\times {N}^{2}\times r\;,$

where T represents the throughput in OPS (operations per second) excluding the time spent on off-chip signal loading during photonic computation, m is the number of layers implemented by the photonic computing hardware, N² is the size of the on-chip weight library, and r is the detection rate of the photodetector (PD). Since the PCAC chip can naturally perform multiplication and addition (MAC) operations, and each MAC operation consists of one multiplication and one addition operation, one MAC operation corresponds to two operations. With a typical photodetection rate of 100 GHz, our PCAC concept validation chip (N²=4×4) can achieve 3.2 TOPS, which still lags behind leading electronic processors such as Google's tensor processing unit (TPU)³⁴ and other chips. However, due to the chip's strong scalability, in future large-scale chips of 16×16 dimensions with auxiliary optical frequency combs as multiple light sources is possible to reach the theoretical computational power of 51.2 TOPS. This will enable outstanding performance in complex computational tasks with ultra-high integration and ultra-low power consumption, helping to alleviate the high cost of electronic computing while ensuring high computational power. It serves as an effective solution for breakthroughs in the field of wearable devices. Although, there are various challenges in photonic computing that include limitations posed by components such as ADCs, DACs, modulators, PDs, in terms of their speed and bandwidth. While these challenges are not the primary focus of our current work, they are certainly within the broader scope of the field. We believe that with concerted efforts from the entire photonic computing community, these challenges can be addressed and overcome. As the field progresses, it is reasonable to expect advancements that will lead to breakthroughs in addressing the speed and bandwidth limitations of photonic components.

Furthermore, as a key metric for evaluating computational hardware performance, throughput is defined as the number of operations per second (OPS) performed by a processor in high-performance computing (HPC) domain. The throughput of photonic computing hardware can be calculated using Eq. (1)²⁰:

Scalability

To further improve the computational performance of PCAC chips, ensuring scalability is an extremely important requirement. The main source of losses in PCAC chip arises from the coupling gratings. Therefore, the scalability of PCAC chip is not primarily limited by its loss performance. Instead, it is predominantly determined by the free spectral range (FSR) of each MRR. Since each MRR requires individual tuning, and precision is essential to avoid resonance overlap during thermal tuning for high-precision computations, the scalability within a given operational wavelength range is somewhat constrained. This constraint emerges as we conduct experiments within a specific wavelength range. Our future endeavors are aimed at addressing this limitation by designing MRR with larger FSR. This design approach will enable the development of larger-scale PCAC chips operating with a greater range of wavelengths, thereby delivering enhanced computational performance. Ultimately, this advancement will expand the horizons for exploring more complex applications in the field, offering a broader spectrum of possibilities.

Wearable application potential

Finally, it is important to note that in this work, we have only showcased one application scenario for wearable devices. In this work, we have successfully demonstrated the capabilities of optical-electrical computation in a practical context by implementing first-person perspective gesture recognition tasks using the PCAC chip and accompanying algorithms (We have provided a demo video in the attachment that showcases the real-time interaction of this application). Unlike previous tests limited to MNIST handwritten digit recognition (with small input images and a few convolutional kernels), our application involves larger input images (32×32 pixels) and a more intricate network structure (with a first-layer convolution containing 16 kernels). These factors pose a greater challenge to the sustained high-precision computational capability of the photonic chip. The successful completion of the recognition task demonstrates the photonics hardware's capacity to handle such complex tasks. Compared to the previous tasks involving simple MNIST digits or edge detection, this work holds higher practical value due to its ability to address more intricate recognition tasks. However, the photonic convolution acceleration core computational system presented here can be applied to various scenarios involving convolution operations. Especially when considering the inherent advantages of photonics such as low power consumption and minimal heat generation, which align perfectly with the requirements of wearable devices. Building upon the previously mentioned approaches, further optimizations can be pursued to enhance integration, energy efficiency, and scalability. These improvements aim to achieve higher computational power while maintaining efficiency and compactness. We believe that this computational system has the potential to play a significant role in a broader range of wearable device applications.

Conclusions

In this work, we propose a convolutional acceleration processor based on an MRR array and have successfully fabricated a prototype PCAC chip. When combined with the computational control module programmed on an FPGA, the PCAC chip is capable of performing convolution operations with a maximum precision of 7 bits. We demonstrate the application of the PCAC chip in complex gesture recognition tasks, specifically in first-person depth information gesture recognition. With parallel and precise convolution operations, we obtain the same recognition results as traditional electronic computation in all blind tests, achieving a high level of recognition accuracy. The outstanding performance in accomplishing complex recognition tasks and high-precision forward propagation tasks opens up new possibilities for intuitive human-machine interaction. Furthermore, the advantages of optical computation, including reduced power consumption and faster data processing, make this application particularly important in the development of wearable devices. Accurate and efficient gesture recognition enables seamless control and interaction with the device, enhancing user experience and convenience. Additionally, the compact and easily integrable nature of the device provides opportunities for higher computational power and lower power consumption in future large-scale expansions. These advantages offer an effective solution to address the challenges of heat dissipation and integration in wearable devices when dealing with complex, high-precision, multi-scenario computational recognition tasks. It paves the way for efficient computation by effectively surpassing the limitations of electronic processors.

Acknowledgements

This work was partially supported by the National Natural Science Foundation of China (U21A20511), the Innovation Project of Optics Valley Laboratory (OVL2021BG001).

Competing interests

The authors declare no competing financial interests.

References (34)

References

[1]	Zhang SB, Li YX, Zhang S, Shahabi F, Xia S et al. Deep learning in human activity recognition with wearable sensors: a review on advances. Sensors 22, 1476 (2022). DOI: 10.3390/s22041476 CrossRef Google Scholar
[2]	Chang WJ, Chen LB, Chiou YZ. Design and implementation of a drowsiness-fatigue-detection system based on wearable smart glasses to increase road safety. IEEE Trans Consum Electron 64, 461–469 (2018). DOI: 10.1109/TCE.2018.2872162 CrossRef Google Scholar
[3]	Ramanujam E, Perumal T, Padmavathi S. Human activity recognition with smartphone and wearable sensors using deep learning techniques: a review. IEEE Sensors J 21, 13029–13040 (2021). DOI: 10.1109/JSEN.2021.3069927 CrossRef Google Scholar
[4]	Chen KX, Zhang DL, Yao LN, Guo B, Yu ZW et al. Deep learning for sensor-based human activity recognition: overview, challenges, and opportunities. ACM Comput Surv 54, 77 (2022). DOI: 10.1145/3447744 CrossRef Google Scholar
[5]	Wang JD, Chen YQ, Hao SJ, Peng XH, Hu LS. Deep learning for sensor-based activity recognition: a survey. Pattern Recognit Lett 119, 3–11 (2019). DOI: 10.1016/j.patrec.2018.02.010 CrossRef Google Scholar
[6]	Nweke HF, Teh YW, Al-garadi MA, Alo UR. Deep learning algorithms for human activity recognition using mobile and wearable sensor networks: state of the art and research challenges. Expert Syst Appl 105, 233–261 (2018). DOI: 10.1016/j.eswa.2018.03.056 CrossRef Google Scholar
[7]	Terven JR, Raducanu B, Meza-de-Luna ME, Salas J. Head-gestures mirroring detection in dyadic social interactions with computer vision-based wearable devices. Neurocomputing 175, 866–876 (2016). DOI: 10.1016/j.neucom.2015.05.131 CrossRef Google Scholar
[8]	Perusquía-Hernández M, Hirokawa M, Suzuki K. A wearable device for fast and subtle spontaneous smile recognition. IEEE Trans Affective Comput 8, 522–533 (2017). DOI: 10.1109/TAFFC.2017.2755040 CrossRef Google Scholar
[9]	Gruebler A, Suzuki K. Design of a wearable device for reading positive expressions from facial EMG signals. IEEE Trans Affective Comput 5, 227–237 (2014). DOI: 10.1109/TAFFC.2014.2313557 CrossRef Google Scholar
[10]	Brown TB, Mann B, Ryder N, Subbiah M, Kaplan J et al. Language models are few-shot learners. In Proceedings of the 34th International Conference on Neural Information Processing Systems 159 (ACM, 2020); http://doi.org/10.5555/3495724.3495883. http://doi.org/10.5555/3495724.3495883. " target="_blank">Google Scholar
[11]	Hussein AI. Wearable computing: challenges of implementation and its future. In 2015 12th Learning and Technology Conference 14–19 (IEEE, 2015);http://doi.org/10.1109/LT.2015.7587224. http://doi.org/10.1109/LT.2015.7587224. " target="_blank">Google Scholar
[12]	Zhou HL, Dong JJ, Cheng JW, Dong WC, Huang CR et al. Photonic matrix multiplication lights up photonic accelerator and beyond. Light Sci Appl 11, 30 (2022). DOI: 10.1038/s41377-022-00717-8 CrossRef Google Scholar
[13]	Shastri BJ, Tait AN, Ferreira De Lima T, Pernice WHP, Bhaskaran H et al. Photonics for artificial intelligence and neuromorphic computing. Nat Photonics 15, 102–114 (2021). DOI: 10.1038/s41566-020-00754-y CrossRef Google Scholar
[14]	Nahmias MA, De Lima TF, Tait AN, Peng HT, Shastri BJ et al. Photonic multiply-accumulate operations for neural networks. IEEE J Sel Top Quantum Electron 26, 1–18 (2020). DOI: 10.1109/jstqe.2019.2941485 CrossRef Google Scholar
[15]	Lin X, Rivenson Y, Yardimci NT, Veli M, Luo Y et al. All-optical machine learning using diffractive deep neural networks. Science 361, 1004–1008 (2018). DOI: 10.1126/science.aat8084 CrossRef Google Scholar
[16]	Panuski CL, Christen I, Minkov M, Brabec CJ, Trajtenberg-Mills S et al. A full degree-of-freedom spatiotemporal light modulator. Nat Photonics 16, 834–842 (2022). DOI: 10.1038/s41566-022-01086-9 CrossRef Google Scholar
[17]	Li JX, Hung YC, Kulce O, Mengu D, Ozcan A. Polarization multiplexed diffractive computing: all-optical implementation of a group of linear transformations through a polarization-encoded diffractive network. Light Sci Appl 11, 153 (2022). DOI: 10.1038/s41377-022-00849-x CrossRef Google Scholar
[18]	Xu XY, Tan MX, Corcoran B, Wu JY, Boes A et al. 11 TOPS photonic convolutional accelerator for optical neural networks. Nature 589, 44–51 (2021). DOI: 10.1038/s41586-020-03063-0 CrossRef Google Scholar
[19]	Bai BW, Yang QP, Shu HW, Chang L, Yang FH et al. Microcomb-based integrated photonic processing unit. Nat Commun 14, 66 (2023). DOI: 10.1038/s41467-022-35506-9 CrossRef Google Scholar
[20]	Shen YC, Harris NC, Skirlo S, Prabhu M, Baehr-Jones T et al. Deep learning with coherent nanophotonic circuits. Nat Photonics 11, 441–446 (2017). DOI: 10.1038/nphoton.2017.93 CrossRef Google Scholar
[21]	Hughes TW, Minkov M, Shi Y, Fan SH. Training of photonic neural networks through in situ backpropagation and gradient measurement. Optica 5, 864–871 (2018). DOI: 10.1364/OPTICA.5.000864 CrossRef Google Scholar
[22]	Zhou HL, Zhao YH, Wang X, Gao DS, Dong JJ et al. Self-configuring and reconfigurable silicon photonic signal processor. ACS Photonics 7, 792–799 (2020). DOI: 10.1021/acsphotonics.9b01673 CrossRef Google Scholar
[23]	Zhang H, Gu M, Jiang XD, Thompson J, Cai H et al. An optical neural chip for implementing complex-valued neural network. Nat Commun 12, 457 (2021). DOI: 10.1038/s41467-020-20719-7 CrossRef Google Scholar
[24]	Tait AN, De Lima TF, Zhou E, Wu AX, Nahmias MA et al. Neuromorphic photonic networks using silicon photonic weight banks. Sci Rep 7, 7430 (2017). DOI: 10.1038/s41598-017-07754-z CrossRef Google Scholar
[25]	Xu SF, Wang J, Yi SC, Zou WW. High-order tensor flow processing using integrated photonic circuits. Nat Commun 13, 7970 (2022). DOI: 10.1038/s41467-022-35723-2 CrossRef Google Scholar
[26]	Huang CR, Bilodeau S, Ferreira De Lima T, Tait AN, Ma PY et al. Demonstration of scalable microring weight bank control for large-scale photonic integrated circuits. APL Photonics 5, 040803 (2020). DOI: 10.1063/1.5144121 CrossRef Google Scholar
[27]	Cheng JW, He ZM, Guo YH, Wu B, Zhou HL et al. Self-calibrating microring synapse with dual-wavelength synchronization. Photonics Res 11, 347 (2023). DOI: 10.1364/PRJ.478370 CrossRef Google Scholar
[28]	Zhang WP, Huang CR, Peng HT, Bilodeau S, Jha A et al. Silicon microring synapses enable photonic deep learning beyond 9-bit precision. Optica 9, 579–584 (2022). DOI: 10.1364/OPTICA.446100 CrossRef Google Scholar
[29]	Bandini A, Zariffa J. Analysis of the hands in egocentric vision: a survey. IEEE Trans Pattern Anal Mach Intell 45, 6846–6866 (2023). DOI: 10.1109/TPAMI.2020.2986648 CrossRef Google Scholar
[30]	Zhang YF, Cao CQ, Cheng J, Lu HQ. EgoGesture: a new dataset and benchmark for egocentric hand gesture recognition. IEEE Trans Multimedia 20, 1038–1050 (2018). DOI: 10.1109/TMM.2018.2808769 CrossRef Google Scholar
[31]	Razzari L, Duchesne D, Ferrera M, Morandotti R, Chu S et al. CMOS-compatible integrated optical hyper-parametric oscillator. Nat Photonics 4, 41–45 (2010). DOI: 10.1038/nphoton.2009.236 CrossRef Google Scholar
[32]	Moss DJ, Morandotti R, Gaeta AL, Lipson M. New CMOS-compatible platforms based on silicon nitride and Hydex for nonlinear optics. Nat Photonics 7, 597–607 (2013). DOI: 10.1038/nphoton.2013.183 CrossRef Google Scholar
[33]	Kippenberg TJ, Gaeta AL, Lipson M, Gorodetsky ML. Dissipative kerr solitons in optical microresonators. Science 361, eaan8083 (2018). Google Scholar
[34]	Graves A, Wayne G, Reynolds M, Harley T, Danihelka I et al. Hybrid computing using a neural network with dynamic external memory. Nature 538, 471–476 (2016). DOI: 10.1038/nature20101 CrossRef Google Scholar

View full references list

Cited By

Cited by

Periodical cited type(12)

1.	Luo, X.. Digital optics and optical intelligent agent. Applied Physics Letters, 2025, 127(1): 010501. DOI:10.1063/5.0271341
2.	Jia, X., Xu, B., Luo, J. et al. Heterogeneous integration of silicon nitride photonics and CVD-grown WS2 for second harmonic generation. Chinese Physics B, 2025, 34(7): 074205. DOI:10.1088/1674-1056/adc7f7
3.	Li, J., Zhang, W., Dong, W. et al. Progress and Challenges of Optical Logic Computation (Invited) \| [光学逻辑计算进展与挑战（特邀）]. Guangxue Xuebao Acta Optica Sinica, 2025, 45(14): 1420001. DOI:10.3788/AOS241964
4.	Schmid, D., Eisermann, R., Peczek, A. et al. Wafer-Scale Experimental Determination of Coupling and Loss for Photonic Integrated Circuit Design Optimisation. Photonics, 2025, 12(3): 234. DOI:10.3390/photonics12030234
5.	Xiao, Z., Ren, Z., Zhuge, Y. et al. Edge-Computing Enabled Si Photonics Multimodal Sensor with Integrated Photonic Convolutional Processor. Proceedings of the IEEE International Conference on Micro Electro Mechanical Systems MEMS, 2025. DOI:10.1109/MEMS61431.2025.10918045
6.	Huang, Y., Qian, Z., Xue, J. High-precision photonic neural networks utilizing micro-ring resonators. Proceedings of SPIE the International Society for Optical Engineering, 2025. DOI:10.1117/12.3057079
7.	Yan, H., Sun, Y., Yu, Y. et al. Multi-Functional Optical Neural Network in a Disordered Medium. Laser and Photonics Reviews, 2025. DOI:10.1002/lpor.202501407
8.	Huang, Y., Jiang, Z., Gu, J. et al. Cascaded Micro-Ring Resonators for Low-Crosstalk High-Density Photonic Convolutional Computing. Laser and Photonics Reviews, 2025. DOI:10.1002/lpor.202401874
9.	Xiao, Z., Ren, Z., Zhuge, Y. et al. Multimodal In-Sensor Computing System Using Integrated Silicon Photonic Convolutional Processor. Advanced Science, 2024, 11(47): 2408597. DOI:10.1002/advs.202408597
10.	Zhang, N., Huang, Z., Zhang, Z. et al. Advances of Optical Computing and Optoelectronic Intelligent Computing \| [光计算和光电智能计算研究进展]. Zhongguo Jiguang Chinese Journal of Lasers, 2024, 51(18): 1800001. DOI:10.3788/CJL240780
11.	Zhou, H., Zhou, H., Dong, J. Advances and Challenges of Optical Convolution Computation (Invited) \| [光学卷积计算的进展与挑战(特邀)]. Guangxue Xuebao Acta Optica Sinica, 2024, 44(15): 1513022. DOI:10.3788/AOS240782
12.	Hu, G., Rho, J. Special issue on digital and intelligent optics. Opto Electronic Science, 2023, 2(11) DOI:10.29026/oes.2023.230050

Other cited types(0)

Author Information

Author Information
- Baiheng Zhao On this Site On Google Scholar
  - Wuhan National Laboratory for Optoelectronics, Huazhong University of Science and Technology, Wuhan 430074, China
- Junwei Cheng On this Site On Google Scholar
  - Wuhan National Laboratory for Optoelectronics, Huazhong University of Science and Technology, Wuhan 430074, China
- Bo Wu On this Site On Google Scholar
  - Wuhan National Laboratory for Optoelectronics, Huazhong University of Science and Technology, Wuhan 430074, China
- Dingshan Gao On this Site On Google Scholar
  - Wuhan National Laboratory for Optoelectronics, Huazhong University of Science and Technology, Wuhan 430074, China
- Hailong Zhou On this Site On Google Scholar
  - Wuhan National Laboratory for Optoelectronics, Huazhong University of Science and Technology, Wuhan 430074, China
- Corresponding author: Jianji Dong, jjdong@hust.edu.cn On this Site On Google Scholar
  JJ Dong, E-mail: jjdong@hust.edu.cn
  - Wuhan National Laboratory for Optoelectronics, Huazhong University of Science and Technology, Wuhan 430074, China
  - Optics Valley Laboratory, Wuhan 430074, China

Supplements (1)

Supplementary Information

Title

Actions

Copyright

Open Access. © The Author(s). This article is licensed under a Creative Commons Attribution 4.0 International License. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.

About this Article

About this Article
DOI: 10.29026/oes.2023.230017

Cite this Article

Baiheng Zhao, Junwei Cheng, Bo Wu, et al. Integrated photonic convolution acceleration core for wearable devices. Opto-Electronic Science 2, 230017 (2023). DOI: 10.29026/oes.2023.230017

Baiheng Zhao, Junwei Cheng, Bo Wu, et al. Integrated photonic convolution acceleration core for wearable devices. Opto-Electronic Science 2, 230017 (2023). DOI: 10.29026/oes.2023.230017

Download Citation
Article History
- Received Date June 29, 2023
- Accepted Date September 16, 2023
- Available Online November 27, 2023
- Published Date December 29, 2023
Article Metrics

Article Views(4988) PDF Downloads(782)
Share:

Related Articles
- Simultaneous detection of inflammatory process indicators via operando dual lossy mode resonance-based biosensor
  
  Desiree Santano, Abian B. Socorro, Ambra Giannetti, Ignacio Del Villar, Francesco Chiavaioli
  
  Opto-Electronic Science, DOI: 10.29026/oes.2025.250005
- Multi-resonance enhanced photothermal synergistic fiber-optic Tamm plasmon polariton tip for high-sensitivity and rapid hydrogen detection
  
  Xinran Wei, Yuzhang Liang, Xuhui Zhang, Rui Li, Haonan Wei, Yijin He, Lanlan Shen, Yurui Fang, Ting Xu, Wei Peng
  
  Opto-Electronic Science, DOI: 10.29026/oes.2025.240029
- Optical micro/nanofiber enabled tactile sensors and soft actuators: A review
  
  Lei Zhang, Yuqi Zhen, Limin Tong
  
  Opto-Electronic Science, DOI: 10.29026/oes.2024.240005
- Applications of lasers: A promising route toward low-cost fabrication of high-efficiency full-color micro-LED displays
  
  Shouqiang Lai, Shibiao Liu, Zilu Li, Zhening Zhang, Zhong Chen, Rong Zhang, Hao-Chung Kuo, Tingzhu Wu
  
  Opto-Electronic Science, DOI: 10.29026/oes.2023.230028
- Pattern recognition in multi-synaptic photonic spiking neural networks based on a DFB-SA chip
  
  Yanan Han, Shuiying Xiang, Ziwei Song, Shuang Gao, Xingxing Guo, Yahui Zhang, Yuechun Shi, Xiangfei Chen, Yue Hao
  
  Opto-Electronic Science, DOI: 10.29026/oes.2023.230021
- A review of liquid crystal spatial light modulators: devices and applications
  
  Yiqian Yang, Andrew Forbes, Liangcai Cao
  
  Opto-Electronic Science, DOI: 10.29026/oes.2023.230026
- In-fiber photoelectric device based on graphene-coated tilted fiber grating
  
  Biqiang Jiang, Yueguo Hou, Jiexing Wu, Yuxin Ma, Xuetao Gan, Jianlin Zhao
  
  Opto-Electronic Science, DOI: 10.29026/oes.2023.230012
- High-speed multiwavelength InGaAs/InP quantum well nanowire array micro-LEDs for next generation optical communications
  
  Fanlu Zhang, Zhicheng Su, Zhe Li, Yi Zhu, Nikita Gagrani, Ziyuan Li, Mark Lockrey, Li Li, Igor Aharonovich, Yuerui Lu, Hark Hoe Tan, Chennupati Jagadish, Lan Fu
  
  Opto-Electronic Science, DOI: 10.29026/oes.2023.230003
- Multi-foci metalens for spectra and polarization ellipticity recognition and reconstruction
  
  Hui Gao, Xuhao Fan, Yuxi Wang, Yuncheng Liu, Xinger Wang, Ke Xu, Leimin Deng, Cheng Zeng, Tingan Li, Jinsong Xia, Wei Xiong
  
  Opto-Electronic Science, DOI: 10.29026/oes.2023.220026
- High-speed visible light communication based on micro-LED: A technology with wide applications in next generation communication
  
  Tingwei Lu, Xiangshu Lin, Wenan Guo, Chang-Ching Tu, Shibiao Liu, Chun-Jung Lin, Zhong Chen, Hao-Chung Kuo, Tingzhu Wu
  
  Opto-Electronic Science, DOI: 10.29026/oes.2022.220020

Article Contents
Figures/Tables
References

View in article Downloads

Full-Size Img PowerPoint
View in article Downloads

Full-Size Img PowerPoint
View in article Downloads

Full-Size Img PowerPoint
View in article Downloads

Full-Size Img PowerPoint
View in article Downloads

Full-Size Img PowerPoint

Components	Voltage(V)	Current(A)	Power(W)
Lasers			~10×10^-3×4=0.04
On ring heaters	$\bar{V}\approx 2$	~5×10^-3	~0.01×16=0.16
Intensity modulator1	$\bar{V}\approx 7$	0.266	1.862
Intensity modulator2	$\bar{V}\approx 6.5$	0.278	1.807
Intensity modulator3	$\bar{V}\approx 7$	0.272	1.904
Intensity modulator4	$\bar{V}\approx 7.5$	0.259	1.943
System’s power consumption			7.716
TEC for PCAC chip	6.145	0.53	3.257
CPU			~30
Total power consumption			40.973

View in article Downloads

[1]	Zhang SB, Li YX, Zhang S, Shahabi F, Xia S et al. Deep learning in human activity recognition with wearable sensors: a review on advances. Sensors 22, 1476 (2022). DOI: 10.3390/s22041476 CrossRef Google Scholar
[2]	Chang WJ, Chen LB, Chiou YZ. Design and implementation of a drowsiness-fatigue-detection system based on wearable smart glasses to increase road safety. IEEE Trans Consum Electron 64, 461–469 (2018). DOI: 10.1109/TCE.2018.2872162 CrossRef Google Scholar
[3]	Ramanujam E, Perumal T, Padmavathi S. Human activity recognition with smartphone and wearable sensors using deep learning techniques: a review. IEEE Sensors J 21, 13029–13040 (2021). DOI: 10.1109/JSEN.2021.3069927 CrossRef Google Scholar
[4]	Chen KX, Zhang DL, Yao LN, Guo B, Yu ZW et al. Deep learning for sensor-based human activity recognition: overview, challenges, and opportunities. ACM Comput Surv 54, 77 (2022). DOI: 10.1145/3447744 CrossRef Google Scholar
[5]	Wang JD, Chen YQ, Hao SJ, Peng XH, Hu LS. Deep learning for sensor-based activity recognition: a survey. Pattern Recognit Lett 119, 3–11 (2019). DOI: 10.1016/j.patrec.2018.02.010 CrossRef Google Scholar
[6]	Nweke HF, Teh YW, Al-garadi MA, Alo UR. Deep learning algorithms for human activity recognition using mobile and wearable sensor networks: state of the art and research challenges. Expert Syst Appl 105, 233–261 (2018). DOI: 10.1016/j.eswa.2018.03.056 CrossRef Google Scholar
[7]	Terven JR, Raducanu B, Meza-de-Luna ME, Salas J. Head-gestures mirroring detection in dyadic social interactions with computer vision-based wearable devices. Neurocomputing 175, 866–876 (2016). DOI: 10.1016/j.neucom.2015.05.131 CrossRef Google Scholar
[8]	Perusquía-Hernández M, Hirokawa M, Suzuki K. A wearable device for fast and subtle spontaneous smile recognition. IEEE Trans Affective Comput 8, 522–533 (2017). DOI: 10.1109/TAFFC.2017.2755040 CrossRef Google Scholar
[9]	Gruebler A, Suzuki K. Design of a wearable device for reading positive expressions from facial EMG signals. IEEE Trans Affective Comput 5, 227–237 (2014). DOI: 10.1109/TAFFC.2014.2313557 CrossRef Google Scholar
[10]	Brown TB, Mann B, Ryder N, Subbiah M, Kaplan J et al. Language models are few-shot learners. In Proceedings of the 34th International Conference on Neural Information Processing Systems 159 (ACM, 2020); http://doi.org/10.5555/3495724.3495883. http://doi.org/10.5555/3495724.3495883. " target="_blank">Google Scholar
[11]	Hussein AI. Wearable computing: challenges of implementation and its future. In 2015 12th Learning and Technology Conference 14–19 (IEEE, 2015);http://doi.org/10.1109/LT.2015.7587224. http://doi.org/10.1109/LT.2015.7587224. " target="_blank">Google Scholar
[12]	Zhou HL, Dong JJ, Cheng JW, Dong WC, Huang CR et al. Photonic matrix multiplication lights up photonic accelerator and beyond. Light Sci Appl 11, 30 (2022). DOI: 10.1038/s41377-022-00717-8 CrossRef Google Scholar
[13]	Shastri BJ, Tait AN, Ferreira De Lima T, Pernice WHP, Bhaskaran H et al. Photonics for artificial intelligence and neuromorphic computing. Nat Photonics 15, 102–114 (2021). DOI: 10.1038/s41566-020-00754-y CrossRef Google Scholar
[14]	Nahmias MA, De Lima TF, Tait AN, Peng HT, Shastri BJ et al. Photonic multiply-accumulate operations for neural networks. IEEE J Sel Top Quantum Electron 26, 1–18 (2020). DOI: 10.1109/jstqe.2019.2941485 CrossRef Google Scholar
[15]	Lin X, Rivenson Y, Yardimci NT, Veli M, Luo Y et al. All-optical machine learning using diffractive deep neural networks. Science 361, 1004–1008 (2018). DOI: 10.1126/science.aat8084 CrossRef Google Scholar
[16]	Panuski CL, Christen I, Minkov M, Brabec CJ, Trajtenberg-Mills S et al. A full degree-of-freedom spatiotemporal light modulator. Nat Photonics 16, 834–842 (2022). DOI: 10.1038/s41566-022-01086-9 CrossRef Google Scholar
[17]	Li JX, Hung YC, Kulce O, Mengu D, Ozcan A. Polarization multiplexed diffractive computing: all-optical implementation of a group of linear transformations through a polarization-encoded diffractive network. Light Sci Appl 11, 153 (2022). DOI: 10.1038/s41377-022-00849-x CrossRef Google Scholar
[18]	Xu XY, Tan MX, Corcoran B, Wu JY, Boes A et al. 11 TOPS photonic convolutional accelerator for optical neural networks. Nature 589, 44–51 (2021). DOI: 10.1038/s41586-020-03063-0 CrossRef Google Scholar
[19]	Bai BW, Yang QP, Shu HW, Chang L, Yang FH et al. Microcomb-based integrated photonic processing unit. Nat Commun 14, 66 (2023). DOI: 10.1038/s41467-022-35506-9 CrossRef Google Scholar
[20]	Shen YC, Harris NC, Skirlo S, Prabhu M, Baehr-Jones T et al. Deep learning with coherent nanophotonic circuits. Nat Photonics 11, 441–446 (2017). DOI: 10.1038/nphoton.2017.93 CrossRef Google Scholar
[21]	Hughes TW, Minkov M, Shi Y, Fan SH. Training of photonic neural networks through in situ backpropagation and gradient measurement. Optica 5, 864–871 (2018). DOI: 10.1364/OPTICA.5.000864 CrossRef Google Scholar
[22]	Zhou HL, Zhao YH, Wang X, Gao DS, Dong JJ et al. Self-configuring and reconfigurable silicon photonic signal processor. ACS Photonics 7, 792–799 (2020). DOI: 10.1021/acsphotonics.9b01673 CrossRef Google Scholar
[23]	Zhang H, Gu M, Jiang XD, Thompson J, Cai H et al. An optical neural chip for implementing complex-valued neural network. Nat Commun 12, 457 (2021). DOI: 10.1038/s41467-020-20719-7 CrossRef Google Scholar
[24]	Tait AN, De Lima TF, Zhou E, Wu AX, Nahmias MA et al. Neuromorphic photonic networks using silicon photonic weight banks. Sci Rep 7, 7430 (2017). DOI: 10.1038/s41598-017-07754-z CrossRef Google Scholar
[25]	Xu SF, Wang J, Yi SC, Zou WW. High-order tensor flow processing using integrated photonic circuits. Nat Commun 13, 7970 (2022). DOI: 10.1038/s41467-022-35723-2 CrossRef Google Scholar
[26]	Huang CR, Bilodeau S, Ferreira De Lima T, Tait AN, Ma PY et al. Demonstration of scalable microring weight bank control for large-scale photonic integrated circuits. APL Photonics 5, 040803 (2020). DOI: 10.1063/1.5144121 CrossRef Google Scholar
[27]	Cheng JW, He ZM, Guo YH, Wu B, Zhou HL et al. Self-calibrating microring synapse with dual-wavelength synchronization. Photonics Res 11, 347 (2023). DOI: 10.1364/PRJ.478370 CrossRef Google Scholar
[28]	Zhang WP, Huang CR, Peng HT, Bilodeau S, Jha A et al. Silicon microring synapses enable photonic deep learning beyond 9-bit precision. Optica 9, 579–584 (2022). DOI: 10.1364/OPTICA.446100 CrossRef Google Scholar
[29]	Bandini A, Zariffa J. Analysis of the hands in egocentric vision: a survey. IEEE Trans Pattern Anal Mach Intell 45, 6846–6866 (2023). DOI: 10.1109/TPAMI.2020.2986648 CrossRef Google Scholar
[30]	Zhang YF, Cao CQ, Cheng J, Lu HQ. EgoGesture: a new dataset and benchmark for egocentric hand gesture recognition. IEEE Trans Multimedia 20, 1038–1050 (2018). DOI: 10.1109/TMM.2018.2808769 CrossRef Google Scholar
[31]	Razzari L, Duchesne D, Ferrera M, Morandotti R, Chu S et al. CMOS-compatible integrated optical hyper-parametric oscillator. Nat Photonics 4, 41–45 (2010). DOI: 10.1038/nphoton.2009.236 CrossRef Google Scholar
[32]	Moss DJ, Morandotti R, Gaeta AL, Lipson M. New CMOS-compatible platforms based on silicon nitride and Hydex for nonlinear optics. Nat Photonics 7, 597–607 (2013). DOI: 10.1038/nphoton.2013.183 CrossRef Google Scholar
[33]	Kippenberg TJ, Gaeta AL, Lipson M, Gorodetsky ML. Dissipative kerr solitons in optical microresonators. Science 361, eaan8083 (2018). Google Scholar
[34]	Graves A, Wayne G, Reynolds M, Harley T, Danihelka I et al. Hybrid computing using a neural network with dynamic external memory. Nature 538, 471–476 (2016). DOI: 10.1038/nature20101 CrossRef Google Scholar

Links

Simultaneous detection of inflammatory process indicators via operando dual lossy mode resonance-based biosensor

Desiree Santano, Abian B. Socorro, Ambra Giannetti, Ignacio Del Villar, Francesco Chiavaioli

Opto-Electronic Science, DOI: 10.29026/oes.2025.250005
Multi-resonance enhanced photothermal synergistic fiber-optic Tamm plasmon polariton tip for high-sensitivity and rapid hydrogen detection

Xinran Wei, Yuzhang Liang, Xuhui Zhang, Rui Li, Haonan Wei, Yijin He, Lanlan Shen, Yurui Fang, Ting Xu, Wei Peng

Opto-Electronic Science, DOI: 10.29026/oes.2025.240029
Optical micro/nanofiber enabled tactile sensors and soft actuators: A review

Lei Zhang, Yuqi Zhen, Limin Tong

Opto-Electronic Science, DOI: 10.29026/oes.2024.240005
Applications of lasers: A promising route toward low-cost fabrication of high-efficiency full-color micro-LED displays

Shouqiang Lai, Shibiao Liu, Zilu Li, Zhening Zhang, Zhong Chen, Rong Zhang, Hao-Chung Kuo, Tingzhu Wu

Opto-Electronic Science, DOI: 10.29026/oes.2023.230028
Pattern recognition in multi-synaptic photonic spiking neural networks based on a DFB-SA chip

Yanan Han, Shuiying Xiang, Ziwei Song, Shuang Gao, Xingxing Guo, Yahui Zhang, Yuechun Shi, Xiangfei Chen, Yue Hao

Opto-Electronic Science, DOI: 10.29026/oes.2023.230021
A review of liquid crystal spatial light modulators: devices and applications

Yiqian Yang, Andrew Forbes, Liangcai Cao

Opto-Electronic Science, DOI: 10.29026/oes.2023.230026
In-fiber photoelectric device based on graphene-coated tilted fiber grating

Biqiang Jiang, Yueguo Hou, Jiexing Wu, Yuxin Ma, Xuetao Gan, Jianlin Zhao

Opto-Electronic Science, DOI: 10.29026/oes.2023.230012
High-speed multiwavelength InGaAs/InP quantum well nanowire array micro-LEDs for next generation optical communications

Fanlu Zhang, Zhicheng Su, Zhe Li, Yi Zhu, Nikita Gagrani, Ziyuan Li, Mark Lockrey, Li Li, Igor Aharonovich, Yuerui Lu, Hark Hoe Tan, Chennupati Jagadish, Lan Fu

Opto-Electronic Science, DOI: 10.29026/oes.2023.230003
Multi-foci metalens for spectra and polarization ellipticity recognition and reconstruction

Hui Gao, Xuhao Fan, Yuxi Wang, Yuncheng Liu, Xinger Wang, Ke Xu, Leimin Deng, Cheng Zeng, Tingan Li, Jinsong Xia, Wei Xiong

Opto-Electronic Science, DOI: 10.29026/oes.2023.220026
High-speed visible light communication based on micro-LED: A technology with wide applications in next generation communication

Tingwei Lu, Xiangshu Lin, Wenan Guo, Chang-Ching Tu, Shibiao Liu, Chun-Jung Lin, Zhong Chen, Hao-Chung Kuo, Tingzhu Wu

Opto-Electronic Science, DOI: 10.29026/oes.2022.220020

Integrated photonic convolution acceleration core for wearable devices

Abstract

Keywords

Introduction

Results

The principle

The fabrication and characterization of PCAC chip

Operation for convolution and edge detection

Application of first-person depth-based gesture recognition using PCAC chip

Discussion

Energy efficiency estimation

Throughput estimation

Scalability

Wearable application potential

Conclusions

Acknowledgements

Competing interests

References

Cited by

Periodical cited type(12)

Other cited types(0)

Author Information

Baiheng Zhao On this Site On Google Scholar

Junwei Cheng On this Site On Google Scholar

Bo Wu On this Site On Google Scholar

Dingshan Gao On this Site On Google Scholar

Hailong Zhou On this Site On Google Scholar

Corresponding author: Jianji Dong, jjdong@hust.edu.cn On this Site On Google Scholar

Supplementary Information

Copyright

About this Article

Cite this Article

Article History

Article Metrics

Links

Catalog

Corresponding author: Jianji Dong, jjdong@hust.edu.cn

Manuscript Submission

More Content

Legal & Privacy

Integrated photonic convolution acceleration core for wearable devices

Abstract

Keywords

Introduction

Results

The principle

The fabrication and characterization of PCAC chip

Operation for convolution and edge detection

Application of first-person depth-based gesture recognition using PCAC chip

Discussion

Energy efficiency estimation

Throughput estimation

Scalability

Wearable application potential

Conclusions

Acknowledgements

Competing interests

References

Cited by

Periodical cited type(12)

Other cited types(0)

Author Information

Baiheng Zhao On this SiteOn Google Scholar

Junwei Cheng On this SiteOn Google Scholar

Bo Wu On this SiteOn Google Scholar

Dingshan Gao On this SiteOn Google Scholar

Hailong Zhou On this SiteOn Google Scholar

Corresponding author: Jianji Dong, jjdong@hust.edu.cn On this SiteOn Google Scholar

Supplementary Information

Copyright

About this Article

Cite this Article

Article History

Article Metrics

Related Articles

Links

Related Articles

Catalog

Corresponding author: Jianji Dong, jjdong@hust.edu.cn

Manuscript Submission

More Content

Legal & Privacy

Export File

Citation

Format

Content

WeChat Qrcode

Baiheng Zhao On this Site On Google Scholar

Junwei Cheng On this Site On Google Scholar

Bo Wu On this Site On Google Scholar

Dingshan Gao On this Site On Google Scholar

Hailong Zhou On this Site On Google Scholar

Corresponding author: Jianji Dong, jjdong@hust.edu.cn On this Site On Google Scholar