Expand this Topic clickable element to expand a topic
Skip to content
Optica Publishing Group

Compact, efficient, and scalable nanobeam core for photonic matrix-vector multiplication

Open Access Open Access

Abstract

Optical neural networks have emerged as a promising avenue for implementing artificial intelligence applications, with matrix computations being a crucial component. However, the existing implementations based on microring resonators (MRRs) face bottlenecks in integration, power efficiency, and scalability, hindering the practical applications of wavelength division multiplexing (WDM)-based matrix-vector multiplications at the hardware level. Here we present a photonic crystal nanobeam cavity (PCNC) matrix core. Remarkably compact with dimensions reduced to ${20}\;{\unicode{x00B5}{\rm m}} \times {0.5}\;{\unicode{x00B5}{\rm m}}$, the PCNC unit exhibits a thermal tuning efficiency more than three times that of MRRs. Crucially, it is immune to the free spectral range constraint, thus able to harness the wealth of independent wavelength channels provided by WDM. A ${3} \times {3}$ PCNC core chip is demonstrated for animal face recognition and a six-channel chip is employed for handwritten digit classification to demonstrate the scalability. The PCNC solution holds immense promise, offering a versatile platform for next-generation photonic artificial intelligence chips.

© 2024 Optica Publishing Group under the terms of the Optica Open Access Publishing Agreement

1. INTRODUCTION

Artificial neural networks (ANNs) have emerged as a transformative force in the realm of artificial intelligence (AI), exhibiting remarkable capabilities in various applications such as image and voice recognition, medical treatment, and autonomous systems [19]. However, the escalating demand for computational power and energy efficiency has driven research into advanced hardware solutions to accelerate computations within ANNs [1012]. Optical neural networks (ONNs) have garnered significant attention as promising alternatives to conventional electronic processors, drawing on several key advantages over their electronic counterparts [1124]. The utilization of photons as information carriers enables ONNs to engage in high-speed parallel information processing. Additionally, photonic circuits, characterized by their exceptional attributes like ultra-wide bandwidth, high frequency, and low energy consumption, offer an ideal foundation for the execution of complex AI computations.

Matrix-vector multiplications (MVMs) are fundamental mathematical operations in ONNs essential for data transformation, weight adjustments, and learning processes [5,2527]. The weighted interconnections between adjacent neurons in the network are mathematically represented by a matrix whose entries are the weight values. Due to the abundant orthogonal channels in the wavelength domain, a wavelength division multiplexing (WDM) protocol known as “broadcast-and-weight” has been proposed for neural network models [20,2832]. The input vector is simultaneously fed into a single waveguide and evenly distributed to a reconfigurable filter bank in order to load weights. WDM-MVMs offer a practical approach to expand the number of neurons with spectrum reuse strategies, demonstrating significant potential for high computational density. Currently, WDM-MVMs predominantly utilize microring resonator (MRR) arrays [12,16,24,3338]. On the one hand, however, the footprint of MRR is constrained by waveguide bending losses, impeding the further reduction of chip size and power consumption. On the other hand, the free spectral range (FSR) limits the number of channels, posing a barrier to a large-scale cascade [39].

 figure: Fig. 1.

Fig. 1. (a) Schematic representation of the PCNC matrix core. (b) Advantages of the PCNC scheme compared with MRR. (c) Principle of PCNC weight bank.

Download Full Size | PDF

To address the challenges in WDM-MVMs, here we propose a novel and cost-effective photonic crystal nanobeam cavity (PCNC) matrix core. As a one-dimensional structure based on band engineering, PCNCs exhibit notable advantages over MRRs to execute WDM-MVMs. Our PCNC unit has a compact size of ${20}\;{\unicode{x00B5}{\rm m}} \times {0.5}\;{\unicode{x00B5}{\rm m}}$, which is comparable to that of a single mode waveguide and is conducive to mitigation of footprint expansion. Consequently, PCNCs naturally hold higher thermal-tuning efficiency than MRRs, leading to reduced power consumption when loading the same weights in ONNs. Furthermore, owing to the properties of finely tailored Bloch modes, PCNCs are not constrained by the limitation of free spectral range (FSR) within the required bandwidth. This practical attribute enables feasible scaling of channels for enhanced computational capacity of the matrix core. We demonstrate a ${3} \times {3}$ PCNC matrix core, showcasing its matrix computation capabilities through image convolution and cat-dog face recognition. Moreover, to highlight its scalability, we fabricate a chip with six wavelength channels, realizing a modified National Institute of Standards and Technology (MNIST) recognition task as a general competency demonstration. This compact, efficient, and scalable device paves the way for scaling up MVMs. Its compatibility with the standard complementary metal-oxide semiconductor (CMOS) processes and flexibility for implementation on other material platforms imbue our solution with potential for application in high-speed, large-capacity networks on-chip (NoCs) and commercial optical computing modules.

2. RESULTS

Figure 1(a) illustrates the schematic diagram of the proposed PCNC matrix core. The $N$-dimensional vector (with its components represented as ${x_i}$) obtained from preliminary modulation is loaded as intensity signals of different wavelengths into the chip and distributed to the $N \times N$ PCNC weight bank through a directional coupler (DC). Similar to microrings, the PCNC array selectively weights the input vector [30]. Each PCNC processes a specific wavelength channel, with the transmission corresponding to that wavelength serving as the synaptic weight (${w_{\textit{ij}}}$). All units in the array achieve reconfigurable transmission through metal heaters located above each PCNC (not indicated in Fig. 1), and the mapping between weights and driving voltages is accomplished through predefined calibration. At the end of each row, the total power obtained from the parallel bus waveguides through a multiplication and summation process is detected by a photodetector (PD), corresponding to a component (${y_i}$) of the output vector. This process is depicted in Fig. 1(c), which matches the principle of WDM-MVM [28].

As a distinctive design based on one-dimensional photonic crystals, a nanobeam core exhibits three typical features superior to the MRR core for matrix computations in ONNs, shown in Fig. 1(b). Nanobeams achieve resonance enhancement not through the ring waveguide required for the whispering gallery mode (WGM) of MRRs, but rather by etching elliptic holes on a compact one-dimensional waveguide to create a bandgap reflection [40]. The smaller footprint also allows nanobeams to generate the same resonant frequency shift as MRRs with less thermal power (${P_t}$) under equivalent conditions, leading to lower energy consumption in optical MVMs [41]. Furthermore, through optimization of the intracavity Bloch modes [4244], an individual nanobeam unit exhibits no multi-wavelength resonant peaks within the required bandwidth, which is crucial for the scalability of the matrix core.

The proposed PCNC matrix core is composed of an array of $N \times N$ tunable nanobeam filters as shown in Fig. 2(a). It is fabricated on a silicon-on-insulator (SOI) chip with a top silicon layer thickness of 220 nm. The detailed fabrication process can be found in Section 1 of Supplement 1. The photonic crystal structure of the nanobeam cavity is centrally symmetric, and for convenience, we have labeled the elliptical holes from ${-}{20}$ to 20 and will only discuss the region with positive serial numbers. Each hole is characterized by three parameters that define its position and shape: ${r_x}$ and ${r_y}$ correspond to the semi-axes of the ellipse, while $a$ represents the distance between adjacent ellipses, as shown in Fig. 2(b). For instance, $r_x^{(4)}$ represents the half of the horizontal axis length of the hole with serial number 4. The photonic band of the hole closest to the center allows for the presence of resonant wavelength light, while holes 9 to 20 are designed identical to form a strong photonic crystal bandgap, confining the resonant light to the central region. Holes 1 to 8 are arranged in a structure with a quadratic gradient, to reduce scattering loss from reflections.

 figure: Fig. 2.

Fig. 2. Design and characteristics of PCNC units. (a) Schematic diagram of a single PCNC unit. (b) Key structural parameters and elliptical hole distribution characteristics of the PCNC unit. The absolute values of serial numbers represent the distance of holes from the center, while the positive and negative signs indicate different directions. (c) Scanning electron microscope (SEM) image of one PCNC unit. (d) Manufactured ${3} \times {3}$ PCNC matrix core, with an additional row for backup. (e) Photonic crystal band diagram of a PCNC, retaining only bandgap information. (f) Optical field distribution of resonant wavelengths (top) and non-resonant wavelengths (bottom) obtained through three-dimensional time-domain finite-difference (3D-FDTD) simulations. (g) Measured transmission spectra of one row (top) and thermal tuning characteristics of the transmission spectrum of a single unit (bottom).

Download Full Size | PDF

As a conceptual demonstration, we have designed a ${3} \times {3}$ array of nanobeam resonators and fabricated the matrix core chip, and an image of one unit captured by a scanning electron microscope (SEM) is depicted in Fig. 2(c). The chip is packaged with optical input/output (I/O) and the thermoelectric cooler (TEC), as is shown in Fig. 2(d). A zoom-in view of the device and the content regarding the structure of multi-channel configurations can be found in Section 2 of Supplement 1. Each unit is individually parameterized through the aforementioned procedure, and each column of PCNCs corresponds to a distinct wavelength channel. Metal electrodes deposited on the silica cladding are employed to manipulate the effective refractive index distribution and band structure of the PCNC, thereby thermally modulating the signal light intensity. The mechanism of thermo-optic tuning is presented in Section 3 of Supplement 1.

With this configuration, Fig. 2(e) shows that the photonic bandgap formed by each hole varies with the increasing serial number. More details are given in Supplement 1, Section 2. Wavelengths allowed (i.e., on the band) at the center hole are positioned within the bandgap of the edge holes and are thus reflected. Essentially, this introduces a defect mode within the bandgap of the edge holes, allowing coupled light to stably resonate at the center hole. By employing this band engineering approach, light that meets the resonance wavelength condition can be coupled into the PCNC through a bus waveguide and stimulate Bloch mode [45], while light of other wavelengths directly propagates to the next level of units, as depicted in Fig. 2(f).

The transmission spectra of one row of tested PCNCs are depicted in Fig. 2(g). The three resonant wavelengths are ${\lambda _1} = {1544.40}\;{\rm nm}$, ${\lambda _2} = {1554.20}\;{\rm nm}$, and ${\lambda _3} = {1561.08}\;{\rm nm}$. In fact, it can be observed that there are still numerous wavelength channels available due to the FSR-free feature. In view of the flexible structural design, channels can be set at wavelengths that align with the requirements of the light source band, rather than being restricted solely to the conventional band (C-band).

In order to characterize the advantages of the PCNC device, we have fabricated and tested an MRR unit under the same conditions, and the results are presented in Table 1. The relation between resonance wavelength and tuning power of an individual channel can be found in Supplement 1, Section 4. Compared to microrings, PCNC reduces one dimension in size and exhibits a tuning coefficient over three times larger. Furthermore, there is no FSR within the detected wavelength band, indicating excellent scalability.

Tables Icon

Table 1. Performance Comparison of the Fabricated PCNC and MRR Unit

For practical purposes, we experimentally demonstrated the applications of the PCNC matrix core chip in a convolutional neural network (CNN). First, we select standard test images to validate the chip’s capability in extracting image features. A standard ${256} \times {256}$ grayscale “Cameraman” image with 8-bit color depths is utilized. The principle of the convolution process involves the weighted summation of element-wise products between a small matrix (referred to as the convolutional kernel) and corresponding image patches in the input image. Specifically, the weights of the kernel are assigned to the pixel values of the input image. During each image convolution operation, the convolutional kernel is slid over the input image with a predefined step length, generating weighted summations that are reshaped into feature maps. Several commonly used convolutional kernels are employed to extract various features from the image, including blur, bottom sobel, emboss, and motion blur. A tunable laser source (TLS) is employed to provide multi-channel laser inputs that match the resonant wavelengths of PCNCs. The grayscale values of the input image’s pixels are normalized and arranged as a serial data flow, which is then fed into an electro-optic modulator and weighted by the PCNC matrix core during light propagation, as illustrated in Fig. 3. The optically weighted results are detected by photodetectors (PDs), and subsequently reshaped into the final feature maps. The experimental results closely resemble the theoretical feature maps generated by a 64-bit digital computer. Section 5 of Supplement 1 provides further insights into the photonic matrix multiplication, encompassing the treatment of negative numbers.

 figure: Fig. 3.

Fig. 3. Image convolution implemented by PCNC core. (a) Experimental setup diagram. (b) Matrix convolution results using different kernels, including several classical image processing techniques. EOM: electro-optical Mach–Zehnder modulator. MUX, wavelength multiplexer. PD, photodetector. FPGA, field programmable gate array.

Download Full Size | PDF

Next, we conduct an experiment to validate the practicality of our chip in CNNs using an animal faces dataset (AFHQ) [46], in which we demonstrate the recognition of cat and dog faces. Figure 4(a) illustrates the CNN model employed, where the first layer of convolutional computations is accomplished using the PCNC matrix core. Due to the intricate details present in animal faces and the fact that the actual images are compressed versions of the original dataset images, the complexity of this task is challenging. Leveraging a pre-encoded dataset, the matrix core is trained through a gradient descent optimizer, and the learned parameters are mapped onto the driving voltages of the PCNC core. A total of 6000 images with dimensions of ${28} \times {28}$ pixels are utilized for training, and a test set consisting of 120 images (60 cats and 60 dogs) is employed. Input features are intensity-modulated onto lasers of different wavelengths and processed through convolutional, non-linear, and fully connected layers to generate classification information. The PCNC synapses enable parallel convolutional computations on optical data streams of image patches, yielding feature maps as depicted in Figs. 4(b) and 4(c). The accuracy of the convolutional layer computations is depicted in Fig. 4(d), where the experimentally obtained values closely cluster near the diagonal, indicating precise computations with a high degree of reliability. It is worth noting that a calibration method is employed to correct errors caused by thermal crosstalk [47], and a detailed explanation of this calibration approach can be found in Section 6 of Supplement 1. The normalized standard deviation obtained from the Gaussian fit for the residuals is 0.0128. This result indicates that the PCNC core holds competitive computational accuracy. For the cat-dog face test dataset, we achieved a recognition accuracy of 85.9%, which is close to the result obtained using a 64-bit computer (88.3%). The relatively low precision in both optics and electronic computing attribute to substantial compression of the input image to accommodate the current scale of optical computing.

 figure: Fig. 4.

Fig. 4. Cat and dog image recognition experiment using a ${3} \times {3}$ PCNC core. (a) Architecture of the convolutional neural network employed. (b) Subset of the test dataset images and (c) corresponding output feature maps. (d) Scatter plot for measuring convolution accuracy. The inset presents a histogram of the residual distribution, illustrating the differences between measured and computed values. (e) Confusion matrices for cat and dog image classification on the AFHQ test dataset. Conv., convolution layer. ReLU, rectified linear unit. FC, fully connected layer.

Download Full Size | PDF

Finally, to demonstrate the excellent scalability of PCNC, we have fabricated a matrix core chip with six wavelength channels. Microscope images of the chip and the transmission spectrum of one row are depicted in Figs. 5(a) and 5(b), respectively. The six resonant wavelengths, namely, ${\lambda _4}$ to ${\lambda _9}$, are 1539.48 nm, 1544.08 nm, 1548.70 nm, 1553.38 nm, 1558.08 nm, and 1563.40 nm. Similarly, the diverse resonant wavelengths are designed based on the aforementioned band-engineering mapping. Moreover, this approach can still be further extended for additional wavelength channels, primarily attributed to PCNC’s freedom from FSR limitations across a considerable frequency range. We have utilized the extended chip to recognize the more demanding MNIST handwritten digit images. The multi-layer network model architecture used in this experiment is identical to the one employed in the previous AFHQ dataset experiment, and the parameters were trained in the electrical domain using the gradient descent algorithm and then loaded into the optical domain; 10,000 images are employed for training the gradient descent algorithm, while 500 images form the test set. The outcomes, depicted in Fig. 5(c), reveal that the classification accuracy of the PCNC core is 87.0%, closely aligned with the 64-bit computer’s performance of 87.8%.

 figure: Fig. 5.

Fig. 5. Handwritten digit recognition experiment using a six-channel PCNC core. (a) Manufactured six-channel PCNC matrix core chip. (b) Transmission spectra of one of the rows. (c) Confusion matrices for handwritten digit classification on the MNIST test dataset.

Download Full Size | PDF

3. DISCUSSION

As a foundational device distinct from MRRs, the PCNC approach, in addition to the demonstrated attributes of compact size, high efficiency, and scalability, also holds untapped potential for further enhancement when employed as a matrix computation core. In this paper, the theoretical computational domain for the nanobeam utilized is within the domain of positive real numbers. Alternatively, employing an add-drop type of waveguide coupling scheme and performing differential processing at the output end can extend both the transmission matrix and the output vector to encompass the negative number domain [43]. Notably, the Fano-enhanced nanobeam proposed in Refs. [48,49] can be adopted to enhance port transmission efficiency and the extinction ratio, ensuring a higher computational dynamic range. Designing thermal electrodes tailored to the small mode volume of PCNCs or employing phase-change material (PCM) [20] to switch output states holds the potential to further enhance energy efficiency. Etching thermal isolation trenches between PCNC units can also be employed to increase the thermal tuning coefficient and aid in eliminating thermal crosstalk, thereby further reducing the overall chip size [50].

Although we have demonstrated the multi-channel capabilities of the PCNC matrix core, it is evident from the transmission spectrum that there are still numerous wavelength channels available for selection. Previous analyses [30,39] have examined the expansion limits of the MRR scheme, with its maximum channel count (${N_c}$) constrained by FSR as

$${N_c} \le \frac{F}{{\delta \omega}},$$
where $F$ is finesse, defined as the ratio of FSR and the full-width half-maximum (FWHM), and $\delta \omega$ is the linewidth-normalized channel spacing. The PCNC scheme overcomes this limiting bottleneck since $F$ is considerably magnified. It should be noted that due to the periodicity of the reciprocal lattice, the photonic crystal’s band structure is folded into the first Brillouin zone, which may result in additional resonances at large wavelength intervals [51]. However, this wavelength spacing is generally larger than the spectral range of other hardware components in CNN architectures, and is no longer the primary limiting factor.

The flexibility of the PCNC scheme extends beyond these aspects. In scenarios where stringent precision is not required, the lateral dimensions of the nanobeams can be reduced by decreasing the number of holes in the bandgap regions on both sides [52], which enables achieving a higher computational density. Similar to MRRs, PCNCs can be employed for on-chip front modulation, substituting the need for external modulators to load vectors [22]. As a structurally adaptable design, the PCNC can even be transposed onto other material platforms, such as lithium niobite [53,54], to achieve faster and more efficient modulation. In future endeavors, the integration of non-linear activation functions could be pursued to achieve all-optical neural network chips. When compared to systems like Mach–Zehnder interferometer (MZI) mesh and spatial diffraction, the remarkably compact dimensions and compatibility with CMOS processes of our proposed approach offer advantages for cost-effective deployment within miniaturized commercial computing modules.

In conclusion, we have introduced the PCNC matrix core as a novel solution that addresses key challenges in WDM-MVMs. The compact dimensions facilitate high-density computations, while its superior thermal tuning efficiency, surpassing MRRs by more than threefold, offers energy-efficient operation. Notably, the PCNC matrix core’s design enables utilization of a multitude of independent wavelength channels in WDM, overcoming the limitations of FSR. Our experiments in animal face recognition and handwritten digit classification highlight its practicality and scalability. With its low-cost attributes, inherent flexibility, and compatibility with CMOS processes, the PCNC scheme emerges as an ideal choice for optical accelerator chips and paves the way for a new generation of hardware to advance AI applications.

Funding

National Natural Science Foundation of China (62274071, U21A20511); Innovation Project of Optics Valley Laboratory (OVL2021BG001).

Disclosures

The authors declare no conflicts of interest.

Data availability

Data underlying the results presented in this paper are not publicly available at this time but may be obtained from the authors upon reasonable request.

Supplemental document

See Supplement 1 for supporting content.

REFERENCES

1. Y. LeCun, Y. Bengio, and G. Hinton, “Deep learning,” Nature 521, 436–444 (2015). [CrossRef]  

2. J. Misra and I. Saha, “Artificial neural networks in hardware: a survey of two decades of progress,” Neurocomputing 74, 239–255 (2010). [CrossRef]  

3. A. K. Jain, M. Jianchang, and K. M. Mohiuddin, “Artificial neural networks: a tutorial,” Computer 29, 31–44 (1996). [CrossRef]  

4. V. Mnih, K. Kavukcuoglu, D. Silver, et al., “Human-level control through deep reinforcement learning,” Nature 518, 529–533 (2015). [CrossRef]  

5. D. D. Lee and H. S. Seung, “Learning the parts of objects by non-negative matrix factorization,” Nature 401, 788–791 (1999). [CrossRef]  

6. J. Sourati and J. A. Evans, “Accelerating science with human-aware artificial intelligence,” Nat. Hum. Behav. 7, 1682–1696 (2023). [CrossRef]  

7. N. Han, W. Hwang, K. Tzelepis, et al., “Identification of SARS-CoV-2–induced pathways reveals drug repurposing strategies,” Sci. Adv. 7, eabh3032 (2021). [CrossRef]  

8. Y. Liu, Z. Zhao, M. Xu, et al., “Decoding and synthesizing tonal language speech from brain activity,” Sci. Adv. 9, eadh0478 (2023). [CrossRef]  

9. Y. Sandamirskaya, M. Kaboli, J. Conradt, et al., “Neuromorphic computing hardware and neural architectures for robotics,” Sci. Rob. 7, eabl8419 (2022). [CrossRef]  

10. Y. Shen, N. C. Harris, S. Skirlo, et al., “Deep learning with coherent nanophotonic circuits,” Nat. Photonics 11, 441–446 (2017). [CrossRef]  

11. X. Xu, M. Tan, B. Corcoran, et al., “11 TOPS photonic convolutional accelerator for optical neural networks,” Nature 589, 44–51 (2021). [CrossRef]  

12. B. Bai, Q. Yang, H. Shu, et al., “Microcomb-based integrated photonic processing unit,” Nat. Commun. 14, 66 (2023). [CrossRef]  

13. T. W. Hughes, M. Minkov, Y. Shi, et al., “Training of photonic neural networks through in situ backpropagation and gradient measurement,” Optica 5, 864–871 (2018). [CrossRef]  

14. B. J. Shastri, A. N. Tait, T. Ferreira de Lima, et al., “Photonics for artificial intelligence and neuromorphic computing,” Nat. Photonics 15, 102–114 (2021). [CrossRef]  

15. X. Meng, G. Zhang, N. Shi, et al., “Compact optical convolution processing unit based on multimode interference,” Nat. Commun. 14, 3000 (2023). [CrossRef]  

16. S. Xu, J. Wang, S. Yi, et al., “High-order tensor flow processing using integrated photonic circuits,” Nat. Commun. 13, 7970 (2022). [CrossRef]  

17. T. Zhou, X. Lin, J. Wu, et al., “Large-scale neuromorphic optoelectronic computing with a reconfigurable diffractive processing unit,” Nat. Photonics 15, 367–373 (2021). [CrossRef]  

18. S. Xu, J. Wang, H. Shu, et al., “Optical coherent dot-product chip for sophisticated deep learning regression,” Light Sci. Appl. 10, 221 (2021). [CrossRef]  

19. H. Zhang, M. Gu, X. D. Jiang, et al., “An optical neural chip for implementing complex-valued neural network,” Nat. Commun. 12, 457 (2021). [CrossRef]  

20. J. Feldmann, N. Youngblood, M. Karpov, et al., “Parallel convolutional processing using an integrated photonic tensor core,” Nature 589, 52–58 (2021). [CrossRef]  

21. T. Fu, Y. Zang, Y. Huang, et al., “Photonic machine learning with on-chip diffractive optics,” Nat. Commun. 14, 70 (2023). [CrossRef]  

22. J. Cheng, H. Zhou, and J. Dong, “Photonic matrix computing: from fundamentals to applications,” Nanomaterials-Basel 11, 1689 (2021). [CrossRef]  

23. H. Zhou, J. Dong, J. Cheng, et al., “Photonic matrix multiplication lights up photonic accelerator and beyond,” Light Sci. Appl. 11, 30 (2022). [CrossRef]  

24. W. Zhang, C. Huang, H.-T. Peng, et al., “Silicon microring synapses enable photonic deep learning beyond 9-bit precision,” Optica 9, 579–584 (2022). [CrossRef]  

25. D. Pérez, I. Gasulla, P. Das Mahapatra, et al., “Principles, fundamentals, and applications of programmable integrated photonics,” Adv. Opt. Photonics 12, 709–786 (2020). [CrossRef]  

26. W. Bogaerts, D. Pérez, J. Capmany, et al., “Programmable photonic circuits,” Nature 586, 207–216 (2020). [CrossRef]  

27. D. Pérez, I. Gasulla, and J. Capmany, “Programmable multifunctional integrated nanophotonics,” Nanophotonics 7, 1351–1371 (2018). [CrossRef]  

28. A. N. Tait, M. A. Nahmias, B. J. Shastri, et al., “Broadcast and weight: an integrated network for scalable photonic spike processing,” J. Lightwave Technol. 32, 4029–4041 (2014). [CrossRef]  

29. A. N. Tait, T. F. d. Lima, M. A. Nahmias, et al., “Continuous calibration of microring weights for analog optical networks,” IEEE Photonics Technol. Lett. 28, 887–890 (2016). [CrossRef]  

30. A. N. Tait, A. X. Wu, T. F. d. Lima, et al., “Microring weight banks,” IEEE J. Sel. Top. Quantum Electron. 22, 312–325 (2016). [CrossRef]  

31. A. N. Tait, T. F. de Lima, M. A. Nahmias, et al., “Multi-channel control for microring weight banks,” Opt. Express 24, 8895–8906 (2016). [CrossRef]  

32. J. Cheng, Y. Zhao, W. Zhang, et al., “A small microring array that performs large complex-valued matrix-vector multiplication,” Front. Optoelectron. 15, 15 (2022). [CrossRef]  

33. J. Feldmann, N. Youngblood, C. D. Wright, et al., “All-optical spiking neurosynaptic networks with self-learning capabilities,” Nature 569, 208–214 (2019). [CrossRef]  

34. P. Y. Ma, A. N. Tait, T. F. de Lima, et al., “Photonic independent component analysis using an on-chip microring weight bank,” Opt. Express 28, 1827–1844 (2020). [CrossRef]  

35. V. Bangari, B. A. Marquez, H. Miller, et al., “Digital electronics and analog photonics for convolutional neural networks (DEAP-CNNs),” IEEE J. Sel. Top. Quantum Electron. 26, 7701213 (2020). [CrossRef]  

36. M. H. Khan, H. Shen, Y. Xuan, et al., “Ultrabroad-bandwidth arbitrary radiofrequency waveform generation with a silicon photonic chip-based spectral shaper,” Nat. Photonics 4, 117–122 (2010). [CrossRef]  

37. M. Miscuglio and V. J. Sorger, “Photonic tensor cores for machine learning,” Appl. Phys. Rev. 7, 031404 (2020). [CrossRef]  

38. Z. He, J. Cheng, X. Liu, et al., “Microcomb-driven optical convolution for car plate recognition,” Photonics 10, 972 (2023). [CrossRef]  

39. A. Jha, C. Huang, T. F. deLima, et al., “Nanophotonic cavity based synapse for scalable photonic neural networks,” IEEE J. Sel. Top. Quantum Electron. 28, 6100908 (2022). [CrossRef]  

40. Z. Cheng, J. Zhang, J. Dong, et al., “Compact high-contrast silicon optical filter using all-passive and CROW Fano nanobeam resonators,” Opt. Lett. 46, 3873–3876 (2021). [CrossRef]  

41. Y. Zhang, Y. He, Q. Zhu, et al., “Single-resonance silicon nanobeam filter with an ultra-high thermo-optic tuning efficiency over a wide continuous tuning range,” Opt. Lett. 43, 4518–4521 (2018). [CrossRef]  

42. T. Tanabe, M. Notomi, S. Mitsugi, et al., “All-optical switches on a silicon chip realized using photonic crystal nanocavities,” Appl. Phys. Lett. 87, 151112 (2005). [CrossRef]  

43. P. Yu, H. Qiu, T. Dai, et al., “Ultracompact channel add-drop filter based on single multimode nanobeam photonic crystal cavity,” J. Lightwave Technol. 39, 162–166 (2021). [CrossRef]  

44. L. Liu, M. Ye, Z. Yu, et al., “Notch microwave photonic filter with narrow bandwidth and ultra-high all-optical tuning efficiency based on a silicon nanobeam cavity,” J. Lightwave Technol. 41, 5051–5058 (2023). [CrossRef]  

45. R. Miura, S. Imamura, R. Ohta, et al., “Ultralow mode-volume photonic crystal nanobeam cavities for high-efficiency coupling to individual carbon nanotube emitters,” Nat. Commun. 5, 5580 (2014). [CrossRef]  

46. Y. Choi, Y. Uh, J. Yoo, et al., “StarGAN v2: diverse image synthesis for multiple domains,” in IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) (2020), pp. 8185–8194.

47. Z. He, J. Cheng, H. Zhou, et al., “BP neural network enables 2bits precision improvement for microring weight bank,” in 14th International Photonics and Optoelectronics Meetings (POEM) (2023), paper 1261403.

48. Z. Cheng, J. Dong, and X. Zhang, “Ultracompact optical switch using a single semisymmetric Fano nanobeam cavity,” Opt. Lett. 45, 2363–2366 (2020). [CrossRef]  

49. J. Zhang, Z. Cheng, J. Dong, et al., “Cascaded nanobeam spectrometer with high resolution and scalability,” Optica 9, 517–521 (2022). [CrossRef]  

50. S. N. Zheng, J. Zou, H. Cai, et al., “Microring resonator-assisted Fourier transform spectrometer with enhanced resolution and large bandwidth in single chip solution,” Nat. Commun. 10, 2349 (2019). [CrossRef]  

51. J. D. Joannopoulos, S. G. Johnson, J. N. Winn, et al., Photonic Crystals: Molding the Flow of Light (Princeton University, 1995).

52. Z. Cheng, Y. Zhao, J. Zhang, et al., “Generalized modular spectrometers combining a compact nanobeam microcavity and computational reconstruction,” ACS Photonics 9, 74–81 (2022). [CrossRef]  

53. H. Jiang, H. Liang, R. Luo, et al., “Nonlinear frequency conversion in one dimensional lithium niobate photonic crystal nanocavities,” Appl. Phys. Lett. 113, 021104 (2018). [CrossRef]  

54. T. K. Fryett, Y. Chen, J. Whitehead, et al., “Encapsulated silicon nitride nanobeam cavity for hybrid nanophotonics,” ACS Photonics 5, 2176–2181 (2018). [CrossRef]  

Supplementary Material (1)

NameDescription
Supplement 1       Supplemental document

Data availability

Data underlying the results presented in this paper are not publicly available at this time but may be obtained from the authors upon reasonable request.

Cited By

Optica participates in Crossref's Cited-By Linking service. Citing articles from Optica Publishing Group journals and other participating publishers are listed here.

Alert me when this article is cited.


Figures (5)

Fig. 1.
Fig. 1. (a) Schematic representation of the PCNC matrix core. (b) Advantages of the PCNC scheme compared with MRR. (c) Principle of PCNC weight bank.
Fig. 2.
Fig. 2. Design and characteristics of PCNC units. (a) Schematic diagram of a single PCNC unit. (b) Key structural parameters and elliptical hole distribution characteristics of the PCNC unit. The absolute values of serial numbers represent the distance of holes from the center, while the positive and negative signs indicate different directions. (c) Scanning electron microscope (SEM) image of one PCNC unit. (d) Manufactured ${3} \times {3}$ PCNC matrix core, with an additional row for backup. (e) Photonic crystal band diagram of a PCNC, retaining only bandgap information. (f) Optical field distribution of resonant wavelengths (top) and non-resonant wavelengths (bottom) obtained through three-dimensional time-domain finite-difference (3D-FDTD) simulations. (g) Measured transmission spectra of one row (top) and thermal tuning characteristics of the transmission spectrum of a single unit (bottom).
Fig. 3.
Fig. 3. Image convolution implemented by PCNC core. (a) Experimental setup diagram. (b) Matrix convolution results using different kernels, including several classical image processing techniques. EOM: electro-optical Mach–Zehnder modulator. MUX, wavelength multiplexer. PD, photodetector. FPGA, field programmable gate array.
Fig. 4.
Fig. 4. Cat and dog image recognition experiment using a ${3} \times {3}$ PCNC core. (a) Architecture of the convolutional neural network employed. (b) Subset of the test dataset images and (c) corresponding output feature maps. (d) Scatter plot for measuring convolution accuracy. The inset presents a histogram of the residual distribution, illustrating the differences between measured and computed values. (e) Confusion matrices for cat and dog image classification on the AFHQ test dataset. Conv., convolution layer. ReLU, rectified linear unit. FC, fully connected layer.
Fig. 5.
Fig. 5. Handwritten digit recognition experiment using a six-channel PCNC core. (a) Manufactured six-channel PCNC matrix core chip. (b) Transmission spectra of one of the rows. (c) Confusion matrices for handwritten digit classification on the MNIST test dataset.

Tables (1)

Tables Icon

Table 1. Performance Comparison of the Fabricated PCNC and MRR Unit

Equations (1)

Equations on this page are rendered with MathJax. Learn more.

N c F δ ω ,
Select as filters


Select Topics Cancel
© Copyright 2024 | Optica Publishing Group. All rights reserved, including rights for text and data mining and training of artificial technologies or similar technologies.