Low-threshold dual-polarization electro-optic nonlinear activation functions
In this paper, a scheme for realizing low-threshold dual-polarization electro-optic nonlinear activation functions is proposed, designed, and experimentally demonstrated. This scheme relies on a structure composed of an optical power splitter and a thermo-optic modulator based on Mach–Zehnder interf...
Saved in:
Main Authors: | , , , , , , , , , , |
---|---|
Format: | Article |
Language: | English |
Published: |
AIP Publishing LLC
2025-01-01
|
Series: | APL Photonics |
Online Access: | http://dx.doi.org/10.1063/5.0230013 |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Summary: | In this paper, a scheme for realizing low-threshold dual-polarization electro-optic nonlinear activation functions is proposed, designed, and experimentally demonstrated. This scheme relies on a structure composed of an optical power splitter and a thermo-optic modulator based on Mach–Zehnder interferometer, which are optimized by using direct binary search and particle swarm optimization algorithms. When a fixed proportion of the input optical signal is transformed into the electrical signal and the remaining input optical signal is modulated by using the thermo-optic effect, the rectified linear unit activation function with a low-threshold power of 0.2 mW for transverse-electric polarization and the Gaussian activation function with a low-threshold power of 0.1 mW for transverse-magnetic polarization can be measured separately. If the above-mentioned two nonlinear activation functions are introduced into the convolutional neural network to perform the modified National Institute of Standards and Technology handwritten digit classification task, validation accuracies of 97.3% and 96.85% will be achieved. |
---|---|
ISSN: | 2378-0967 |