Fast adaptation of multi-task meta-learning for optical performance monitoring

An algorithm is proposed for few-shot-learning (FSL) jointing modulation format identification (MFI) and optical signal-to-noise ratio (OSNR) estimation. The constellation diagrams of six widely-used modulation formats over a wide range of OSNR (10-40 dB) are obtained by a dual-polarization (DP) coherent detection system at 32 GBaud. We introduce auxiliary task to model-agnostic meta-learning (MAML) which makes the gradient of meta tasks decline faster in the direction of optimal target. Ablation experiments including multi-task model-agnostic meta-learning (MT-MAML), single-task model-agnostic meta-learning (ST-MAML) and adaptive multi-task learning (AMTL) are executed to train a data set with only 20 examples for each class. First, we discuss the impact from the number of shots and gradient descent steps for support set on the meta-learning based schemes to determine the best hyper parameters and conclude that the proposed method better captures the similarity between new and previous knowledge at 4 shot and 1 step. Withdrawn fine-tuning, the model achieves the lowest error ∼0.37 dB initially. Then, we simulate two other schemes (AMTL and ST-MAML), and the numerical results shows that mean square error (MSE) are ∼0.6 dB, ∼0.3 dB and ∼0.18 dB, respectively, proposed method has faster adaption to main task. For low order modulation formats, the proposed method almost reduces the error to 0. Meanwhile, we reveal the degree of deviation between the prediction and target and find that the deviation is mainly concentrated in the high OSNR range of 25-40 dB. Specifically, we investigate the variation curve of adaptive weights during pretraining and conclude that after 30 epoch, the model's attention was almost entirely focused on estimating OSNR. In addition, we study the generalization ability of the model by varying the transmission distance. Importantly, excellent generalization is also experimentally verified. In this paper, the method proposed will greatly reduce the cost for repetitively collecting data and the training resources required for fine-tuning models when OPM devices need to be deployed at massive nodes in dynamic optical networks.

Medienart:

E-Artikel

Erscheinungsjahr:

2023

Erschienen:

2023

Enthalten in:

Zur Gesamtaufnahme - volume:31

Enthalten in:

Optics express - 31(2023), 14 vom: 03. Juli, Seite 23183-23197

Sprache:

Englisch

Beteiligte Personen:

Zhang, Yu [VerfasserIn]
Zhou, Peng [VerfasserIn]
Liu, Yan [VerfasserIn]
Wang, Jixiang [VerfasserIn]
Li, Chuanqi [VerfasserIn]
Lu, Ye [VerfasserIn]

Links:

Volltext

Themen:

Journal Article

Anmerkungen:

Date Revised 21.07.2023

published: Print

Citation Status PubMed-not-MEDLINE

doi:

10.1364/OE.488829

funding:

Förderinstitution / Projekttitel:

PPN (Katalog-ID):

NLM359758398