4.7 Article

Deep Neural Network-Aided Soft-Demapping in Coherent Optical Systems: Regression Versus Classification

Journal

IEEE TRANSACTIONS ON COMMUNICATIONS
Volume 70, Issue 12, Pages 7973-7988

Publisher

IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC
DOI: 10.1109/TCOMM.2022.3213284

Keywords

Artificial neural networks; Equalizers; Training; Task analysis; Adaptive optics; Propagation losses; Machine learning; Neural networks; nonlinear equalizer; classification; regression; coherent detection; digital signal processing; optical communications

Funding

  1. EU [813144]
  2. Leverhulme Trust [RP-2018-063]
  3. EPSRC project TRANSNET

Ask authors/readers for more resources

This article examines the use of predictive modeling, classification, and regression based on neural networks for soft-demapping in coherent optical communications. The study finds that regression-based learning is superior to classification-based learning in terms of Q-factor and achievable information rates. Additionally, the use of cross-entropy loss function may not always be the most suitable option due to learning problems.
We examine here what type of predictive modelling, classification, or regression, using neural networks (NN), fits better the task of soft-demapping based post-processing in coherent optical communications, where the transmission channel is nonlinear and dispersive. For the first time, we present possible drawbacks in using each type of predictive task in a machine learning context, considering the nonlinear coherent optical channel equalization/soft-demapping problem. We study two types of equalizers based on the feed-forward and recurrent NNs, for several transmission scenarios, in linear and nonlinear regimes of the optical channel. We point out that even though from the information theory perspective the cross-entropy loss (classification) is the most suitable option for our problem, the NN models based on the cross-entropy loss function can severely suffer from learning problems. The latter translates into the fact that regression-based learning is typically superior in terms of delivering higher Q-factor and achievable information rates. In short, we show by empirical evidence that loss functions based on cross-entropy may not be necessarily the most suitable option for training communication systems in practical scenarios when overfitting- and vanishing gradients-related problems come into play.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.7
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available