## Abstract

Polarization-division multiplexed (PDM) transmission based on the nonlinear Fourier transform (NFT) is proposed for optical fiber communication. The NFT algorithms are generalized from the scalar nonlinear Schrödinger equation for one polarization to the Manakov system for two polarizations. The transmission performance of the PDM nonlinear frequency-division multiplexing (NFDM) and PDM orthogonal frequency-division multiplexing (OFDM) are determined. It is shown that the transmission performance in terms of Q-factor is approximately the same in PDM-NFDM and single polarization NFDM at twice the data rate and that the polarization-mode dispersion does not seriously degrade system performance. Compared with PDM-OFDM, PDM-NFDM achieves a Q-factor gain of 6.4 dB. The theory can be generalized to multi-mode fibers in the strong coupling regime, paving the way for the application of the NFT to address the nonlinear effects in space-division multiplexing.

© 2017 Optical Society of America

## 1. Introduction

Nonlinear frequency-division multiplexing (NFDM) is an elegant method to address the nonlinear effects in optical fiber communication. In this approach, information is encoded in the nonlinear spectrum of the signal, defined by means of the nonlinear Fourier transform (NFT) [1–6]. The evolution of the nonlinear spectral components in fiber is governed by simple independent equations. As a result, the combined effects of the dispersion and nonlinearity can be compensated in the digital domain by the application of an all-pass-like filter. Furthermore, interference-free communication can be achieved in network environments.

The nonlinear spectrum consists of a continuous part and, in the case of the anomalous dispersion fiber, also a discrete part (solitonic component). In principle all degrees-of-freedom can be modulated. Prior work has mostly focused on either continuous spectrum modulation [7–11], or discrete spectrum modulation [7, 12, 13]. Transmission based on NFT has been experimentally demonstrated for the continuous spectrum [14–16], discrete spectrum [17], as well as the full spectrum [18].

Modern coherent optical fiber systems are based on polarization-division multiplexed (PDM) transmission to improve the achievable rates. However, research on data transmission using the NFT is limited to the scalar nonlinear Schrödinger equation, which does not take into account polarization effects (with the exception of [19]).

In this paper we overcome this limitation by generalizing the NFDM to the Manakov system, proposing PDM-NFDM. We develop a stable and accurate algorithm to compute the forward and inverse NFT of a two-dimensional signal. It is shown that the PDM-NFDM based on the continuous spectrum modulation is feasible, and that the data rate can be approximately doubled compared to the single polarization NFDM. Compared to the PDM-OFDM, the PDM-NFDM exhibits a peak Q-factor gain of 6.4 dB, in a system with 25 spans of 80 km standard single-mode fiber and 16 QAM.

In this paper, we set the discrete spectrum to zero and modulate only the continuous spectrum. Even though not all available degrees of freedom are modulated, the achievable rates reach remarkably close to the upper bound [20, 21].

The differential group delay and randomly varying birefringence give rise to temporal pulse broadening due to polarization-mode dispersion (PMD). The PMD might be compensated in conventional systems, and may even be beneficial in reducing the nonlinear distortions [22]. On the other hand, PMD changes the nonlinear interaction between signals in the two polarizations. The impact of PMD on NFDM is not fully investigated yet [23]. In this paper, we show that linear polarization effects can be equalized in PDM-NFDM at the receiver using standard techniques, and that the PMD does not seriously degrade performance.

## 2. Channel model with two polarizations

Light propagation in two polarizations in optical fiber is modeled by the coupled nonlinear Schrödinger equation (CNLSE) [24]. The fiber birefringence usually varies rapidly and randomly along the fiber in practical systems (on a scale of 0.3 to 100 meters). Under this assumption, the averaging of the nonlinearity in the CNLSE leads to the Manakov-PMD equation [25–27]:

**A**≡

**A**(

*Z, T*) is the 2 × 1 Jones vector containing the complex envelopes

*A*

_{1}and

*A*

_{2}of the two polarization components,

*Z*denotes the distance along the fiber,

*T*represents time,

*σ(Z*) is a 2 × 2 Hermitian matrix (defined by the state of polarization at

*Z*), and ∆

*β*

_{0}, ∆

*β*

_{1},

*α*,

*β*

_{2}and

*γ*are constant numbers.

The term *∂***A**/*∂T* is responsible for the PMD [27], while the second line represents loss, chromatic dispersion and Kerr nonlinearity. The factor 8/9 in front of the nonlinear term stems from polarization averaging.

The NFT applies to the integrable equations. However, the Manakov-PMD system (1) apparently is not integrable in the presence of loss or PMD. The loss may be approximately compensated using ideal distributed Raman amplification, leading to a lossless model with distributed additive white Gaussian noise (AWGN). Discrete amplification with short spans can also be modeled similarly, but with a modified nonlinearity coefficient (depending on loss); see [9]. Ignoring the PMD and, for the moment, noise, (1) is reduced to

It is convenient to normalize (2). Let *Z*_{0} = 1 and

Introducing the normalized variables *z* = *Z*/*Z*_{0}, *t* = *T*/*T*_{0} and **q** = **A**/*A*_{0}, (2) is simplified to the Manakov equation

*s*= 1 in the normal dispersion (defocusing) regime and

*s*= −1 in the anomalous dispersion (focusing or solitonic) regime.

It is shown in [4] that (4) is integrable. In what follows, we develop the corresponding NFT.

## 3. NFT of the two-dimensional signals

In this section, basics of the NFT theory of the two-dimensional signals, as well as numerical algorithms to compute the forward and inverse NFT, are briefly presented [28].

#### 3.1. Brief review of the theory

Equation (4) can be represented by a Lax pair $\widehat{L}$ and $\widehat{M}$. This means that, operators $\widehat{L}$ and $\widehat{M}$ can be found such that (4) is in one-to-one correspondence with the Lax equation $\partial \widehat{L}/\partial z=[\widehat{M},\widehat{L}]$. The Lax pair for the Manakov equation was found by Manakov in 1974 [28,29]. The operator $\widehat{L}$ is:

*s*= −1. Also assume the signal

*q*to be compactly supported.

The eigenvalue problem

can be solved for the Jost function v (eigenvector) assuming that the signals vanish at*t*= ±∞. This gives rise to six boundary conditions for

*v*, denoted by

**j**

_{±}:

*e*

^{(}

^{k}^{)}are unit vectors, i.e., ${e}_{l}^{(k)}={\delta}_{kl}$,

*k, l*= 0, 1, 2. Each of the boundary conditions (7) is bounded when

*λ*∈ ℂ

^{+}or

*λ*∈ ℂ

^{−}. The eigenvalue problem (6) under the boundary conditions (7) can be solved, obtaining six Jost functions ${\{{\mathbf{j}}_{\pm}^{(i)}(t,\lambda )\}}_{i=0,1,2}$ for all

*t*. It can be shown that ${\{{\mathbf{j}}_{+}^{(i)}(t,\lambda )\}}_{i=0,1,2}$ and ${\{{\mathbf{j}}_{-}^{(i)}(t,\lambda )\}}_{i=0,1,2}$ each form a complete (and for

*λ*∈ ℝ) basis for the solution space of (6). Thus we can expand ${\mathbf{j}}_{-}^{(0)}(t,\lambda )$ in the basis of ${\{{\mathbf{j}}_{+}^{(i)}(t,\lambda )\}}_{i=0,1,2}$:

*a*(

*λ*),

*b*

_{1}(

*λ*) and

*b*

_{2}(

*λ*) are called nonlinear Fourier coefficients. It can be shown that in the focusing regime the inner product of two Jost functions corresponding to the same eigenvalue does not depend on time. This implies that

*a*(

*λ*) and

*b*

_{i}(

*λ*) do not depend on time (as the notation in (8) suggests). The NFT of the

**q**= [

*q*

_{1},

*q*

_{2}] is now defined as

*λ*,

_{j}*j*= 1; 2; ⋯,

*N*, are the solutions of

*a*(

*λ*) = 0 in

_{j}*λ*∈ ℂ

_{j}^{+}.

An important property of NFT(*q*(*t, z*))(*λ*) is that it evolves in distance according to the all-pass-like filter

### Remark

Note that if *q*_{1} or *q*_{2} is set to zero in the Manakov equation and the associated $\widehat{L}$ operator (5), the equation and the operator are reduced, respectively, to the scalar NLSE and the corresponding $\widehat{L}$ operator [20]. Also, the theory in this section can straightforwardly be generalized to include any number of signals *q _{i}*,

*i*= 1, 2, …, for instance, LP modes propagating in a multi-mode fiber in the strong coupling regime.

*Remark* Since the Jost vectors for *λ* ∈ ℝ are orthonormal at all times, (8) implies the unimodularity condition

#### 3.2. Forward NFT algorithm

We develop the NFT algorithms for the continuous spectrum that is considered in this paper. The forward and inverse NFT algorithms are, respectively, based on the Ablowitz-Ladik and discrete layer peeling (DLP) methods. These algorithms generalize the corresponding ones in [10]. It is important to note that the design of accurate and fast algorithms to compute the nonlinear Fourier transform and its reverse is a complicated task [30–32]. In the following we intend to present an algorithm that is relatively straightforward to derive. We do not claim this algorithm to be the most accurate or most efficient possible with current techniques.

We begin by rewriting the eigenvalue problem $\widehat{L}v=\lambda v$ in the form *∂v*/*∂t* = *Pv*, where

*z*. We discretize the time interval [

*T*

_{0},

*T*

_{1}] according to

*t*[

*k*] =

*T*

_{0}+

*k*Δ

*T*, where ∆

*T*= (

*T*

_{1}−

*T*

_{0})/

*N*such that

*t*[0] =

*T*

_{0}and

*t*[

*N*] =

*T*

_{1}. We set

*q*[

_{i}*k*] =

*q*(

_{i}*T*

_{0}+

*k*Δ

*T*) and similarly for the vector

*v*. The Ablowitz-Ladik method is a discretization of

*∂v*/

*∂t*=

*Pv*based on the integrable discretized Manakov equation [33] as follows

*Q*[

_{i}*k*] =

*q*[

_{i}*k*]∆

*T*,

*z*:=

*e*

^{−2j}

^{λ}^{∆}

*and*

^{T}*k*= 0, 1, ⋯,

*N*− 1. With this discretization v becomes a periodic function of

*λ*with period

*π/*∆

*T*. We introduced a normalization factor ${c}_{k}=1/\sqrt{\left|\mathrm{det}P[k]\right|}$ in (13), where |det

*P*[

*k*]| = 1 + |

*Q*

_{2}[

*k*]|

^{2}+ |

*Q*

_{2}[

*k*]|

^{2}, to improve numerical stability. This factor does not follow from theory and could be defined in a different way. The iterative equation, Eq. (13), is initialized with

*N*iterations, the nonlinear Fourier coefficients are obtained as projections onto the Jost-solutions ${j}_{+}^{(i)}({T}_{1},\lambda )$,

*q*)(

*λ*) is then computed based on (9).

The algorithm can be efficiently implemented in the frequency domain. Let us write (13) as

*V*[

*k, λ*] := (

*A*[

*k, λ*],

*B*

_{1}[

*k, λ*],

*B*

_{2}[

*k, λ*])

*, and*

^{T}We discretize *λ* on the interval [Λ_{0}, Λ_{1}] with Λ_{1} = − Λ_{0} = *π*/(2Δ*T*) such that *λ*[*k*] = Λ_{0} + *k*ΔΛ. Let tilde ’~’ denote the action of the discrete Fourier transform DFT with respect to *λ*[*k*], e.g., *Ã*[., *l*] = DFT(*A*[., *λ*[*k*]]), where *l* = 0, 1, , ⋯, *N* − 1 is the discrete frequency. Note that $\text{DFT{}{z}^{-1}{B}_{i}[.,\lambda [k]]\text{}}[l]=\text{shift}\{{\tilde{B}}_{i}\}[l]$, where shift denotes circular right shift of the array by one element. Equation (16) in the frequency domain is

The initial condition is given by *Ã*[0] = DFT[*a*[*k*]][0] and ${\tilde{B}}_{i}[0]=0$ At *k* = *N* − 1, *a* and *b _{i}* are found by recovering

*V*[N,

*λ*] through an inverse DFT and using (17).

*Remark* Both NFT algorithm and the INFT algorithm can also be implemented via a polynomial approach. This might be more flexible and could make it easier to devise fast NFT/INFT algorithms.

#### 3.3. Inverse NFT algorithm

The inverse NFT algorithm consists of two steps. First, we compute *v*[*N, λ*] from the continuous spectra ${\widehat{q}}_{1,2}(\lambda )$ and invert the forward iterations (13). Second, at each iteration, we compute *Q*[*k*] from *v*[*k*, *λ*]. We start with ${\widehat{q}}_{i}(\lambda )$ discretized on the same lattice as introduced in Section 3.2.

Substituting ${b}_{i}(\lambda )={\widehat{q}}_{i}(\lambda )a(\lambda )$ in the unimodularity condition (11), we can compute |*a*(*λ*)|,

In the absence of a discrete spectrum, from (9), *a*(*λ*) ≠ 0 for all *λ*. We have Re(log(*a*)) = log(|*a*|) and Im(log(*a*)) = ∠*a*. Since *a*(*λ*) and therefore log(*a*) are analytic functions of *λ* we can recover the phase of a as ∠*a* = ℋ(log(|*a*|)), where ℋ denotes the discrete Hilbert transform. *b _{i}* is recovered from a and ${\widehat{q}}_{i}$ via ${b}_{i}(\lambda )={\widehat{q}}_{i}(\lambda )a(\lambda )$. Finally, we recover

*v*from

*a*and

*b*via Eqs. (14) and (15).

The inverse iterations are obtained by inverting the matrix in (13) and dropping terms of order ${Q}_{i}^{2}~\mathrm{\Delta}{T}^{2}$ (to yield the same accuracy as for the forward iterations):

In practice, as in the forward NFT, it is better to invert the frequency domain iterations (18). The signal *Q*[*k*] can then be obtained from *V*[*k* + 1, *l*] as follows. Recall that $\text{shift}\left[{\tilde{B}}_{i}[k]\right][l]={\tilde{B}}_{i}[k,l-1]$. Using the initial conditions for the *forward* iterations *Ã*[0, *l*] = *δ*_{0,}* _{l}* and ${\tilde{B}}_{i}[0,l]=0$, it is straightforward to show that

*Ã*[

*k*,

*l*] = 0 and ${\tilde{B}}_{i}[k,l]=0$ for

*l*≥

*k*> 0. In particular, ${\tilde{B}}_{i}[k,N-1]=0$ for

*k*<

*N*. For the first element of ${\tilde{B}}_{i}[k]$, from (18) and $\text{shift}\left[{\tilde{B}}_{1}[k]\right][-1]=\text{shift}\left[{\tilde{B}}_{1}[k]\right][N-1]=0$, we obtain

#### 3.4. Testing the NFT algorithms

The forward and inverse NFT numerical algorithms are tested as follows. Figure 1 compares a signal containing two polarization components in the nonlinear Fourier domain with its reconstruction after successive INFT and NFT operations. We take two displaced Gaussians with standard deviation $\sigma =\sqrt{2}$ as input signals for the two polarization components ${\widehat{\mathbf{q}}}_{i}$, *i* = 1, 2. We set time windows to *T* = 64 and take 2048 samples in time and nonlinear Fourier domain. In the discrete layer peeling method, the signal is periodic in nonlinear Fourier domain with period *π*/∆*T* ≈ 100.

Deviations occur as the amplitude of the signal is increased. The accuracy of the NFT (INFT) can be enhanced by increasing the sampling rate in time (nonlinear frequency), as this reduces errors due to to the piecewise-constant approximation of the signal. We find that we roughly need twice the number of samples to achieve the same accuracy as in the single-polarization case.

## 4. PDM-NFDM transmitter and receiver

In this section, we describe the transmitter (TX) and receiver (RX) digital signal processing (DSP) in PDM-NFDM and PDM-OFDM. A schematic diagram of the polarization-division multiplexed NFDM and OFDM transmission systems is shown in Fig. 2. The TX DSP produces digital signals for the in-phase and quadrature components of both polarization components, which are fed into the IQ-Modulator. The modulated signals for the two polarization components are combined in the polarization beam combiner before they enter the transmission line consisting of multiple fiber spans. We consider the practically relevant case of lumped amplification using Erbium-doped fiber amplifiers (EDFAs). After propagation through the fiber the two polarization components are separated in the polarization beam splitter and fed into the polarization-diversity intradyne coherent receiver, which provides the input to the RX DSP.

We briefly describe the TX and RX DSP in OFDM and NFDM. In the data modulator we first map the incoming bits to symbols taken from a QAM constellation. We then oversample the discrete-time signal in the time domain (by introducing zeros in the frequency domain outside the support of the signal) and then add guard intervals in the time domain. Increasing the guard interval increases the accuracy of the INFT. Unless stated otherwise, we do not increase the guard intervals during the computation of the INFT as the amplitude is increased, so there is a penalty due to inaccuracies in the algorithm. Up to here the DSP is identical for OFDM and NFDM. In contrast to OFDM, in NFDM these steps are however performed in what is called the *U*-domain; see [10]. The signal is subsequently mapped from the *U*-domain to nonlinear Fourier domain through the transformation

*U*-domain. At this point, we perform the inverse NFT in the case of NFDM and an inverse FFT in case of OFDM. Then we obtain the unnormalized signal by introducing units using (3). Finally we combine the OFDM or NFDM bursts to obtain the signal to be transmitted. The output of the DSP are the in-phase and quadrature components of the signals in the two polarization components. All steps in the DSP are performed independently for the two polarizations, except for the INFT, which requires joint processing.

At the RX we invert the steps of the TX DSP. The signal processing begins with burst separation and signal normalization using (3), followed by the forward NFT (FFT) in case of NFDM (OFDM). In NFDM, we subsequently equalize the channel using (10), which is a single-tap phase compensation. Similarly, in OFDM we either compensate the dispersion with the phase exp(−*j β*_{2}*ω*^{2}*N*_{span}*L*/2), or perform digital backpropagation (DBP) for a fixed number of steps per span. We then remove the guard intervals in time, down-sample the signal in the time domain, and obtain the output symbols. The bit error rate (BER) is calculated using a minimum distance decoder for symbols.

In Fig. 3 we compare the time domain signals of NFDM and OFDM at the TX for one of the polarization components. We use 112 subcarriers over a burst duration of *T*_{0} = 2ns in both cases. These parameters are the same as in [9] and correspond to a burst data rate of 224GBit/s per polarization at a Baudrate of 56GBaud. In NFDM the shape of the signal in the time domain changes as the amplitude $|{\widehat{q}}_{1,2}|$ is increased in the nonlinear Fourier domain. At a signal power of 0.5 dBm, one can see a significant amount of broadening of the NFDM signal in the time domain. This effect makes it difficult to control the time duration of signals in NFDM.

Pulse broadening in the time domain due to the chromatic dispersion can be estimated by [34]

where*B*is the signal bandwidth. The guard time intervals for minimizing the interaction among bursts can be approximated using (21). For the parameters in Table 1, we obtain ∆

*T ~*15ns, to which we add a 20% margin to obtain a total symbol duration of

*T*=

*T*

_{0}+

*T*

_{guard}= (2+18)ns= 20ns. This corresponds to an effective data rate of 44.8Gbit/s. The right panel of Fig. 3 shows the time domain signals in NFDM and OFDM in one polarization at the RX after propagation over 2000 km governed by the integrable model (2) (we consider transmission over the realistic fiber model (1) below in Sec. 5.3). We observe that the amount of the temporal broadening of the NFDM and OFDM signals is about the same. In our simulations we do not employ pre-compensation at the TX, which may be added to further increase the data rate [35].

## 5. Simulation results

In this section, we consider the system shown in Fig. 2 and compare the PDM-OFDM and PDM-NFDM via simulations, taking into account loss and PMD. Before turning to realistic simulations, we first assess the effect of fiber losses on the NFT transmission performance. In Sec. 5.2 we consider the propagation with loss and periodic amplification, setting the PMD to zero (namely, neglecting the terms in the first line of Eq. (1)). Next, the PMD effects are studied in Sec. 5.3. The system parameters used are summarized in Table 1.

#### 5.1. Effect of loss and lumped amplification

In the lumped amplification scheme, the signal is periodically amplified after each span. The channel is described by the model (1) including the attenuation term, which is manifestly not integrable. The effect of lumped amplification on NFT transmission has been studied previously [9].

We illustrate the effect of the attenuation by comparing the BER as a function of the OSNR in Fig. 4 for four scenarios: 1) the back-to-back (B2B) configuration (no fiber propagation); 2) lossless model (2) (there is propagation but without loss); 3) lossy propagation with lumped amplification; and 4) lossy propagation with lumped amplification that we approximate by a transformed-lossless model that is introduced below. In producing Fig. 4, we artificially introduced AWGN at the receiver in all cases, to exclude the effects of the signal-noise interaction from the comparison. We fix the power *P* = −3.1dBm to keep the accuracy of the NFT algorithms the same and vary the OSNR by changing the noise power.

We see that the BER in B2B and lossless models are approximately the same. This result verifies that the implementation of the NFT is correct. It also shows that the effect of burst interaction due to the finite guard interval is negligible.

The BER of the lossy model is significantly higher than the BER in the B2B and lossless models at high OSNRs. The BER of the lossy model seems to flatten out as the OSNR is increased.

Fiber loss can be taken into account in the NFT as follows. The attenuation term in (1) can be eliminated by a change of variable $A(z,t)={A}_{0}(z,t){e}^{-\frac{\alpha}{2}z}$ [36, 37]. This way the non-integrable equation, Eq. (1), can approximately be modeled by Eq. (2), which is integrable (thus with zero loss) and a modified nonlinearity parameter

We refer to the resulting model as the transformed-lossless model.

Note that the power of the signal *A*_{0}(*z, t*) in the transformed-lossless model is higher than the power of the *A*(*z, t*) in the original lossy model. That is because *γ*_{eff} < *γ*, which yields higher amplitudes according to (3). For our fiber parameters this approximately doubles the signal power to approximately 3dBm.

Figure 4 shows that the proposed scheme for loss cancellation is indeed surprisingly effective for the transmission over a lossy link with lumped amplification. Using *γ*_{eff} (*L*) in the inverse and forward NFTs, the BER of the lossless and transformed-lossless models are almost identical. This result is in line with the conclusion of [9] that the transformed-lossless effectively removes the penalty due to fact that the transmission is non-integrable. We note that with increasing transmission power, performance is expected to degrade.

#### 5.2. Transmission performance PDM-NFDM

In order to assess the transmission performance of the PDM-NFDM transmission, we performed system simulations as described in Sec. 4. For now we focus on deterministic impairments and neglect PMD. We simulate transmission of 112 subcarrier NFDM and OFDM pulses based on a 16QAM constellation over *N*_{span} = 25 spans of 80km of standard single-mode fiber. We assume ideal flat-gain amplifiers with a noise figure of *N _{F}* = 6.2dB. The system parameters are summarized in Table 1. They are comparable with those of [9] to facilitate a comparison with the single-polarization case.

We first compare single-polarization NFT transmission based on the NLSE with polarization-multiplexed transmission based on the Manakov equation. Figure 5 shows that in the linear regime the two curves are offset by 3dB because polarization multiplexing doubles the transmission power without increasing the signal to noise ratio. We note that in this calculation we increased the guard band and thereby the sampling rate in frequency domain by a factor of two in the computation of the PDM-NFT (the guard band in transmission is the same) compared to the single-polarization case. This is to avoid penalties due to inaccuracies in the NFT, which here amount to approximately 1dB at peak power if the same guard bands are used. Without this penalty, the *Q*-factor at peak power is roughly the same in both cases. This implies that data rates can approximately be doubled using polarization multiplexing. We show in Sec. 5.3 that this conclusion still holds in presence of polarization effects.

In Fig. 6 we compare the *Q*-factor as a function of launch power in PDM-OFDM and PDM-NFDM transmission. The optimum launch power in OFDM is significantly smaller than in NFDM. OFDM transmission is limited by nonlinearity, while NFDM is mainly limited by nonlinear signal-noise interaction. The *Q*-factor at optimum launch power is significantly higher in NFDM than in OFDM. We observe an overall gain of 6.4dB in *Q*-factor. Figure 7 shows the corresponding received constellations at the respective optimal launch power.

In Fig. 6 we further report results obtained by digital backpropagation after OFDM transmission. For a fair comparison with NFDM, we use the same sampling rate in the DBP and apply it without prior downsampling. OFDM with DBP achieves similar performance to NFDM for around 10 DBP steps per span and exceeds it for 16 steps per span. We note that here we considered a single-user scenario. The full potential of NFDM is leveraged in a network scenario, where DBP is less efficient. Our results are in good qualitative agreement with those reported in [9] for the single-polarization case, while achieving twice the data rate.

#### 5.3. Effect of PMD

### 5.3.1. Simulation of PMD

The fiber model (1) describes light propagation in linearly birefringent fibers [38, 39]. Here ∆*β*_{1} is the difference in propagation constants for the two polarization states which are induced by fiber imperfections or stress. In real fibers, this so-called modal birefringence varies randomly, resulting in PMD.

We simulate PMD with the coarse-step method, in which continuous variations of the birefringence are approximated by a large number of short fiber sections in which the birefringence is kept constant. PMD is thus emulated in a distributed fashion [40]. We use fixed-length sections of length 1km, larger than typical fiber correlation lengths. At the beginning of each section, the polarization is randomly rotated to a new point on the Poincaré sphere. We apply a uniform random phase in the new reference frame. The latter accounts for the fact that in reality the birefringence will vary in the sections where it is assumed constant, which will lead to a random phase relationship between the two polarization components [38]. The differential group delay (DGD) of each section is selected randomly from a Gaussian distribution. This way artifacts in the wavelength domain caused by a fixed delay for all sections are avoided [41].

Within each scattering section, Eq. (1) without PMD terms is solved using standard split-step Fourier integration. To speed up the simulations, we employ a CUDA/C++ based implementation with a MEX interface to Matlab.

The resulting DGD of the fiber is Maxwell distributed [42],

For the Maxwell distribution, the mean is related to the root-mean-square (RMS) delay by $\u3008\mathrm{\Delta}t\u3009\sqrt{3\pi /8}=\sqrt{\u3008\mathrm{\Delta}{t}^{2}\u3009}$. The average DGD varies with the square root of the fiber length [34],

*D*

_{PMD}is the PMD parameter. Typical PMD values for fibers used in telecommunications range from 0.05 in modern fibers to $0.5\text{ps}/\sqrt{\text{km}}$.

### 5.3.2. PMD impact on NFDM

In this section, we consider polarization effects on the NFT transmission. Since the nonlinear term in the Manakov equation is invariant under polarization rotations, the equalization can be done in in the nonlinear Fourier domain (cf. Fig. 2). We use a simple training sequence based equalization algorithm to compensate linear polarization effects. The samples of the first NFDM symbol are used as the training sequence. The filter taps of the equalizer are determined using least-squares estimation. To obtain the *Q*-factor we average the BER over 200 random realizations of PMD for each data point.

Due to its statistical nature, the effects of PMD are often quantified in terms of outage probabilities. In practice, the system is designed to tolerate a certain amount of PMD, in this case by fixing the number of taps in the equalizer. When the DGD exceeds this margin, the system is said to be in outage. We first determine the number of taps to achieve a given outage probability. Fig. 8 shows the *Q*-factor as a function of the number of taps for different values of the PMD parameter. For *D*_{PMD} = 0 we only have random polarization rotations and no DGD. Hence there is no interaction between nonlinearity and PMD and we use this case as the reference. The upturn for a small number of taps is due to the fact that a couple of taps are required to fully compensate the polarization rotations in presence of noise.

For finite PMD, the curves for different PMD values converge to the same result within error bars. We can estimate the required number of taps from the Maxwell distribution. In order for the system to fully compensate PMD in 98.7% of the cases (1.3% outage probability), the equalization must cover a time interval equal to the mean plus 3 standard deviations of the DGD distribution. For example, for ${D}_{\text{PMD}}=0.1\text{ps}/\sqrt{\text{km}}$ this interval corresponds to 12.8ps. Using the sampling frequency we find that we approximately need 12 taps to converge, consistent with the figure.

We note that by construction our equalizer also corrects potential phase rotations and therefore at least part of the nonlinear effects. In order to separate effects due to interaction of nonlinearity and PMD, we performed a calculation where we reversed the polarization rotations and phases exactly (instead of using the equalizer), by keeping track of the randomly generated angles and DGD values in the fiber simulation. We find a small phase rotation consistent with but smaller than the one reported in [23]. In our case it remains negligible at peak power.

In Fig. 9 we compare the *Q*-factor as a function of launch power with and without PMD effects. Here we use the number of taps determined as described above. We observe a penalty of roughly 0.3dB at peak power for ${D}_{\text{PMD}}=0.2\text{ps}/\sqrt{\text{km}}$ relative to the case without PMD $({D}_{\text{PMD}}=0\text{ps}/\sqrt{\text{km}})$. The penalty is therefore not serious for typical fibers used in telecommunication. This is in agreement with known results for PMD in OFDM [43], although in that case PMD sometimes is even benificial, as it can reduce deterministic interference [22]. Compared to the case without PMD effects and random birefringence (labeled “no birefringence”), we find a penalty of roughly 1.2dB at peak power for the case of zero PMD due to the equalization.

## 6. Conclusions

In this paper, we have proposed polarization-division multiplexing based on the nonlinear Fourier transform. NFT algorithms are developed based on the Manakov equations. Our simulations demonstrate feasibility of polarization multiplexed NFDM transmission over standard single-mode fiber. The results show that data rates can approximately be doubled in polarization-multiplexed transmission compared to the single-polarization case. This is an important step to achieve data rates that can exceed those of conventional linear technology.

Numerical simulations of polarization multiplexed transmission over a realistic fiber model including randomly varying birefringence and polarization mode dispersion have shown that penalties due to PMD in real fibers do not seriously impact system performance. Our fiber simulations are based on the Manakov equations. As a next step, which is beyond the scope of this paper, the results should be verified experimentally.

The equation governing light propagation of *N* modes in multi-mode fibers in the strong coupling regime can be written in the form [44]

**A**= (

*A*

_{1}, …,

*A*

_{N})

*and ${\overline{\beta}}_{2}$ denotes the average group velocity dispersion. The NFT, as well as the algorithms presented here, generalize to this equation in a straightforward manner. Our approach therefore paves the way to combining the nonlinear Fourier transform with space-division multiplexing.*

^{T}## Acknowledgments

J-W.G. would like to thank Wasyhun A. Gemechu for helpful discussions. H.H. acknowledges useful discussions with Djalal Bendimerad, Majid Safari, Sergei K. Turitsyn and Huijian Zhang.

## References and links

**1. **C. S. Gardner, J. M. Greene, M. D. Kruskal, and R. M. Miura, “Method for solving the Korteweg-deVries equation,” Phys. Rev. Lett. **19**, 1095–1097 (1967). [CrossRef]

**2. **M. J. Ablowitz, D. J. Kaup, A. C. Newell, and H. Segur, “Nonlinear evolution equations of physical significance,” Phys. Rev. Lett. **31**, 125–127 (1973). [CrossRef]

**3. **M. J. Ablowitz, D. J. Kaup, A. C. Newell, and H. Segur, “The inverse scattering transform - Fourier analysis for nonlinear problems,” Stud. Appl. Math. **53**, 249–315 (1974). [CrossRef]

**4. **M. J. Ablowitz and H. Segur, *Solitons and the Inverse Scattering Transform* (SIAM, 1981). [CrossRef]

**5. **M. I. Yousefi and F. R. Kschischang, “Information transmission using the nonlinear Fourier transform, part I: Mathematical tools,” IEEE Trans. Inf. Theory **60**, 4312–4328 (2014). [CrossRef]

**6. **M. I. Yousefi and F. R. Kschischang, “Information transmission using the nonlinear Fourier transform, part II: Numerical methods,” IEEE Trans. Inf. Theory **60**, 4329–4345 (2014). [CrossRef]

**7. **M. I. Yousefi and F. R. Kschischang, “Information transmission using the nonlinear Fourier transform, part III: Spectrum modulation,” IEEE Trans. Inf. Theory **60**, 4346–4369 (2014). [CrossRef]

**8. **S. T. Le, J. E. Prilepsky, and S. K. Turitsyn, “Nonlinear inverse synthesis for high spectral efficiency transmission in optical fibers,” Opt. Express **22**, 26720–26741 (2014). [CrossRef] [PubMed]

**9. **S. T. Le, J. E. Prilepsky, and S. K. Turitsyn, “Nonlinear inverse synthesis technique for optical links with lumped amplification,” Opt. Express **23**, 8317–8328 (2015). [CrossRef] [PubMed]

**10. **M. I. Yousefi and X. Yangzhang, “Linear and nonlinear frequency-division multiplexing,” https://arxiv.org/abs/1603.04389.

**11. **I. Tavakkolnia and M. Safari, “Signaling on the continuous spectrum of nonlinear optical fiber,” Opt. Express **25**, 18685–18702 (2017).

**12. **A. Hasegawa and F. Tappert, “Transmission of stationary nonlinear optical pulses in dispersive dielectric fibers. I. anomalous dispersion,” Appl. Phys. Lett. **23**, 142–144 (1973). [CrossRef]

**13. **A. Hasegawa and T. Nyu, “Eigenvalue communication,” J. Lightwave Technol. **11**, 395–399 (1993). [CrossRef]

**14. **S. T. Le, I. Philips, J. Prilepsky, P. Harper, N. Doran, A. D. Ellis, and S. Turitsyn, “First experimental demonstration of nonlinear inverse synthesis transmission over transoceanic distances,” in “Optical Fiber Communications Conference,” (Optical Society of America, 2016), pp. 1–3.

**15. **S. T. Le, I. D. Philips, J. E. Prilepsky, M. Kamalian, A. D. Ellis, P. Harper, and S. K. Turitsyn, “Achievable information rate of nonlinear inverse synthesis based 16QAM OFDM transmission,” in 2016 42nd European Conference on Optical Communication (ECOC), (VDE, 2016), pp. 1–3.

**16. **S. T. Le, H. Buelow, and V. Aref, “Demonstration of 64×0.5Gbaud nonlinear frequency division multiplexed transmission with 32QAM,” in Optical Fiber Communication Conference, (Optical Society of America, 2017), pp. W3J–1.

**17. **V. Aref, H. Bülow, K. Schuh, and W. Idler, “Experimental demonstration of nonlinear frequency division multiplexed transmission,” in 2015 41st European Conference on Optical Communication (ECOC), (IEEE, 2015), pp. 1–3.

**18. **V. Aref, S. T. Le, and H. Buelow, “Demonstration of fully nonlinear spectrum modulated system in the highly nonlinear optical transmission regime,” in 2016 42nd European Conference on Optical Communication (ECOC), (VDE, 2016), pp. 1–3.

**19. **A. Maruta and Y. Matsuda, “Polarization division multiplexed optical eigenvalue modulation,” in 2015 International Conference on Photonics in Switching (PS), (IEEE, 2015), pp. 265–267.

**20. **X. Yangzhang, M. Yousefi, A. Alvarado, D. Lavery, and P. Bayvel, “Nonlinear frequency-division multiplexing in the focusing regime,” in Optical Fiber Communication Conference, (Optical Society of America, 2017), pp. Tu3D–1.

**21. **M. I. Yousefi, G. Kramer, and F. R. Kschischang, “Upper bound on the capacity of the nonlinear Schrödinger channel,” https://arxiv.org/abs/1502.06455.

**22. **P. Serena, N. Rossi, and A. Bononi, “Nonlinear penalty reduction induced by PMD in 112 Gbit/s WDM PDM-QPSK coherent systems,” in 2009 35th European Conference on Optical Communication, (2009), pp. 1–2.

**23. **I. Tavakkolnia and M. Safari, “Effect of PMD on the continuous spectrum of nonlinear optical fibre,” in 2017 65th European Conference on Optical Communication (ECOC), (2017), pp. ci-p-11.

**24. **C. R. Menyuk, “Pulse propagation in an elliptically birefringent Kerr medium,” IEEE J. Quantum. Electron. **25**, 2674–2682 (1989). [CrossRef]

**25. **P. Wai and C. R. Menyuk, “Polarization mode dispersion, decorrelation, and diffusion in optical fibers with randomly varying birefringence,” J. Lightwave Technol. **14**, 148–157 (1996). [CrossRef]

**26. **D. Marcuse, C. Manyuk, and P. Wai, “Application of the Manakov-PMD equation to studies of signal propagation in optical fibers with randomly varying birefringence,” J. Lightwave Technol. **15**, 1735–1746 (1997). [CrossRef]

**27. **C. R. Menyuk and B. S. Marks, “Interaction of polarization mode dispersion and nonlinearity in optical fiber transmission systems,” J. Lightwave Technol. **24**, 2806–2826 (2006). [CrossRef]

**28. **S. V. Manakov, “On the theory of two-dimensional stationary self-focusing of electromagnetic waves,” Soviet Physics-JETP **38**, 248–253 (1974).

**29. **A. Degasperis and S. Lombardo, “Integrability in action: Solitons, instability and rogue waves,” in *Rogue and Shock Waves in Nonlinear Dispersive Media*, (Springer, 2016), pp. 23–53. [CrossRef]

**30. **S. Wahls and H. V. Poor, “Fast numerical nonlinear Fourier transform,” IEEE Trans. Inf. Theory **61**, 6957–6974 (2015). [CrossRef]

**31. **S. Wahls and H. V. Poor, “Fast inverse nonlinear Fourier transform for generating multi-solitons in optical fiber,” in “Proc. IEEE Int. Symp. Inf. Theory (ISIT),” (2015), pp. 1676–1680.

**32. **V. Vaibhav, “Fast inverse nonlinear Fourier transforms using exponential one-step methods, part I: Darboux transformation,” https://arxiv.org/abs/1704.00951 (2017).

**33. **M. J. Ablowitz, Y. Ohta, and A. D. Trubatch, On discretization of the vector nonlinear Schrödinger equation, Phys. Lett. A **253**, 287–304 (1999). [CrossRef]

**34. **G. P. Agrawal, *Nonlinear Fiber Optics* (Academic Press, 2007).

**35. **S. Civelli, E. Forestieri, and M. Secondini, “Why noise and dispersion may seriously hamper nonlinear frequency-division multiplexing,” IEEE Photon. Technol. Lett. **3**, 1332 (2017.

**36. **L. F. Mollenauer, S. G. Evangelides, and H. A. Haus, “Long-distance soliton propagation using lumped amplifiers and dispersion shifted fiber,” J. Lightwave Technol. **9**, 194–197 (1991). [CrossRef]

**37. **K. Blow and N. Doran, “Average soliton dynamics and the operation of soliton systems with lumped amplifiers,” IEEE Photonics Technol. Lett. **3**, 369–371 (1991). [CrossRef]

**38. **P. Wai, C. R. Menyuk, and H. Chen, “Stability of solitons in randomly varying birefringent fibers,” Opt. Lett. **16**, 1231–1233 (1991). [CrossRef] [PubMed]

**39. **S. G. Evangelides, L. F. Mollenauer, J. P. Gordon, and N. S. Bergano, “Polarization multiplexing with solitons,” J. Lightwave Technol. **10**, 28–35 (1992). [CrossRef]

**40. **M. Bertolini, N. Rossi, P. Serena, and A. Bononi, “Do’s and don’ts for a correct nonlinear PMD emulation in 100Gb/s PDM-QPSK systems,” Opt. Fiber Technol. **16**, 274–278 (2010). [CrossRef]

**41. **M. Eberhard and C. Braimiotis, *Numerical Implementation of the Coarse-Step Method with a Varying Differential-Group Delay* (SpringerUS, Boston, MA, 2005), pp. 530–534.

**42. **C. D. Poole, J. H. Winters, and J. A. Nagel, “Dynamical equation for polarization dispersion,” Opt. Lett. **16**, 372–374 (1991). [CrossRef] [PubMed]

**43. **S. L. Jansen, I. Morita, T. C. W. Schenk, N. Takeda, and H. Tanaka, “Coherent optical 25.8-Gb/s OFDM transmission over 4160-km SSMF,” J. Lightwave Technol. **26**, 6 (2008). [CrossRef]

**44. **S. Mumtaz, R.-J. Essiambre, and G. P. Agrawal, “Nonlinear propagation in multimode and multicore fibers: Generalization of the Manakov equations,” J. Lightwave Technol. **31**, 398–406 (2013). [CrossRef]