Abstract
We propose a novel format-transparent phase estimation method in coherent optical systems by minimizing the Kullback-Leibler (KL) divergence between the signal constellation and the samples for estimation. The proposed metric exhibits the same standard deviation of estimation errors as that in conventional blind phase searching (BPS) with an infinite resolution, while eliminating the pre-decisions in BPS so that phase estimation can be realized using a recursive algorithm. The complexity and the performance of the proposed KL method are compared with previously reported 2-stage BPS, Kalman filtering, principal component analysis (PCA), and the PCA + BPS hybrid method. It is shown that this method is particularly suitable for probabilistically shaped (PS) formats in which the PCA method does not work properly. It exhibits better performance than Kalman filtering and the PCA + BPS hybrid method and has lower complexity than 2-stage BPS. It is also shown that the proposed KL method has a relaxed cycle-slip problem and is less sensitive to the noise due to a smaller number of samples compared to the 2-stage BPS and the PCA + BPS hybrid method. Therefore, the proposed method can be a very promising solution for format-transparent phase estimation in coherent optical systems, particularly using the PS formats.
© 2020 Optical Society of America under the terms of the OSA Open Access Publishing Agreement
1. Introduction
Future coherent optical systems will be high-capacity, power-efficient and adaptive, which are expected to be realized by using varied modulation levels with probabilistic shaping. Carrier phase estimation is an essential digital signal processing (DSP) unit in high-capacity coherent optical systems and there are many phase estimation methods proposed in the literature. Conventional Viterbi-Viterbi (VV) algorithm is only applicable to the quadrature phase shift keying (QPSK) format [1]. Various improved versions can extend this method to high-order quadrature amplitude modulation (QAM), such as QPSK partition or proper selection of certain rings in the QAM constellation [2–5]. However, these methods exhibit either format dependency or a reduced linewidth tolerance due to the reduction of available symbols for estimation. This problem becomes more severe for probabilistically-shaped (PS) formats where the occurrence probabilities of different rings vary and can be very small. On the other hand, blind phase searching (BPS) is format transparent and can realize the optimal performance given a sufficient number of tested phases [6]. However, this method has high complexity. Consequently, methods such as 2-stage BPS [7] or the combination of BPS with the VV algorithm [8] were proposed to reduce the complexity. Decision aided maximum likelihood (DA-ML) can also achieve excellent performance but it also exhibits high complexity similar to the conventional BPS [9–10]. Recently, Kalman filtering [11–12] and principal component analysis (PCA) [13] were proposed to balance the performance and the complexity. Some other algorithms were also proposed for specific formats by exploiting the unique properties of these formats [14–15].
In this paper, we propose a novel phase estimation method for coherent optical systems, which is based on the fact that the Kullback-Leibler (KL) divergence between the signal constellation and the samples for estimation should be minimized when the phase is correctly compensated. This metric can be viewed as a generalized case of that in conventional BPS but avoids the pre-decisions so that phase estimation can be realized using a recursive algorithm without multiple phase tests. We compare the proposed KL method with previously reported 2-stage BPS, Kalman filtering, PCA and the PCA + BPS hybrid method. It is shown that the proposed method is format transparent and is particularly suitable for the PS formats. It exhibits better performance than Kalman filtering and the PCA + BPS hybrid method, and lower complexity than 2-stage BPS. This method also has a relaxed cycle-slip problem and is robust to the noise due to a smaller number of samples compared to 2-stage BPS and the PCA + BPS hybrid method. Therefore, the proposed method can be a promising solution for format-transparent phase estimation in coherent systems particularly using the PS formats.
2. Principle
2.1 Principle of the proposed method based on the KL divergence
Because phase estimation is performed after static dispersion compensation, polarization demultiplexing and equalization, the effect of the phase noise can be modelled as:
where rn, sn and ϕn are the received and transmitted samples and the phase rotation at the time n, respectively. δn is the noise including but not limited to the amplified spontaneous emission (ASE) noise, thermal noise etc. The objective is to find an estimated phase φn,est=ϕn such that:We assume the PDF of the samples without the phase noise, i.e. the constellation in Fig. 1, is f(x, y) and that of rn·exp(-jφn) is ${f_{{r_n},{\varphi _n}}}(x,y)$, where x and y represent the real and imaginary variables in the constellation and φn is a phase variable. The metric to evaluate the distance between the two PDFs is the KL divergence or the relative entropy:
The above derivation shows that phase estimation can be realized by minimizing F(wn), which can be viewed as a generalized case of that in conventional BPS, as proved in the Appendix. Figures 2(a) and (b) show F(wn) versus phase error for PS 16QAM and PS 64QAM, respectively. E in the figure represents the entropy. The optical signal-to-noise ratios (OSNRs) are set such that the BERs in different cases are similar. These parameters match those in the simulations in the next section. It is seen that the value of F(wn) is indeed minimal at the point of ϕn-φn,est=0 regardless of the OSNR values and the formats. A steeper slope around ϕn-φn,est=0 is observed for a larger E due to a higher OSNR. In general, a steeper slope implies that F(wn) is more sensitive to the phase error and so it is easier to identify the point of ϕn-φn,est=0.
In practice, a limited number of samples should be used for phase estimation. In this case, the minimal value may not always occur at ϕn-φn,est=0, resulting in estimation errors. Figures 2(c) and (d) show the standard deviation (STD) of estimation errors versus the number of samples. The results of the conventional BPS with a sufficient number of tested phases are also given for comparison. In both methods, we use 200 sets of samples with the number of samples in each set varying from 6 to 300. In the proposed KL method, we calculate the F(wn) versus ϕn-φn,est curve similar to Figs. 2(a) and (b) and find the ϕn-φn,est that gives the minimal F(wn). Then we calculate the STD of ϕn-φn,est based on the 200 sets of samples. In the conventional BPS, the samples are pre-decided and the metric is the sum of the distances instead of F(wn). From the figure, it is seen that the metric F(wn) exhibits the same STD of estimation errors as that in the conventional BPS for both PS 16QAM and PS 64QAM. It is also seen that a steeper slope around the point of ϕn-φn,est=0 in Figs. 2(a) and (b) indeed results in a small STD. Note that the STD does not fully reflect the influence of the estimation error on the bit error rate (BER) performance. For example, the BER of PS 64QAM is degraded more significantly than that of PS 16QAM even under the same STD of estimation errors.
The STD in Figs. 2(c) and (d) represents the static performance of F(wn), given an arbitrary but unknown phase error. It does not consider the speed of phase change that is reflected by linewidth. In practice, we may utilize F(wn) in the same way as conventional BPS, i.e. test multiple phases, calculate F(wn) using rn and f(x,y) for each phase, and the optimal phase is that gives the minimal F(wn). Additional results show that the use of F(wn) in this way results in the same performance as conventional BPS. However, this implementation results in high complexity by testing multiple phases. Instead, because pre-decisions are not required in Eq. (4), the optimal wn can be obtained by a recursive algorithm:
The derivations above assume that the elements in rn have the same phase. When the linewidth is large, the phases within rn cannot be viewed as a constant and in this case the samples with the phase deviating more from the common phase error of rn may result in a larger Q(R{ri·wn}, I{ri·wn})·ri* and thus introduce additional noise to the algorithm. Therefore, we improve Eq. (8) by normalizing Q(R{ri·wn}, I{ri·wn})·ri* as:
Instead of complex multiplications in Eq. (8-1), we extract the phases of Q(x, y) and ri to normalize the impact of all samples in Eq. (9-1). In practice, Angle(ri) and Angle(Q(x, y)) can be readily realized by look-up tables. The complexity of the proposed method will be analyzed and compared with 2-stage BPS, PCA, and Kalman filtering in the next subsection.
After phase compensation, the PDF f(x, y) is used in the MLSD or BMD using SD-FEC for symbol/bit decision. In MLSD, the decision is made by:
2.2 Complexity analysis and comparison
We will compare the complexity of the proposed KL method with those of 2-stage BPS, PCA, PCA + BPS hybrid and Kalman filtering. In the square QAM, there is π/2 phase ambiguity. In this paper, this problem is solved by comparing the estimated phases of adjacent blocks [6], together with pilot symbols inserted every 120 symbols [18]. Note that as shown later, in contrast to BPS and PCA, the proposed KL method does not have the cycle-slip problem for small and moderate linewidths and only requires to correct the phase ambiguity for large linewidths. For simplicity, the complexity here excludes the correction of the phase ambiguity. The calculation is based on a block with 2 m samples.
In the proposed method, we calculate the complexity based on Eq. (9), which is used in the simulations in the next section. The calculation of 2m ri·wn in Eq. (9-1) requires 4 × 2m real multiplications and 2 × 2m real additions. For Angle(Q(R{ri·wn}, I{ri·wn})), we build up a two-dimensional look-up table with each dimension having a q-bit resolution. During the operation, we quantize the real and imaginary parts of ri·wn and read the corresponding Angle(Q(R{ri·wn}, I{ri·wn})) from the look-up table. The complexity of the quantization is q×2 × 2m comparisons. As shown later, q of 5∼6 can achieve the optimal performance for both PS 16QAM and PS 64QAM. Angle(ri) can be obtained in a similar way with q×2 × 2m comparisons for the quantization. Therefore, the total complexity of Eq. (9-1) is 4 × 2m+2 real multiplications, 2 × 2m+2m+2 × 2m real additions and q×4 × 2m comparisons. Finally, the complexity of the normalization in Eq. (9-2) is 5 real multiplications and 1 real addition. The above process is iterated for U times. Note that ri only needs to be quantized once regardless of U. Therefore, the total complexity is (8m+7)U real multiplications, (10m+1)U real additions, and 4mq(U+1) comparisons. As shown later, U of 1∼2 is sufficient for small and moderate linewidths but more iterations are required for large linewidths. In this paper, U is 4 unless otherwise stated.
In 2-stage BPS [7], the compensation of ri using each tested phase requires 4 × 2m real multiplications and 2 × 2m real additions. The obtained signals are then compared with the constellation of PS 16QAM or PS 64QAM to make pre-decisions. This requires 2m × log2(K) comparisons. The distances between the compensated ri and their pre-decisions are then calculated and summed, which requires 2m×2 real multiplications and 2m×4-1 real additions. The above process is iterated with the tested phases at the first and second stages as B1 and B2, respectively. B1-1 and B2-1 comparisons are also needed at each stage to find the optimal phase. Therefore, the total complexity is 12m(B1+B2) real multiplications, (12m-1)·(B1+B2) real additions and (2mlog2(K) + 1)·(B1+B2)-2 comparisons. It is shown later that B1 and B2 should be larger than 8 and 4 respectively to avoid the performance penalty.
In the PCA method [13], the square operation for the 2m samples requires 4 × 2m real multiplications and 2 × 2m real additions. The calculation of the covariance matrix in Eq. (2) of [13] requires 4 × 2m real multiplications and 4×(2m-1) real additions. Updating the principal component in Eq. (3) of [13] needs 8 real multiplications and 4 real additions, while the normalization in Eq. (4) of [13] requires 5 real multiplications and 1 real addition. Finally, the calculation of the phase requires 1 real multiplication and a look-up table. The total complexity is 16m+14 real multiplications and 12m+1 real additions.
As shown later, the PCA method exhibits significant performance degradation for the PS formats. This is because the low occurrence probability of high-power constellation points in the PS formats weakens the principal component and consequently degrades the performance. Therefore, we also compare the PCA + BPS hybrid method in this paper. In this case, the total complexity is 16m+14 + 12mB2 real multiplications, 12m+1+(12m-1)B2 real additions and (2mlog2(K) + 1)B2-1 comparisons, where the number of tested phases at the second stage, B2, should be larger than 4 to avoid the performance penalty.
Finally, we calculate the complexity of Kalman filtering, with the block diagram depicted in Fig. 2 of [11]. Note that different from the sample-by-sample estimation in [11], the calculation here is based on a block with 2 m samples. The influence of the number of samples will be investigated in the next section. The complexity of the pre-decision and the phase rotations before and after the pre-decision is 8 × 2 m real multiplications, 4 × 2 m real additions, and 2m × log2(K) comparators. The complexity to calculate the Kalman gain is 6 × 2 m real multiplications and 2 m real additions. The complexity to update the phase is 4 × 2 m real multiplications and 6 × 2 m real additions. Finally, updating the covariance matrix requires 4 × 2m+2 real multiplications and 4 × 2m-1 real additions. Therefore, the total complexity is 44m+2 real multiplications, 30m-1 real additions and 2m × log2(K) comparators.
Table 1 summarizes the complexity of different methods. Figure 3 shows the required number of multiplications and additions versus 2 m. In the figure, U in the proposed method is 4 or 2. In 2-stage BPS, B1=8 and B2=4. B2 in the PCA + BPS hybrid method is 4. These parameters match the simulations in the next section. It is seen that PCA exhibits the least complexity. However, this method shows significant performance degradation for the PS formats. Combination of PCA and BPS reduces the performance penalty but at the expense of higher complexity. Kalman filtering requires a smaller number of additions but more multiplications than the proposed KL method with U=4. As shown later, this method also has a less linewidth tolerance than the KL method with U=2. Finally, 2-stage BPS has the highest complexity.
3. Simulation setup and results
3.1 Simulation setup
Simulations were performed to verify the proposed theory. Five methods were compared: 2-stage BPS, PCA, PCA + BPS hybrid, Kalman filtering and the proposed KL method. All methods were based on the same system parameters. Figure 4 depicts the simulation setup. A bit sequence was sent to a CCDM encoder to generate PS 16QAM or PS 64QAM with different entropies. Generally, the CCDM encoder should be followed by an FEC encoder, which was not considered in this paper for simplicity. The generated symbols were up-sampled to two samples per symbol and passed a finite impulse response (FIR) filter to pre-compensate the responses of the drivers and digital-to-analogue converters (DACs) so that the signal after the modulator exhibited a raised-cosine spectral profile with a roll-off factor of 0.5. The pre-filtered signals were sent to 100-GS/s DACs with an 8-bit resolution, amplified by drivers, and modulated a CW light with a dual-polarization (DP) I/Q modulator. In the optical link, the OSNR was controlled by a variable optical attenuator (VOA) followed by an Erbium-doped fiber amplifier (EDFA). The out-of-band optical noise was filtered by an optical filter with 60-GHz 3-dB bandwidth. A DP coherent receiver with a 5-order Bessel response was used for detection. The 3-dB bandwidth of the receiver was 40 GHz. The powers of the local oscillator and the signal were 10 dBm and 0 dBm, respectively. The electronic signals were sampled by 100-GS/s analogue-to-digital converters (ADCs). In DSP, the signals were polarization demultiplexed and filtered by FIR filters. The signals of two polarizations were combined for phase estimation since the phase offset between them was generally static and could be readily corrected [15]. The number of samples per polarization in each block was 30 unless otherwise stated. The performance was evaluated in terms of either BER using MLSD before the FEC or NGMI. The number of simulated symbols was 300,000.
3.2 Simulation results
Figure 5 shows the performance of different methods for (a) PS 16QAM with an entropy of 3.6 and (b) PS 64QAM with an entropy of 5. The linewidth is used for both the transmitter laser and the local oscillator. In all methods, the parameters are optimized to balance the performance and complexity. Specifically, in the proposed method, U is 4 and α in Eq. (9) is optimized. Angle(Q(x, y)) is realized by a look-up table with the resolution q of 6 bits and 8 bits for PS 16QAM and PS 64QAM, respectively. In 2-stage BPS, B1 and B2 are 8 and 4, respectively. In Kalman filtering, the noise parameters Rn and Qn as defined in [11] are 10−5 and 10−4, respectively. In the PCA + BPS hybrid method, B2 is 4 and the BPS searching range is [-π/16+φn, stage1, π/16+φn, stage1], where φn, stage1 is the estimated phase in the first PCA stage. From Fig. 5, it is seen that PCA exhibits the worst performance in both PS 16QAM and PS 64QAM. As shown later, this method works well for the conventional QAM. However, in the PS formats, the principal component is weakened due to lower occurrence probabilities of high-power constellation points, resulting in performance degradation especially for PS 64QAM. Combination of PCA with BPS can improve the performance. However, even for B2=4 under which the complexity of the hybrid method is greatly increased, its performance is still poorer than that of the proposed KL method. On the other hand, Kalman filtering has slightly higher complexity than the KL method but also shows a less tolerance to the linewidth. Finally, 2-stage BPS has the best performance but requires the highest complexity as shown in Fig. 3.
In order to understand the results in Fig. 5 more clearly, we investigate each of previously reported methods in more details and compare them with the proposed KL method. Figure 6 shows the performance of the PCA + BPS hybrid method for different searching ranges at the BPS stage. B2 in this method is 4. The results for the KL method with U of 4 are also given. Because the performance of the PCA method is poor as shown in Fig. 5, the searching range of the BPS stage, I, should be carefully optimized. When I is small, once ϕn-φn, stage1 is beyond this range, the performance is degraded regardless of the resolution of the BPS stage. When I is large, the resolution is reduced under the fixed B2. Consequently, in Fig. 6(a), it is seen that a smaller I of π/16 results in a less tolerance to the linewidth, while by using a larger I of π/4, penalties are observed for small and moderate linewidths. In Fig. 6(b), there are penalties for small and moderate linewidths even by using I of π/16. This is because the performance of PCA is so poor in this case that ϕn-φn, stage1 is beyond the BPS searching range even for small linewidths. From both figures, it is confirmed that the proposed KL method shows better performance than the PCA + BPS hybrid method under optimized searching range.
Figure 7 shows the performance of Kalman filtering with different noise parameters Rn. The performances of the proposed KL method with U of 4 and 2 are also given. Under a fixed Qn, the value of Rn balances the estimation accuracy and the tracking speed. When Rn is larger, Kalman filtering can compensate the phase noise more accurately for small and moderate linewidths but has a less tolerance to the linewidth. In contrast, when Rn is smaller, it can track the phase change more quickly but at the expense of the penalty for small and moderate linewidths. Note that Kalman filtering is also capable to compensate amplitude noise, which is not observed in Fig. 7 but becomes more visible as the number of samples reduces as shown later. The results also show that the performance of Kalman filtering with the optimized Rn is even not as good as that of the proposed KL method with U of 2.
Figure 8 shows the performance of 2-stage BPS under different values of B1 and B2. The BER of the proposed method with U of 4 and 8 are also given. In Fig. 8(a), there is a slight penalty for small and moderate linewidths when B1 and B2 in the BPS method are 4 and 4, respectively. This penalty becomes significant in Fig. 8(b). This is because the performance of PS 64QAM is more sensitive to the phase noise. In addition, if the searching range is wrongly identified at the first stage, the performance is degraded regardless of the resolution at the second stage. Therefore, B1 should be larger than B2 to avoid an inaccurate searching range at the second stage. It is also seen that although the metric F(wn) has the same STD of estimation errors as that in BPS as shown in Fig. 2, the proposed method still has a less linewidth tolerance due to the convergence of the recursive algorithm. A larger U can enhance the tolerance but increase the complexity. Thus, U of 4 is used to balance the performance and complexity.
After comparisons with previously reported methods, we will characterize the proposed KL method in more details. Figure 9 shows the performance of the KL method using Eq. (6) or Eq. (7) as the PDF in Q(x, y) and Eq. (9) or Eq. (8) for the recursive algorithm. It is observed that different PDFs do not influence the performance. However, Eq. (9) results in better performance than Eq. (8). As discussed previously, although Eq. (8) can minimize the KL divergence, it assumes that the phase within rn is a constant. This assumption is not valid for large linewidths, in which the samples with the phases deviating more from the common phase error of rn may have a larger impact and thus reduce the stability. In contrast, Eq. (9) normalizes the impact of all samples and thus improves the tolerance to the linewidth.
Figure 10 depicts the performance versus (a) the number of iterations and (b) the resolution of the lookup table. For moderate linewidths, 1∼2 iterations are sufficient to achieve the optimal performance. Four iterations can realize the near-optimal performance for 5-MHz and 2-MHz linewidths in PS 16QAM and PS 64QAM, respectively. From Fig. 10(b), it is seen that the performance becomes stable when the resolution increases to 6 bits in both PS 16QAM and PS 64QAM. This confirms that the proposed KL method can indeed achieve the performance advantage with the complexity depicted in Fig. 3.
In the above investigations, the problem of the π/2 phase ambiguity is solved by comparing the estimated phases of adjacent blocks, together with pilot symbols inserted every 120 symbols. However, these processes are not necessary for all methods. Figure 11 shows the performance when the cycle slips are not compensated (a) by comparing the phases of adjacent blocks and (b) by the pilot symbols. It is found that the cycle-slip problem always exists in 2-stage BPS, PCA and the PCA + BPS hybrid method such that their performances are very poor if the cycle slips are not compensated by comparing the estimated phases of adjacent blocks. In contrast, the cycle-slip problem does not exist for small and moderate linewidths in the proposed KL method and Kalman filtering, whose performances in Fig. 11(a) are not degraded compared to those in Fig. 5(a). On the other hand, when the linewidth is large, there is a probability that the phase difference between adjacent blocks is too large to be recovered, resulting in cycle slips in all methods. Therefore, as shown in Fig. 11(b), if the pilot symbols are not used, the linewidth tolerances of all methods are reduced compared to those in Fig. 5(a). The results verify that the proposed KL method and Kalman filtering outperform other methods on the cycle-slip problem and only require the correction of the phase ambiguity for large linewidths.
We also investigate the performance of different methods under other entropies, as shown in Fig. 12. In the figure, U in the proposed method is 4. B1 and B2 in 2-stage BPS are 8 and 4, respectively. B2 in the PCA + BPS hybrid method is 4. Other parameters are optimized in a similar way as in Fig. 5. Comparison with Fig. 5(a) confirms that PCA does not work properly in the PS formats. However, this method shows good performance in the conventional QAM as shown in Fig. 12(b), due to a strong principal component. Kalman filtering always shows a smaller linewidth tolerance than the proposed KL method. On the other hand, the proposed KL method exhibits degraded performance compared to 2-stage BPS. This is because searching the minimal F(wn) is based on the recursive algorithm whose convergence speed may not be sufficient for large linewidths. Note that by using the algorithm, the complexity is much less than the one using multiple phase tests to find the minimal F(wn).
The above results are obtained when the number of samples per polarization is 30. In practice, the block length is chosen to balance the linewidth tolerance and the noise. Figure 13 shows the performance of different methods when the number of samples per polarization is (a) 8 and (b) 4. In both figures, the performance of Kalman filtering with a larger Rn is also shown in addition to that with the optimized Rn of 10−4. Note that this optimal Rn is different from that in Fig. 5 because of a different number of samples. U in the proposed method is 4. B1 and B2 in 2-stage BPS are 8 and 4, respectively. B2 in the PCA + BPS hybrid method is 4. Other parameters are optimized in a similar way as in Fig. 5. It is seen that PCA is more sensitive to the number of samples. This is because a smaller number of samples cannot provide sufficient statistics for PCA. In the PCA + BPS hybrid method, inaccurate estimation at the PCA stage may result in a wrong searching range at the second stage. Consequently, this method exhibits a penalty for small and moderate linewidths in Fig. 13(a) while this penalty becomes more significant in Fig. 13(b). Kalman filtering with the optimized Rn has a similar sensitivity to the number of samples compared to the proposed KL method, but always shows a smaller linewidth tolerance. Note that Kalman filtering with a larger Rn may mitigate the noise effect particularly in Fig. 13(b). However, the tolerance to the linewidth is also reduced significantly compared to that with the optimized Rn. Finally, 2-stage BPS works well in Fig. 13(a). However, it shows a penalty for small and moderate linewidths in Fig. 13(b), implying that this method is more sensitive to the reduction of the number of samples compared to the proposed method.
Finally, we investigate the performance at lower OSNRs, which correspond to the cases using SD-FEC and so the metric of NGMI is employed. Figure 14(a) shows NGMI versus linewidth at 19-dB OSNR. It is seen that conclusions similar to Fig. 5 can be drawn. Compared to other methods, the PCA method shows a NGMI penalty even at small linewidths. Kalman filtering exhibits a smaller linewidth tolerance than 2-stage BPS, the proposed KL method, and the PCA + PBS hybrid method. The performance of 2-stage BPS is the best but this method has the highest complexity. Figure 14(b) depicts NGMI versus OSNR at 1-MHz (solid) and 5-MHz (dashed) linewidths. For all methods, NGMI reduces as the OSNR decreases. At 1-MHz linewidth, the performance of the PCA method is poorer than other methods. On the other hand, when the linewidth increases to 5 MHz, the performance of Kalman filtering is degraded more significantly due to a limited linewidth tolerance. The proposed KL method makes a balance between the performance and the complexity.
4. Summary
We have proposed a novel format-transparent phase estimation method in coherent optical systems by minimizing the KL divergence between the signal constellation and the samples for estimation. The proposed metric is a generalized case of that in conventional BPS but eliminates the requirement of pre-decisions so that phase estimation can be realized by a recursive algorithm. The complexity and the performance of the proposed method are compared with previously reported 2-stage BPS, Kalman filtering, PCA, and the PCA + BPS hybrid method. It is shown that the proposed method is particularly suitable for the PS formats in which the PCA method does not work properly. It exhibits better performance than Kalman filtering and the PCA + BPS hybrid method and lower complexity than the 2-stage BPS method. It is also shown that the proposed KL method has a relaxed cycle-slip problem and is robust to the noise due to a smaller number of samples compared to 2-stage BPS and the PCA + BPS hybrid method. The study confirms that the proposed method can be a promising solution for format-transparent phase estimation in coherent optical systems particularly using the PS formats.
Appendix
We will show that the metric in the conventional BPS can be viewed as a special case of the proposed F(wn). We assume ri·wn is pre-decided as ai,opt and Eq. (5) is approximated as
Funding
National Natural Science Foundation of China (61971199); Science and Technology Planning Project of Guangdong Province (2019A050503003); Guangzhou Science and Technology Planning Project (201904010298); Fundamental Research Funds for the Central Universities (D2193110).
Disclosures
The author declares no conflicts of interest.
References
1. A. Viterbi and A. Viterbi, “Nonlinear estimation of PSK-modulated carrier phase with application to burst digital transmission,” IEEE Trans. Inf. Theory 29(4), 543–551 (1983). [CrossRef]
2. I. Fatadin, D. Ives, and S. J. Savory, “Laser linewidth tolerance for 16-QAM coherent optical systems using QPSK partitioning,” IEEE Photonics Technol. Lett. 22(9), 631–633 (2010). [CrossRef]
3. V. Rozental, D. Kong, B. Corcoran, D. Mello, and A. J. Lowery, “Filtered carrier phase estimator for high-order QAM optical systems,” J. Lightwave Technol. 36(14), 2980–2993 (2018). [CrossRef]
4. T. Yang, C. Shi, X. Chen, M. Zhang, Y. Ji, F. Hua, and Y. Chen, “Linewidth-tolerant and multi-format carrier phase estimation schemes for coherent optical m-QAM flexible transmission systems,” Opt. Express 26(8), 10599–10615 (2018). [CrossRef]
5. Y. Li, M. W. Wu, X. Du, T. Song, and P. Y. Kam, “A refinement to the Viterbi-Viterbi carrier phase estimator and an extension to the case with a Wiener carrier phase process,” IEEE Access 7, 78170–78184 (2019). [CrossRef]
6. T. Pfau, S. Hoffmann, and R. Noe, “Hardware-efficient coherent digital receiver concept with feedforward carrier recovery for M-QAM constellations,” J. Lightwave Technol. 27(8), 989–999 (2009). [CrossRef]
7. J. Li, L. Li, Z. Tao, T. Hoshida, and J. C. Rasmussen, “Laser-linewidth-tolerant feed-forward carrier phase estimator with reduced complexity for QAM,” J. Lightwave Technol. 29(16), 2358–2364 (2011). [CrossRef]
8. J. R. Navarro, A. Kakkar, R. Schatz, X. Pang, O. Ozolins, F. Nordwall, H. Louchet, S. Popov, and G. Jacobsen, “High performance and low complexity carrier phase recovery schemes for 64QAM coherent optical systems,” in Proc. Optical Fiber Communication Conference (2017), paper W2A.53.
9. A. Meiyappan, P. Kam, and H. Kim, “On decision aided carrier phase and frequency offset estimation in coherent optical receivers,” J. Lightwave Technol. 31(13), 2055–2069 (2013). [CrossRef]
10. J. Lu, X. Li, S. Fu, M. Luo, M. Xiang, H. Zhou, M. Tang, and D. Liu, “Joint carrier phase and frequency offset estimation with parallel implementation for dual-polarization coherent receiver,” Opt. Express 25(5), 5217–5231 (2017). [CrossRef]
11. L. Pakala and B. Schmauss, “Extended Kalman filtering for joint mitigation of phase and amplitude noise in coherent QAM systems,” Opt. Express 24(6), 6391–6401 (2016). [CrossRef]
12. L. Li, Y. Feng, W. Zhang, N. Cui, H. Xu, X. Tang, L. Xi, and X. Zhang, “Extended Kalman filter for carrier frequency offset and carrier phase noise,” in Proc. Conference on Lasers and Electro-Optics (2017), paper Stu3M.7.
13. J. C. M. Diniz, Q. Fan, S. M. Ranzini, F. N. Khan, F. D. Ros, D. Zibar, and A. P. T. Lau, “Low-complexity carrier phase recovery based on principal component analysis for square-QAM modulation formats,” Opt. Express 27(11), 15617–15626 (2019). [CrossRef]
14. J. Zhao and P. D. Townsend, “Independent component analysis for phase and residual frequency offset compensation in OQAM multicarrier systems,” J. Lightwave Technol. 10.1109/JLT.2020.2983413.
15. J. Lu, S. Fu, Z. Hu, L. Deng, M. Tang, D. Liu, and C. K. Chan, “Carrier phase recovery for set-partitioning QAM formats,” J. Lightwave Technol. 36(18), 4129–4137 (2018). [CrossRef]
16. J. Cho, L. Schmalen, and P. J. Winzer, “Normalized generalized mutual information as a forward error correction threshold for probabilistically shaped QAM,” in Proc. European Conference on Optical Communications (2017), paper M.2.D.2.
17. D. Pilori, A. Nespola, P. Poggiolini, F. Forghieri, and G. Bosco, “Low-complexity non-linear phase noise mitigation using a modified soft-decoding strategy,” in Proc. Optical Fiber Communication Conference (2019), paper M1I.2.
18. S. Okamoto, F. Hamaoka, M. Nakamura, and Y. Kisaka, “Laser phase noise tolerance of probabilistically-shaped constellations,” in Proc. Optical Fiber Communication Conference (2018), paper W2A.51.