Expand this Topic clickable element to expand a topic
Skip to content
Optica Publishing Group

Computing the information rate of discrete-time Wiener phase noise channels by parametric Bayesian tracking

Open Access Open Access

Abstract

A new upper bound (UB) on the information rate (IR) transferred through the additive white Gaussian noise channel affected by Wiener’s laser phase noise is proposed in the paper. The bound is based on Bayesian tracking of the noisy phase. Specifically, the predictive and posterior densities involved in the tracking are expressed in parametric form, therefore tracking is made on parameters. This make the method less computationally demanding than known non-parametric methods, e.g. methods based on phase quantization and trellis representation of phase memory. Simulation results show that the UB is so close to the lower bound that we can claim of having virtually computed the actual IR.

© 2016 Optical Society of America

1. Introduction

Multiplicative phase noise is a major source of impairment in radio and optical channels. Recent studies about the phase noise that arises in optical channels and about its effects in coherent optics can be found in [1–3]. The basic model of laser phase noise is a Wiener process, [3], [4], where the power spectral density (PSD) of the spectral line is a slope of −20 dB/decade. Several methods have been proposed in the literature to combat the detrimental effects of Wiener phase noise (WPN). Among these methods we cite iterative demodulation and decoding techniques of [5–7] and the insertion of pilot symbols (possibly with staged decoding [8].) Note however that single carrier systems affected by non-ideal frequency responses of the channel plus phase noise require specific mitigation techniques [9] that are not addressed in this paper.

The capacity of AWGN channels affected by phase noise with white PSD is studied in [1], [10, 11]. Computation of information rates (IRs) for the multiplicative WPN plus AWGN channel, which is a channel with memory and continuous state, is a challenging problem. Monte Carlo approaches based on non-parametric Bayesian tracking (BT) techniques of the hidden phase with memory, e.g. phase state-space quantization [12, 13] and particle filtering [14, 15], have been recently proposed for computing IRs. In [16] the method of particle filtering is adopted to work out an approximation to the IR.

The method proposed here is based on parametric representation of densities. This leads to efficient computation, since, as shown in the paper, the parameters can be updated in the iteration steps of the BT method by closed-form equations. Conversely, in non-parametric methods, densities are estimated and updated point-wisely in their support space, leading to a more demanding computation.

2. Channel and source model

Let uik indicate the vector (ui, ui+1, ⋯, uk) with uikUik, and let U indicate a stationary and ergodic process, U = (U0, U1, ⋯), whose generic realization is the sequence (u0, u1, ⋯). When Uik is a continuous set, p(uik) is used to indicate the multivariate probability density function (PDF), while when Uik is a discrete set p(uik) indicates the multivariate mass probability and |Ui| denotes the number of elements in Ui.

The k-th output of the channel is

yk=xkejϕk+wk,ϕk=[ϕk1+γvi]mod2π,k=1,2,,
where j is the imaginary unit, Y is the complex channel output process, X is the channel complex input modulation process, and Φ is the phase noise process which is assumed to be independent of X and W. We assume that the input process X is made of i.i.d. random variables with zero mean and unit variance. Process W is a complex AWGN process with zero mean and variance SNR−1. Process Φ is modeled as a Wiener process folded in the range [−π, π) where the frequency noise V is a i.i.d. sequence of Gaussian random variables with zero mean and unit variance, γ is a scalar, and ϕ0 is uniformly distributed in [−π, π).

Equations (1) can be cast in the framework of continuous-state first-order Markov channels, where the state at time k is ϕk. For this class of channels

p(ϕk|ϕk1,y1k1,x1k1)=p(ϕk|ϕk1).

Moreover, from (1) one realizes that the channel is memoryless given the state, therefore

p(yk,xk|ϕk,y1k1,x1k1)=p(yk,xk|ϕk).

3. Upper bound

Let h(U) denote the relative entropy rate of process U. According to [15], an upper bound (UB) I¯(X;Y) on the IR I(X;Y) is I¯(X;Y)=h¯(Y)h(Y|X,Φ)h(Φ)+h¯(Φ|Y,X). The terms h(Φ) and h(Y |X, Φ) can be evaluated in a straightforward manner, see [15]. The two UBs h¯(Y) and h¯(Φ|Y,X) can be based as in [15] on the normalized Kullback-Leibler (KL) divergence. Assuming that U is stationary and ergodic, one can invoke the Shannon-McMillan-Breiman theorem and the chain rule, getting

h¯(U)=limn1nk=1nlog2(1q(uk|u0k1)),
where u0n is generated according to the actual multivariate PDF p(u1n), and the initial condition u0 is given. The bound can be extended to the conditional relative entropy rate in a straightforward manner. The auxiliary PDF q(·) can be seen as an approximation to p(). In this perspective, the KL divergence is a measure of the quality of the fit between p(·) and q(), and the UB is equal to the actual relative entropy rate when the fit is ideal, that is when q(·) = p(·).

4. Bayesian tracking

In the first-order model of phase noise (1) the hidden state at time k is the phase ϕk. BT of the hidden state is based on the two-step iteration for k = 1, 2, ⋯

p¯(ϕk|z1k1)=ππp(ϕk|ϕk1)p(ϕk1|z1k1)dϕk1,p(ϕk|z1k)=p¯(ϕk|z1k1)p(zk|ϕk)p(zk|z1k1),
where p¯(ϕk|z1k1) is the predictive density p(ϕk|z1k) is the posterior density, z1kis the channel measurement up to time k, and the denominator of the above equation is computed by the Chapman-Kolmogoroff equation
p(zk|z1k1)=ππp¯(ϕk|z1k1)p(zk|ϕk)dϕk.

Note that the state transition probability p(ϕk |ϕk−1) appears in (5) in place of p(ϕk|ϕk1,z1k1) thanks to the Markovian property (2). Moreover, thanks to (3), the channel transition probability p(zk |ϕk) can be used in place of p(zk|ϕk,z1k1) in (5).

The predictive density (and, as a consequence, the posterior density as well) cannot be expressed in closed form and therefore it should be approximated in some way. We adopt parametric BT to obtain approximations to the probability density functions appearing in the relative entropy rates h(Y) and h(Φ|Y, X). When h(Φ|Y, X) is evaluated, one puts Z = (Y, X) and the auxiliary probability that appears inside the logarithm of (4) is evaluated as

q(ϕk|y1k,x1k,ϕk+1)=p(ϕk+1|ϕk)q(ϕk|y1k,x1k)ππp(ϕk+1|ϕk)q(ϕk|y1k,x1k)dϕk,p(ϕk|ϕk1)=i=g(ϕk1+2πi,γ2;ϕk),
where the approximation q(ϕk|y1k,x1k) to the actual posterior probability is obtained from the parameters worked out by the parametric BT, and g(μ, σ2; x) is a Gaussian density with mean μ and variance σ2 evaluated in x. When h(Y) is considered, Z = Y and the approximation q(yk|y1k1) to p(yk|y1k1) is obtained from the Chapman-Kolmogoroff equation (6) and used inside the base-2 logarithm of (4).

Two parametric forms are hereafter considered: Tikhonov and Fourier. As expected, Tikhonov parametrization outperforms Fourier parametrization in approximating p(ϕk|y1k,x1k), because channel and state transition probabilities are very well approximated by Tikhonov distributions of the phase ϕk (see (9) and (11)), hence Tikhonov parameters almost represent a sufficient statistics for computing the IRs. On the other hand, Fourier parametrization is well suited to approximate p(ϕk|y1k1), which is a periodic function of period π/2 and with a small bandwidth (hence a few Fourier coefficients completely describe the function) for most constellations of practical use. In general, using a parametrization that is far from representing a sufficient statistics, will give looser bounds on the IRs. See App. A and B for details about Tikhonov and Fourier parametrizations, respectively. As shown in the appendices, all the integrals involved in the evaluation of auxiliary PDF appearing in (4) can be computed in closed form. This makes the proposed method computationally efficient.

5. Simulation results

In this section, the new UB is compared to a lower bound worked out by the computationally demanding trellis-based method of [12]. Specifically, the lower bound has been obtained with 128 states. Fig. 1 reports the results obtained with 4-QAM and with γ = 0.125: This is the largest value of γ obtained in the experimental results of [3] and can be regarded as a case of strong phase noise in cases of practical interest. The lower bound is virtually indistinguishable from the UB obtained with parametric BT in the entire range of signal-to-noise ratios (SNRs). A similar analysis holds for the results obtained with 16-QAM and reported in Fig. 1. Also in this case, the UB obtained with parametric BT virtually gives the actual IRs for γ = 0.125 and any SNRs. The figures report also the UB computed with the non-parametric approach (Trellis) for 32 and 64 states: For both constellations we observe that such UBs are not tight and are often looser than the AWGN information curve, that serves as a trivial IR UB. Increasing the number of states of the non-parametric approach helps in getting tighter bounds, at the expense of a much higher computational complexity. When the actual IR saturates (SNR= {10, 20} dB resp. for 4- and 16-QAM), the gap between the 64-state UB and the actual IR is larger for 16-QAM. However, if the gaps are normalized to the maximum IR, they are the same for the two modulation formats. Moreover, observe that getting a good IR estimate at high SNR is much more difficult with a non-parametric approach rather than a parametric one, because in this regime estimating accurately the density tail is crucial [15]: At high SNR the densities become spiky and tails are better captured by a parametric description, with a negligible computational complexity.

 figure: Fig. 1

Fig. 1 Lower bound and upper bounds for 4-QAM (left) and 16-QAM (right), with γ = 0.125. The achievable information rate of the pure AWGN channel (γ = 0) is also reported.

Download Full Size | PDF

6. Conclusion

A new upper bound (UB) above the information rate transferred through the discrete-time WPN channel has been presented. The results, compared to the lower bound obtained with the computationally demanding non-parametric methods of [12], show that the UB is accurate in all cases of practical interest. Before concluding the paper, a remark is in order about phase noise of order higher than one, which is out of the scope of the present paper and that will be subject of future research. Extension of the parametric-based approach to phase noise with memory of order higher than one, as the one studied in [15], is feasible by proposing an appropriate parametric density. In contrast, quantizing a multidimensional state space according to the approach of [12] would lead to an exponential increase of the number of states of the trellis, making computation unfeasible. Also, non-parametric methods based on particle filtering [14,15] can suffer from the high space dimensionality, especially at high SNR. To get accurate UBs one is led to increase the number of particles, thus increasing the complexity of the computation.

A. Tikhonov parametrization

The method is based on the approximation of the posterior density at time k − 1 as a Tikhonov density of real variable ϕk−1 ∈ [−π, π) and complex parameter ωk−1

p(ϕk1|y1k1,x1k1)t(ωk1;ϕk1)=e{ωk1ejϕk1}2πI0(|ωk1|),
where I0(·) is the modified Bessel function of the first kind and ℜ{·} denotes the real part. The state transition probability is hereafter approximated as
p(ϕk|ϕk1)g(ϕk1,γ2;ϕk).

Using these approximations in place of the actual probability density functions in (5), and using the properties of Tikhonov densities listed in [18] one has

p¯(ϕk|y1k1,x1k1)g(ϕk1,γ2;ϕk)t(ωk1;ϕk1)dϕk1t(ω¯k;ϕk)
where ω¯k=ωk1/(1+γ2|ωk1|). The channel transition probability at time k can be approximated with a Tikhonov density of variable ϕk:
p(yk|xk,ϕk)2SNRI0(2SNR|ykxk|)exp(SNR(|yk|2+|xk|2))t(2SNRykxk;ϕk).

Using (10) and (11) into (5) gives

p(ϕk|y1k,x1k)t(2SNRykxk+ω¯k;ϕk)=t(ωk;ϕk),ωk=2SNRykxk+ω¯k.

Having computed ωk by parametric BT (10) and (12), one can get the contribution at time k to the entropy rate h(Φ|Y, X) by taking the logarithm of the auxiliary probability (7). Note that in the BT we assume that the posterior and the predictive densities are Tikhonov, thus getting a close approximation to the actual densities without the need of evaluating integrals numerically. However, in the evaluation of (7), the integral appearing in the denominator cannot be evaluated in closed form if we still adopt the Tikhonov parametrization. Therefore, we adopt a Gaussian density based on the complex parameter ωk obtained by BT as an approximation to the actual posterior density appearing in the right side of (7). Specifically, we put in (7) q(ϕk|y1k,x1k)=g(ωk,|ωk|1;ϕk). This allows to compute in closed form the integral appearing in the denominator of (7):

p(ϕk+1|ϕk)q(ϕk|y1k,x1k)dϕk=n=g(ϕk+2nπ,γ2;ϕk+1)g(ωk,|ωk|1;ϕk)dϕk=n=g(2nπ+ωk,γ2+|ωk|1;ϕk+1),
where the last equality follows because the integrals appearing inside the sum are convolutions between Gaussian densities.

B. Fourier parametrization

We assume that constellation X has a 4-fold symmetry, therefore at time k − 1 it is possible to approximate the posterior density p(ϕk1|y1k1) in the interval [−π, π) by the Fourier series

q(ϕk1|y1k1)=n=mmAn(k1)ej4nϕk1,
where {An(k1)}n=mm are the Fourier coefficients of the auxiliary posterior density at time k − 1, and m is responsible of the accuracy of the approximation. Using (14) and (9) into (5) one has
q¯(ϕk|y1k1)=ππn=mmAn(k1)ej4nϕk1g(ϕk1,γ2;ϕk)dϕk1=n=mmA¯N(k)ej4nϕk,
which provides us with the prediction step A¯n(k)=An(k1)e8n2γ2. Using the identity exp(xcosθ)=l=Il(x)exp(jlθ), one approximates the channel transition probability p(yk|ϕk) by the Fourier series
q(yk|ϕk)=n=Bn(k)ej4nϕk,Bn(k)=12π|X|xkX2SNRI|4n|(2SNR|ykxk|)exp(SNR(|yk|2+|xk|2))ej4n(ykxk).

The auxiliary posterior density is found by plugging (15) and (16) into (5) to obtain

q(ϕk|y1k)=n=mmAn(k)ej4nϕk.

Fourier coefficients An(k) can be obtained from the convolution An(k)A˜n(k)=i=mmA¯i(k)Bni(k) by suitably normalizing the A˜n’s such that (17) is a density. Note that the truncation for n =−m,…, m of the Fourier expansion in the right hand side of (17) might be such that q(ϕk|y1k) is negative for some values of ϕk. It is easy to show that a sufficient condition for (17) to be non-negative is

A˜0(k)2i=1m|A˜i(k)|,
that for m = 1 is also a necessary condition. At each iteration of the BT we check (18): if it is satisfied then the tracking algorithm can proceed by the normalization An(k)=A˜n(k)/(2πA˜0(k)), n = −m,…, m, otherwise we put A0(k)=(2π)1 and An(k)=A˜n(k)/(4πi=1m|A˜i(k)|),n0. In any case, the contribution at time k to the relative entropy rate h(Y) is obtained by putting inside the base-2 logarithm appearing in (4) the auxiliary probability
q(yk|y1k1)=ππq¯(ϕk|y1k1)q(yk|ϕk)dϕk=2πA˜0(k).

References and links

1. R.-J. Essiambre, G. Kramer, P. J. Winzer, G. J. Foschini, and B. Goebel, “Capacity limits of optical fiber networks,” J. Lightw. Technol. 28(7), 662–701 (2010). [CrossRef]  

2. T. Mizuochi, Y. Miyata, K. Kubo, T. Sugihara, K. Onohara, and H. Yoshida, “Progress in soft-decision FEC,” OFC/NFOEC (2011), paper NWC2.

3. M. Magarini, A. Spalvieri, F. Vacondio, M. Bertolini, M. Pepe, and G. Gavioli, “Empirical modeling and simulation of phase noise in long-haul coherent optical systems,” Opt. Express 19(23), 22455–22461 (2011). [CrossRef]   [PubMed]  

4. G. J. Foschini and G. Vannucci, “Characterizing filtered light waves corrupted by phase noise,” IEEE Trans. Inf. Theory 34(6), 1437–1448 (1988). [CrossRef]  

5. M. Peleg, S. Shamai, and S. Galan, “Iterative decoding for coded noncoherent MPSK communications over phase-noisy AWGN channel,” Proc. IEE Commun. 147, 87–95 (2000). [CrossRef]  

6. G. Colavolpe, A. Barbieri, and G. Caire, “Algorithms for iterative decoding in the presence of strong phase noise,” IEEE J. Selected Areas Commun. 23(9), 1748–1757 (2005). [CrossRef]  

7. A. Barbieri and G. Colavolpe, “Soft-output decoding of rotationally invariant invariant codes over channels with phase noise,” IEEE Trans. Commun. 55(11), 2125–2133 (2007). [CrossRef]  

8. L. Barletta, M. Magarini, and A. Spalvieri, “Staged demodulation and decoding,” Opt. Express 20(21), 23728–23734 (2012). [CrossRef]   [PubMed]  

9. M. Magarini, L. Barletta, and A. Spalvieri, “Efficient computation of the feedback filter for the hybrid decision feedback equalizer in highly dispersive channels,” IEEE Trans. Wirel. Commun. 11(6), 2245–2253 (2012). [CrossRef]  

10. B. Goebel, R.-J. Essiambre, G. Kramer, P. J. Winzer, and N. Hanik, “Calculation of mutual information for partially coherent Gaussian channels with application to fiber optics,” IEEE Trans. Inf. Theory 57(9), 5720–5736 (2011). [CrossRef]  

11. P. Hou, B. J. Belzer, and T. R. Fischer, “Shaping gain of the partially coherent additive white Gaussian noise channel,” IEEE Commun. Letters 6(5), 175–177 (2002). [CrossRef]  

12. L. Barletta, M. Magarini, and A. Spalvieri, “The information rate transferred through the discrete-time Wiener’s phase noise channel,” J. Lightw. Technol. 30(10), 1480–1486 (2012). [CrossRef]  

13. A. Barbieri and G. Colavolpe, “On the information rate and repeat-accumulate code design for phase noise channel,” IEEE Trans. Commun. 59(12), 3223–3228 (2011). [CrossRef]  

14. L. Barletta, M. Magarini, and A. Spalvieri, “Tight upper and lower bounds to the information rate of the phase noise channel,” IEEE Intern. Symposium Inf. Theory (2013), 2284–2288.

15. L. Barletta, M. Magarini, S. Pecorino, and A. Spalvieri, “Upper and lower bounds to the information rate transferred through first-order Markov channels with free-running continuous state,” IEEE Trans. Inf. Theory 60(7), 3834–3844 (2014). [CrossRef]  

16. J. Dauwels and H.-A. Loeliger, “Computation of information rates by particle methods,” IEEE Trans. Inf. Theory 54(1), 406–409 (2008). [CrossRef]  

17. D. Simon, Optimal State Estimation (John Wiley & Sons, 2006). [CrossRef]  

18. A. Barbieri, G. Colavolpe, and G. Caire, “Joint Iterative Detection and Decoding in the Presence of Phase Noise and Frequency Offset,” IEEE Trans. Commun. 55(1), 171–179 (2007). [CrossRef]  

Cited By

Optica participates in Crossref's Cited-By Linking service. Citing articles from Optica Publishing Group journals and other participating publishers are listed here.

Alert me when this article is cited.


Figures (1)

Fig. 1
Fig. 1 Lower bound and upper bounds for 4-QAM (left) and 16-QAM (right), with γ = 0.125. The achievable information rate of the pure AWGN channel (γ = 0) is also reported.

Equations (19)

Equations on this page are rendered with MathJax. Learn more.

y k = x k e j ϕ k + w k , ϕ k = [ ϕ k 1 + γ v i ] mod 2 π , k = 1 , 2 , ,
p ( ϕ k | ϕ k 1 , y 1 k 1 , x 1 k 1 ) = p ( ϕ k | ϕ k 1 ) .
p ( y k , x k | ϕ k , y 1 k 1 , x 1 k 1 ) = p ( y k , x k | ϕ k ) .
h ¯ ( U ) = lim n 1 n k = 1 n log 2 ( 1 q ( u k | u 0 k 1 ) ) ,
p ¯ ( ϕ k | z 1 k 1 ) = π π p ( ϕ k | ϕ k 1 ) p ( ϕ k 1 | z 1 k 1 ) d ϕ k 1 , p ( ϕ k | z 1 k ) = p ¯ ( ϕ k | z 1 k 1 ) p ( z k | ϕ k ) p ( z k | z 1 k 1 ) ,
p ( z k | z 1 k 1 ) = π π p ¯ ( ϕ k | z 1 k 1 ) p ( z k | ϕ k ) d ϕ k .
q ( ϕ k | y 1 k , x 1 k , ϕ k + 1 ) = p ( ϕ k + 1 | ϕ k ) q ( ϕ k | y 1 k , x 1 k ) π π p ( ϕ k + 1 | ϕ k ) q ( ϕ k | y 1 k , x 1 k ) d ϕ k , p ( ϕ k | ϕ k 1 ) = i = g ( ϕ k 1 + 2 π i , γ 2 ; ϕ k ) ,
p ( ϕ k 1 | y 1 k 1 , x 1 k 1 ) t ( ω k 1 ; ϕ k 1 ) = e { ω k 1 e j ϕ k 1 } 2 π I 0 ( | ω k 1 | ) ,
p ( ϕ k | ϕ k 1 ) g ( ϕ k 1 , γ 2 ; ϕ k ) .
p ¯ ( ϕ k | y 1 k 1 , x 1 k 1 ) g ( ϕ k 1 , γ 2 ; ϕ k ) t ( ω k 1 ; ϕ k 1 ) d ϕ k 1 t ( ω ¯ k ; ϕ k )
p ( y k | x k , ϕ k ) 2 SNR I 0 ( 2 SNR | y k x k | ) exp ( SNR ( | y k | 2 + | x k | 2 ) ) t ( 2 SNR y k x k ; ϕ k ) .
p ( ϕ k | y 1 k , x 1 k ) t ( 2 SNR y k x k + ω ¯ k ; ϕ k ) = t ( ω k ; ϕ k ) , ω k = 2 SNR y k x k + ω ¯ k .
p ( ϕ k + 1 | ϕ k ) q ( ϕ k | y 1 k , x 1 k ) d ϕ k = n = g ( ϕ k + 2 n π , γ 2 ; ϕ k + 1 ) g ( ω k , | ω k | 1 ; ϕ k ) d ϕ k = n = g ( 2 n π + ω k , γ 2 + | ω k | 1 ; ϕ k + 1 ) ,
q ( ϕ k 1 | y 1 k 1 ) = n = m m A n ( k 1 ) e j 4 n ϕ k 1 ,
q ¯ ( ϕ k | y 1 k 1 ) = π π n = m m A n ( k 1 ) e j 4 n ϕ k 1 g ( ϕ k 1 , γ 2 ; ϕ k ) d ϕ k 1 = n = m m A ¯ N ( k ) e j 4 n ϕ k ,
q ( y k | ϕ k ) = n = B n ( k ) e j 4 n ϕ k , B n ( k ) = 1 2 π | X | x k X 2 SNR I | 4 n | ( 2 SNR | y k x k | ) exp ( SNR ( | y k | 2 + | x k | 2 ) ) e j 4 n ( y k x k ) .
q ( ϕ k | y 1 k ) = n = m m A n ( k ) e j 4 n ϕ k .
A ˜ 0 ( k ) 2 i = 1 m | A ˜ i ( k ) | ,
q ( y k | y 1 k 1 ) = π π q ¯ ( ϕ k | y 1 k 1 ) q ( y k | ϕ k ) d ϕ k = 2 π A ˜ 0 ( k ) .
Select as filters


Select Topics Cancel
© Copyright 2024 | Optica Publishing Group. All rights reserved, including rights for text and data mining and training of artificial technologies or similar technologies.