Abstract
Channel matrix plays a critical role in receiver design and ultimate channel performance. To fully describe the channel matrix of a few-mode fiber (FMF), we choose the generalized high-dimensional Gell-Mann matrices, an equivalent of the 2-dimensional Pauli matrices used for a single-mode fiber (SMF), as the basis for the channel matrix decomposition. The frequency dependence of channel matrix can be studied in terms of the autocorrelation function (ACF), showing how fast channel changes in frequency domain. In this paper, we derive a canonical stochastic differential equation (SDE) for the FMF channel matrix in the regime of strong coupling. With the SDE, we develop an analytical form for the ACF of FMF channel matrix, from which the channel correlation bandwidth is obtained.
© 2013 Optical Society of America
1. Introduction
Few-mode fiber (FMF) is being actively explored as a promising transmission medium to surmount the capacity crunch in future fiber optic communication [1–3]. FMF supports multiple spatial modes co-propagating in one medium simultaneously, enhancing the capacity of information transmission. However, the random mode coupling caused by fiber index imperfections and mechanical perturbations leads to signal dispersion in FMF, called differential modal delay (DMD) [4]. While large DMD and mode coupling can improve the channel diversity, making the system robust against mode-dependent loss (MDL) [5], excessive DMD would increase the computational complexity of digital signal processing (DSP) required for the channel equalization [1]. An in-depth understanding of the statistic properties of FMF channel subject to DMD is essential to assist system design for achieving an optimal system performance.
With respect to single-mode fiber (SMF), there is a large body of studies on the statistics of SMF channel affected by polarization modes dispersion (PMD). The stochastic analysis is conducted in terms of either the PMD vector in the Stokes space [6,7] or the channel matrix in the Jones space [8,9]. Though the Stokes space and the Jones space are isomorphic [10], a PMD vector is more suitable for the analysis of time-domain direct detection systems, whereas a Jones matrix is more useful for the analysis of coherent detection systems. Based on a white Gaussian noise assumption for the birefringence vector with strong coupling, standard tools of stochastic calculus can be used to derive some important statistic properties for the PMD vector and Jones matrix [6–9].
The basic concepts and stochastic study methods for SMF can be extended to FMF. Since each spatial mode supports two orthogonal polarization modes, FMF with N spatial modes supports totally 2N modes with different group velocities. An extended 2N-dimensional vector is used to represent the propagating modes in FMF, and the linear effect of FMF channel can be described as a 2N x 2N channel matrix in the generalized Jones space. Recently, study has been conducted on the statistics of mode dispersion (MD) vector (a general form of PMD vector in SMF) for FMF [11,12]. A generalized Stokes space has been formulated to allow convenient representation of the mode coupling vector (a general form of birefringence vector in SMF) and the MD vector in FMF. Based on a white Gaussian noise model for the mode coupling vector, the statistic properties of MD vector have been derived in [11,12]. The statistics of group delays in FMF has also been derived in [5,13] by dividing fiber into many cascaded sections modeled by random matrices.
On the other hand, the latest decade has seen the revival of coherent detection, where DSP is performed on the channel matrix of fiber optic link [14,15]. Subsequently, the statistical analysis of FMF channel matrix has become critically important. In particular, the statistical analysis provides us the autocorrelation function (ACF) of FMF channel matrix, which gives an insight into the frequency behavior of FMF channel dispersion and helps to find the optimal receiver design. The channel correlation bandwidth derived from the ACF is also important for the coherent detection system design, as it is often needed for the channel estimation, for instance, for the minimum mean-square error (MMSE) based channel estimation [16].
As an extension of the Pauli matrices used as the basis of the Stokes space in SMF [9], high-dimensional trace orthogonal matrices have been proposed in [11] as the basis of the generalized Stokes space in FMF. In this work, we also resort to the trace orthogonal matrices as the basis for FMF channel matrix. In this report, a canonical stochastic differential equation (SDE) for FMF channel matrix is derived in the regime of strong coupling. Applying the standard tools of stochastic calculus to the SDE, we develop an analytical form for the autocorrelation function (ACF) of FMF channel matrix, from which the channel correlation bandwidth is obtained. A comparison of simulation and analytical results demonstrates the accuracy of our theoretical model.
2. FMF channel matrix and decomposition
The main notations used in the paper are as follows: denotes a column vector in the generalized Jones space of dimension 2N, denotes a column vector in the generalized Stokes space of dimension , denotes a column vector of dimension , a column vector with elements is denoted by ; Symbol ‘’ represents the transpose conjugate operation for a matrix, and ‘’ represents the transpose operation; E[ ] denotes the ensemble average, and Tr{ } denotes the trace of a matrix.
The optical field in an N-mode FMF at length z and frequency can be expressed as a 2N-dimensional complex vector . The linear channel of FMF can be treated as a 2N input 2N output linear system given by [11,12]
where U is the channel matrix of dimension 2N x 2N, describing the random mode coupling and group delays in optical channel. Without consideration of MDL, the channel matrix U is a unitary matrix with unit determinant.Similar to the Pauli matrices used as the basis for the 2x2 Jones matrix of SMF [9], trace orthogonal matrices form the basis for the 2N x 2N channel matrix of FMF. The trace orthogonal matrices and their associated properties have already been developed in [11]. Specifically, to represent the channel matrix U, and traceless Hermitian matrices (i = 1 to ) are constructed, satisfying the trace-orthogonal condition [11]
where is the Kronecker symbol . Note that in (2) we use a normalization factor of 2 instead of 2N in [11]. This follows the convention in other works on special unitary groups [17,18]. Hence, the channel matrix U could be expressed as a superposition of the trace orthogonal matrices [11]where the vector is an ensemble of the trace orthogonal matrices, and the complex vector is an ensemble of the weights of each trace orthogonal matrix. Since one set of weights can exclusively identify a matrix U, the properties of a 2N x 2N channel matrix U is fully represented by a vector of dimension . According to the trace-orthogonal condition (2), the weights of trace orthogonal matrices could be easily extracted by [11]In group theory, the traceless Hermitian matrices (i = 1 to ) form the generators (elements of the Lie algebra) of special unitary group of degree 2N (SU(2N)), satisfying [17]
where the indices k, m, n take values from 1 to , and the coefficients f and d are the structure constants. As provided by [11], a cross product “” of vectors and in the generalized Stokes space can be defined with the structure constants f aswhere is the m-th element in vector , is the n-th element in vector , and is a unit vector with 1 for the k-th element. For the convenience of the mathematical description of our problem, we also introduce a product “” of vectors and defined with structure constants d asWe call “” circle dot product to distinguish it from the conventional dot product ‘’. Then, the matrices and satisfy the relations Adding (9) and (10), we get the product of the matrices andThe choices for the trace-orthogonal matrices are many. The only guide for the choice is the simplicity of calculation. Modified Gell-Mann matrices have been used to build the trace-orthogonal matrices in [11]. In this paper, we select the generalized Gell-Mann matrices, since the structure constants (f and d), extracted from the generalized Gell-Mann matrices, are sufficiently simple for the calculation in appendix B. The generalized Gell-Mann matrices are described in appendix A. The off-diagonal ones are identical to those in [11]. Only the diagonal ones are slightly different from [11]. The generalized Gell-Mann matrices coincide with the Pauli matrices when N = 1.
3. SDE for channel matrix
The evolution of FMF channel matrix U along fiber position is governed by the mode coupling vector , a generalization of the birefringence vector in SMF
is required to be a Hermitian matrix to ensure the unitarity of U. Such that, all the elements in are real [11]. For a ‘long’ fiber, where the overall fiber length is significantly larger than the correlation length of local modal dispersion, the white Gaussian noise model is valid for the mode coupling vector given by [11]where the elements of are statistically uncorrelated white Gaussian processes with zero mean and unit variance, and the parameter represents the mode coupling strength.Substituting the channel matrix U in (12) with its vector representation (3), and using the product defined in (11), we obtain the evolution equation of the vector representation as
where is the differential of the Brownian motion . Equation (14) can be written in a canonical form of SDEwhere is the diffusion matrix of dimension xwith entries to be polynomials of the components inWe note that, normally a SDE is on a real-value space, whereas the dynamic vector in Eq. (15) is complex. However, we can consider (15) as a short hand for real-value SDEs. Namely, we can separate the real and imaginary part of the dynamic vector , and treat the two parts individually as real elements. As long as the Brownian motion term is real, the real and complex forms of SDE describe the same physics of stochastic process. For the sake of concision, we use the complex vector in our discussion in the remainder of this paper.
Derived from (3), (12) and (13), the m-th column in the diffusion matrix Q satisfies
Denote as the entry at x-th row and y-th column in the diffusion matrix Q. Using the trace-orthogonality of in (17), can be extracted asSome important statistic properties of the dynamic vector can be derived from the SDE (15) using standard tools of stochastic calculus. However, those standard tools are usually applied for the Ito form of SDE, whereas the product in (15), involving the Brownian motion, should be interpreted as the Stratonovich product [8,9]. Before (15) can be directly used for stochastic calculus, it should be converted from the Stratonovich form into its equivalent Ito form. In the following context, we will use two different approaches to carry out the conversion.3.1 Stratonovich-Ito conversion algorithm approach
In SDE theory, there is a standard algorithm to convert SDE (15) from the Stratonovich form to its equivalent Ito form [19], which is given by
where the diffusion matrix is the same as that in Eq. (15), and is the drift correction vector of dimension , whose l-th component is given bySince can be obtained using (18), the product of the vectors and can be written aswhere is already known in (17). Noting , (21) can be rewritten asAfter some length calculation shown in appendix B, we obtain
As such, the drift correction vector in (19) becomesThe i-th component in is proportional to the i-th component in .3.2 Stochastic integral definition approach
We can also develop the correction term from the first principle of definitions for the Stratonovich and Ito integral.
According to the definition of stochastic integral [20], the integral of canonical SDE (15) can be interpreted as
where (i = 0 to n-1) are n partitions in the intervals [a,b] with mesh , , and is the parameter defining different methods of integral. The case of = 0 corresponds to the Ito integral, whereas = 1/2 corresponds to the Stratonovich integral.SDE (15) is the canonical form of dynamic Eq. (12) with a vector representation for the channel matrix U. When we substitute the canonical SDE (15) in (25) with (12), the integral of dynamic Eq. (12) is given by
From (26), the correction term, defined as the conversion between the Ito and Strantonovich integral, is a given bySubstituting dU in (12) into (27), and using the rules of , () [21, Chapter III], and , we can finally reach the correction term asThe correction term (28) we get here is consistent with that we developed using the standard Stratonovich-Ito conversion algorithm in last section (24).It follows that the dynamic equation of U (12) in its Ito form can be expressed as
The dynamic equation of (14) in its Ito form isThe canonical form of SDE (30) is (19), with the diffusion matrix Q defined by (16), and the drift vector defined as (24). The standard tools of stochastic calculus can be directly applied to the Ito Eq. (19).
4. Autocorrelation function of channel matrix
The frequency correlation of channel matrix U can be defined as
R is also a unitary matrix, describing the disparity between the channel matrices U at frequencies and . Its expectation value, E[R], is the ACF of channel matrix, giving a measure of the statistical dependence of U at different frequencies, and indicating how quickly U diverges in frequency domain. The bandwidth of E[R] is called channel correlation bandwidth, representing a frequency span over which U can be approximately considered as a constant matrix from the viewpoint of DSP.Taking derivative of (31) with respect to length z, the evolution of R along z is
Substituting the evolution equation of U (12) into (32), we obtainTerm is a rotated white Gaussian noise, and can be denoted as , where can be regarded as the white Gaussian noise vector in a new coordinate system. From (33), we can observe that R is determined only by the frequency difference , such that (33) can be rewritten asEquation (34) has the same form as (12), substituting with . Following the steps in last section, we can also obtain the Ito equation for (34) in a similar form to (19), given bywhere is the vector representation of R on the basis matrices . The diffusion matrix is given byThe drift vector is given byThe Fokker-Planck equation (FPE) is a powerful tool of stochastic calculus, which can be used to derive the distribution of a dynamic process governed by SDE [21]. Applying FPE to the Ito Eq. (35) gives the probability density function (pdf) p of vector
where . Though the distribution of the vector representation for the channel matrix experiences an instant spread on unit sphere after fiber input due to the white Gaussian noise model adopted for the mode coupling vector [9], the distribution of the vector representation for the ACF shall experience a gradual spread on unit sphere as the random coupling is mitigated by the transpose conjugate operation in (31). The solution to the pdf in FPE (38) will be discussed in a separate future submission.On the other hand, the Dynkin formula gives the expectation of any smooth function f of the dynamic process without the knowledge of the pdf [21]
where G is the Ito generator given bySince is what we need to obtain the ACF, the smooth functions in our case should be . The second order differential operator to in (40) equals to 0, and only the first order differential operator left. Such that, the evolution of is only determined by the drift term . Therefore, the Dynkin formula givesThe correlation matrix R at the input of fiber is an identity matrix I for any frequencies. With the initial condition and , we obtainConsequently, the ACF of channel matrix can be expressed asFrom (43), the 3dB channel correlation bandwidth, where the frequency correlation level falls to 0.5 [22], is obtainedEquation (44) shows that the frequency dependence of channel matrix is related to the mode number N, and decreases with the square root of fiber length z.Taking derivative of (31) with respect to frequencies and , we obtain
Setting in (45), and noting that the Hermitian MD matrix is defined aswhere the subscript ‘’ stands for the derivative over , we can relate the frequency derivative of R to the MD matrix bywhere . To derive (47), we have first applied the identity of (46) then of (11). Provided that R is determined only by the frequency difference , . Substituting this into (47), we haveEquation (48) is consistent with the study reported in [11]. The MD vector of FMF undertakes a dimensional isotropic random walk, and its root-mean-square value grows as the square root of the fiber length z [11,13].From (43) and (48), the mean-square value of MD vector E[] and the ACF of channel matrix E[R] can be related by
Applying (48) into (44), the correlation bandwidth in relation to the channel matrix can be written as . On the other hand, the correlation bandwidth in relation to the square modulus of MD vector, as given in [11], is , which is times of . We conclude that the MD vector is always slightly more stable in frequency domain compared to the channel matrix. Our conclusions about the ACF of the FMF (43) and (49) coincide with those of the SMF studied in [9] when N = 1. We extend the work of [9] to a more general case of arbitrary number of modes. We note that the MD vector defined in (46) has a factor of difference than that in [11]. With the MD vector definition in (46), the ACF is independent of the number of modes for the same mean-square value of MD vector E[]. Namely, E[] will be a good indicator of the correlation bandwidth regardless of the number of modes under the condition of strong coupling.
5. Simulation
We conduct the Monte-Carlo simulation to verify our theoretical result obtained in the last section. A FMF is divided into 100 sections, assuming that the length of one section is longer than the correlation length. The local principle modes (PMs) in separate sections are considered independent and uniformly distributed. The propagation of each section is modeled as a 2N x 2N matrix with equivalent statistical properties, given by [5,13]
where and are independent random unitary matrices corresponding to the random coupling of the PMs at the input and the output of the section, and T is a diagonal matrix representing the impact of group delays on the PMswhere (i = 1 to 2N) are the group delays of the local PMs. The average delay is a constant by setting . The channel matrix of the whole fiber is obtained by multiplexing the propagation matrices V for each section. The choice of the individual group delays does not have significant impact on the statistical properties of the global fiber as long as the modes are strongly coupled [13]. We just need to ensure that the MD in each section satisfies the requirement of the global MD .After the channel matrices U at different frequencies are obtained using the Monte-Carlo simulation, their correlation R could be calculated using (31). We measured the ACF for FMFs with a 10-ps global MD in the bandwidth of 250 GHz. 10000 fiber configurations are used in the simulation to obtain the ensemble averaging. The simulation results and theoretical predictions of for FMFs with 2 to 4 spatial modes are plotted in Fig. 1. The accuracy of our theory result is justified by the excellent match between the simulation and theoretical results.
6. Conclusion
In this paper, we have used two different approaches to derive the SDE for the FMF channel matrix in the regime of strong coupling. The channel matrix is decomposed over the generalized high-dimensional Gell-Mann matrices, an equivalent of two-dimensional Pauli matrices. We also develop the ACF of channel matrix for a strong coupling FMF. The channel correlation bandwidth obtained from the ACF indicates that the frequency dependence of the channel matrix decreases with the square root of fiber length. The validity of our analytical result is confirmed by the Monte-Carlo simulation.
Appendix A
We choose the generalized Gell-Mann matrices as the basis matrices for the decomposition of the channel matrix. The n-dimensional Gell-Mann matrices are constructed by the following algorithm:
The first n-1matrices are diagonal with the form
where the first k diagonal entries are 1, and the k + 1th diagonal entry is –k. The rest entries are all 0. The coefficient is added to satisfy the trace-orthogonal condition .The rest Gell-Mann matrices are off-diagonal. Half of them are symmetric
where the entries at the p-th row, q-th column and the q-th row, p-th column are 1, and the rest entries are 0. The other half are antisymmetricwhere the entry at the p-th row and q-th column is –i, the entry at the q-th row and p-th column is i, and the rest entries are 0.Following this construction algorithm, we can easily prove out that.
Appendix B
The product and in (22) can also be expanded as (5) and (6). Using the trace-orthogonality of , the structure constants and can be extracted by
Switching indices m with n in the above functions, we have , and. Substituting n with m in (53), we have. Then, . Since we know, we can easily prove out .Substituting (5) and (6) into the second term on the right side of (22), we have
Using the properties of coefficients and we have discussed above, (54) can be simplified asFor the Gell-Mann matrices described in appendix A, we can easily prove out that for the case of , and for the case of . Thus, (22) can be written asReferences and links
1. A. Li, A. A. Amin, X. Chen, and W. Shieh, “Reception of mode and polarization multiplexed 107-Gb/s CO-OFDM signal over a two-mode fiber,” OFC/NFOEC2011, PDPB8 (2011).
2. R. Ryf, S. Randel, A. H. Gnauck, C. Bolle, A. Sierra, S. Mumtaz, M. Esmaeelpour, E. C. Burrows, R. Essiambre, P. J. Winzer, D. W. Peckham, A. H. McCurdy, and R. Lingle, “Mode-division multiplexing over 96 km of few-mode fiber using coherent 6x6 MIMO processing,” J. Lightwave Technol. 30(4), 521–531 (2012). [CrossRef]
3. N. Bai, E. Ip, Y. K. Huang, E. Mateo, F. Yaman, M. J. Li, S. Bickham, S. Ten, J. Liñares, C. Montero, V. Moreno, X. Prieto, V. Tse, K. Man Chung, A. P. T. Lau, H. Y. Tam, C. Lu, Y. Luo, G. D. Peng, G. Li, and T. Wang, “Mode-division multiplexed transmission with inline few-mode fiber amplifier,” Opt. Express 20(3), 2668–2680 (2012). [CrossRef] [PubMed]
4. R. Olshansky, “Mode-coupling effects in graded-index optical fibers,” Appl. Opt. 14(4), 935–945 (1975). [CrossRef] [PubMed]
5. K.-P. Ho and J. M. Kahn, “Frequency diversity in mode-division multiplexing systems,” J. Lightwave Technol. 29(24), 3719–3726 (2011). [CrossRef]
6. P. K. A. Wai and C. R. Menyuk, “Polarization mode dispersion, decorrelation, and diffusion in optical fiber with randomly varying birefringence,” J. Lightwave Technol. 14(2), 148–157 (1996). [CrossRef]
7. G. J. Foschini and C. D. Poole, “Statistical theory of polarization dispersion in single mode fibers,” J. Lightwave Technol. 9(11), 1439–1456 (1991). [CrossRef]
8. A. Bononi and A. Vannucci, “Statistics of the Jones matrix of fibers affected by polarization mode dispersion,” Opt. Lett. 26(10), 675–677 (2001). [CrossRef] [PubMed]
9. A. Vannucci and A. Bononi, “Statistical characterization of the Jones matrix of long fibers affected by polarization mode dispersion (PMD),” J. Lightwave Technol. 20(5), 783–793 (2002). [CrossRef]
10. J. P. Gordon and H. Kogelnik, “PMD fundamentals: polarization mode dispersion in optical fibers,” Proc. Natl. Acad. Sci. U.S.A. 97(9), 4541–4550 (2000). [CrossRef] [PubMed]
11. C. Antonelli, A. Mecozzi, M. Shtaif, and P. J. Winzer, “Stokes-space analysis of modal dispersion in fiber with multiple mode transmission,” Opt. Express 20, 11718–11733 (2012).
12. C. Antonelli, A. Mecozzi, and M. Shtaif, “Modeling of linear and nonlinear coupling in multiple-mode fiber optic transmission with MIMO signal processing,” Signals, Systems and Computers (Asilomar, 2012), 645–649.
13. K.-P. Ho and J. M. Kahn, “Statistics of group delays in multimode fiber with strong mode coupling,” J. Lightwave Technol. 29(21), 3119–3128 (2011). [CrossRef]
14. A. Li, A. Al Amin, X. Chen, and W. Shieh, “Transmission of 107-Gb/s mode and polarization multiplexed CO-OFDM signal over a two-mode fiber,” Opt. Express 19(9), 8808–8814 (2011). [CrossRef] [PubMed]
15. A. Al Amin, A. Li, S. Chen, X. Chen, G. Gao, and W. Shieh, “Dual-LP11 mode 4×4 MIMO-OFDM transmission over a two-mode fiber,” Opt. Express 19(17), 16672–16679 (2011). [CrossRef] [PubMed]
16. M. Hsieh and C. Wei, “Channel estimation for OFDM systems based in comb-type pilot arrangement in frequency selective fading channels,” IEEE Trans. Consum. Electron. 44(1), 217–225 (1998). [CrossRef]
17. S. Weigert, “Baker-Campbell-Hausdorff relation for special unitary groups SU(N),” J. Phys. Math. Gen. 30(24), 8739–8749 (1997). [CrossRef]
18. M. Gell-Man, “Symmetries of Baryons and Mesons,” Phys. Rev. 125(3), 1067–1084 (1962). [CrossRef]
19. L. Arnold, Stochastic Differential Equations Theory and Application (Wiley 1974).
20. T. C. Gard, Introduction to Stochastic Differential Equations (Marcel Dekker, INC. 1988).
21. B. Øksendal, Stochastic Differential Equation (Springer-Verlag Berlin Heidelberg 1989).
22. D. C. Cox and R. Leck, “Correlation bandwidth and delay spread multipath propagation statistics for 91-MHz urban mobile radio channels,” IEEE Trans. Commun. 23(11), 1271–1280 (1975). [CrossRef]