Expand this Topic clickable element to expand a topic
Skip to content
Optica Publishing Group

Prediction of radiation pressure force exerted on moving particles by the two-level skeletonization

Open Access Open Access

Abstract

A fast full-wave method for computing radiation pressure force (RPF) exerted by shaped light beams on moving particles is presented. The problem of evaluating RPF exerted on a moving particle by a single excitation beam is converted into that of computing RPF’s exerted on a static particle by multiple beams. The discretization of different beams leads to distinct right hand sides (RHS’s) for the matrix system. To avoid solving each RHS by the brute-force manner, the algorithm conducts low-rank decomposition on the excitation matrix consisting of all RHS’s to figure out the so-called skeleton light beams by interpolative decomposition (ID). The peak memory requirement of the skeletonization is a bottle-neck if the particle is large. A two-level skeletonization scheme is proposed to solve this problem. Some numerical experiments on arbitrarily shaped homogeneous particles are performed to illustrate the performance and capability of the developed method.

© 2014 Optical Society of America

1. Introduction

The micro-manipulation [1] based on optical tweezers [24] offers many benefits over its competing technologies, most notably the ability to work with delicate or potentially harmful samples in sealed environments without direct mechanical access. Some of the more sophisticated optically actuated devices make use of interactions among multiple rotating objects. The rotation control and rotational interactions have not only been demonstrated in two dimensions but also in three dimensions [5]. Precise control of the motion of a particle in tweezers relies on the prediction of RPF exerted on the particle. Obviously, the more accurately the RPF can be predicted, the more reliable the complex manipulation can be reached. Therefore, an accurate and efficient computational tool to predict RPF exerted on moving particles is badly required to implement the optical tweezers devices with increasing complexity. From the simulation point of view, we can either simulate the moving particle in the fixed optical tweezers directly or change the state of the optical tweezers by varying parameters of the light beam while assuming the particle is static. To ease the implementation, the latter is employed in our study.

Different types of methods can be employed to predict RPF because the behavior of optical tweezers depends upon the size of the trapped particle relative to the wavelength of light used to trap it. In cases where the dimensions of the particle are much greater than the wavelength, a simple ray optics treatment is sufficient. If the wavelength of light far exceeds the particle dimensions, the particles can be treated as electric dipoles in an electric field. For optical trapping of dielectric objects of dimensions within an order of magnitude of the trapping beam wavelength, the only accurate models involve the treatment of either time dependent or time harmonic Maxwell equations using appropriate boundary conditions. These models can be generally grouped into analytical approaches or full-wave algorithms. Analytical methods are always limited to particles with canonical shapes while full-wave ones are always expensive. In [69], analytical algorithms focused on the spherical particles have been reported. It is a much more difficult task to predict the RPF exerted on nonspherical particles. Many efforts have been reported on this topic [1016]. The sizes of the arbitrarily shaped particles in these methods are so severely limited that the beam profile effect on RPF is difficultly exploited. Most recently, multilevel fast multipole algorithm (MLFMA), one of the most widely used full-wave method in computational electromagnetic community, has been employed to predict RPF on arbitrarily shaped particles [17]. Suppose N is the number of unknowns, MLFMA can reduce both the computational complexity of O(N2N3) and the storage complexity of O(N2) for surface integral equation (SIE) to O(NlogN) [1821]. Additionally, the use of SIE with triangular patches reduces considerably the unknowns since only the discretization on the particle surface is necessary. However, to model a moving particle in optical tweezers, a set of shaped light beams should be used. Numerically, discretizing different excitation beams gives rise to multiple right hand sides (RHS’s) for the matrix system. As each RHS must be solved separately, the time cost becomes O(NbNlogN) with Nb the number of excitation beams. When Nb becomes large, the simulation can be prohibitively expensive. For multiple incidents electromagnetic scattering problem, some methods based on model-based parameter estimation (MBPE) [22], asymptotic waveform evaluation (AWE) [23], singular value decomposition (SVD)/QR factorization [24] or other techniques [25, 26] are developed. Nonetheless, their application is either limited by the lack of rigorous error control schemes or confined to the cases when Nb is moderate due to the efficiency issue.

Encouraged by the skeletonization framework in [27], a full-wave algorithm is proposed to predict RPF exerted on moving particles in this work. The algorithm is based on the interpolative decomposition (ID) [2830] and skeleton concept [3134]. The algorithm introduces skeleton beams to accelerate the MLFMA computation, which are selected by conducting ID skeletonization on the excitation matrix consisting of all RHS vectors. A bottle-neck in this process is that the peak memory usage of skeletonization may exceed what hardware can offer. By dividing the whole parameter space into serval intervals, the peak memory usage can be decreased. However, the efficiency of the computation would degrade because the number of skeletons would increase as the number of intervals grows. To solve this difficulty, a two-level skeletonization scheme is proposed.

The remainder of the paper is organized as follows. Section 2 outlines combined tangential formulation (CTF) of SIE and the method of moments (MoM) system for a moving particle in optical tweezers. Section 3 details the proposed method, including the skeletonization framework, the cost analysis and the two-level scheme of the algorithm. Section 4 presents numerical experiments to show performance of the proposed algorithm. Section 5 concludes the paper.

2. CTF and MoM

Figure 1 shows the typical time-harmonic scattering (with a time factor ejωt of the angular frequency ω) by the arbitrarily shaped homogeneous particle with permittivity εi, and permeability μi (i denotes internal) embedded in an homogeneous lossless medium with constitutive parameters εe; μe (e denotes external). Ωi and Ωe, respectively, denote the inner and outer regions, which are assumed to be open sets and are separated by a boundary surface S, with a unit normal (directed from S toward Ωe). Einc and Hinc are the impressed electric and magnetic fields in Ωe. By invoking Love’s equivalent principle for the exterior medium, the region Ωi is filled up with the same material of the region Ωe, and the sources in Ωi are removed. The equivalent currents Je=n^×Hetot and Me=Eetot×n^, positioned on the external face Se of the surface S, generate the original scattered fields Eesca and Hesca in the region Ωe, and null fields in the region Ωi, i.e.,

Eesca(r,Je,Me)+Einc={0ifrΩiEetot,ifrΩe
Hesca(r,Je,Me)+Hinc={0ifrΩiHetot,ifrΩe
An equivalent problem can be established for Ωi as well, where Ωe is filled up with the same material of Ωi, and the sources in Ωe are removed. The equivalent currents ( Ji=n^×Hitot, Mi=Eitot×n^), defined on the external face Si of the surface S, produce the original scattered fields Eisca and Hisca in Ωi, and null fields in Ωe:
Eisca(r,Ji,Mi)={Eitot,ifrΩi0,ifrΩe
Hisca(r,Ji,Mi)={Hitot,ifrΩi0ifrΩe
The scattered electric and magnetic fields in Eqs. (1)(4) can be evaluated as
Eqsca(r,Jq,Mq)=ηqq(Jq)𝒦q(Mq),
Hqsca(r,Jq,Mq)=ηq1q(Mq)+𝒦q(Jq),
where q = i, e; and ηq=μq/εq is the the wave impedance for the region q. The operators and 𝒦 corresponding to the region q can be written as
q{X}(r)=jkqSdS[I+1kq2]X(r)gq(r,r),
𝒦q{X}(r)=Ω(r)4πX(r)+×SdSgq(r,r)X(r),
where stands for the principal value integration; 0 ≤ Ω(r) ≤ 4π is the internal solid angle; gq(r,r)=ejkq|rr|4π|rr|, denotes the homogeneous-space Green’s function with kq=ωμqεq the wavenumber for the region q. It holds true from the boundary condition and the definition of equivalent currents that Ji = −Je and Mi = −Me. The system consisting of the four equations (1)–(4) is overdetermined for the two unknowns (Je, Me). To solve (Je, Me), different SIE formulations are generally constructed by combining tangential (T) and/or normal (N) forms of the four equations: tangential electric field integral equation (T-EFIE), tangential magnetic field integral equation (T-MFIE), normal electric field integral equation (N-EFIE) and normal magnetic field integral equation (N-MFIE). The CTF of SIE is the combination of [35, 36],
ηe1(T-EFIE)e+ηi1(T-EFIE)i,
ηe(T-MFIE)e+ηi(T-MFIE)i.
where ηe/i is the wave impedance for the region e or i. For the numerical solution of SIEs, the surfaces are discretized by small planar triangles and (Je, Me) are expanded by Rao-Wilton-Glisson (RWG) [37] basis functions. Using a Galerkin scheme with the RWG testing functions, the CTF can be recast in the matrix form as
Zx=b,or(ZJ,JZJ,MZM,JZM,M){xJxM}={bJbM},
where Z is the impedance matrix of size 2N × 2N with N the number of coefficients of Je or Me. b is the RHS vector obtained by discretizing the incident field. The detailed discretization procedure can be found in literature, such as [38, 39]. After the coefficient vector of effective currents x is obtained, the scattered fields Esca and Hsca at any position and thus the associated RPF can be obtained by the manner described in [40].

 figure: Fig. 1:

Fig. 1: The red blood cell (RBC) model.

Download Full Size | PDF

To investigate a moving particle in optical tweezers, we equivalently fix the particle at the origin of the coordinate system while shifting and tilting the excitation light beams. Therefore, the RPF exerted on a moving particle can be studied by changing the parameters characterizing the beams. Without loss of generality, this work uses Gaussian light beams as the excitation. The parameters to describe a Gaussian beam include: θ, φ, ro, wo, where (θ, φ) denotes the beam direction, ro is the beam center, wo is the beam waist. Using MoM to study the optical tweezers generated by the beams within the parameter space of κ ∈ [κmin, κmax] (κ = θ, φ, ro, wo, or their combinations) amounts to solving a set of systems of linear algebraic equations that are of the matrix form as:

ZX(κ)=B(κ),
where B(κ) is an 2N × Nb known excitation matrix consisting of Nb RHS vectors, X(κ) is the unknown solution matrix. For simplicity, κ and (θ, φ, ro, wo) are mutually used or both dropped when no confusion would arise. If Z−1 is available, (11) can be solved efficiently. However, when N is large, direct evaluation of Z−1 is always unaffordable, both in memory usage and CPU time. The simulation is therefore conventionally limited to electrically small dielectric objects. With the aid of MLFMA, both computational and storage complexities are reduced to O(NlogN) for the iterative solution of each RHS. The total cost of obtaining the solution matrix X is thus O(NbNlogN). For a prescribed resolution on κ, the exact value of Nb is hardly estimated in advance. People always use a large Nb empirically to satisfy the resolution requirement. Consequently, the computation may become expensive even for the small particle. The excitation matrix B is always rank deficient and admits low-rank decomposition, e.g., SVD, QR, to reduce the cost. However, SVD or QR is only efficient for small matrix due to the high computational complexity of the decomposition itself.

3. The proposed fast algorithm

3.1. The skeletonization framework

Most recently, an efficient rank revealing algorithm, interpolative decomposition (ID), has been developed in [28]. ID has been proved much more efficient than SVD or QR in conducting low-rank decomposition [29, 34]. Suppose C is a complex m × n matrix of rank r with rm and rn, ID states that there exists a complex m × r matrix S whose columns consist of a subset of the columns of C and a complex r × n matrix R such that

Cm×nSm×rRr×n,
when the exact rank of Cm×n is greater than r, but the (r + 1)-st greatest singular value of Cm×n is small. Before conducting the decomposition as shown in (12), a threshold εID is often prescribed to control the error of the approximation. In the form of L2-norm, εID measures the difference between C and S · R [28]. The column vectors in S are generally called as skeleton vectors [29]. By applying ID to B(κ), we can reach
B(κ)=BS(κ)R(κ),
where BS(κ) and R(κ) are, respectively, the N × Nskel skeletonized excitation matrix and the Nskel × Nb projection matrix, with Nskel the number of skeleton light beams. Substituting (13) into (11) yields
ZX(κ)=BS(κ)R(κ).
Suppose
XS(κ)=Z1BS(κ),
where XS(κ) is the solution matrix corresponding to BS(κ). According to (14) and (15), X(κ) can be obtained as
X(κ)=XS(κ)R(κ).
Equation (16) states that the complete solution matrix X can be obtained after XS is obtained.

Except for the error arising from the ID skeletonization, no approximation is introduced during the deduction from (11) to (16). Therefore, the accuracy of proposed algorithm can be strictly manipulated by εID since ID is error controllable [28].

The direct implementation of ID in (12) requires [28]

CID=lCH+O(rm+rln)
floating-point operations, where CH is the cost of applying CH to a vector, the superscript “H” denotes transpose conjugate operation. As shown in [28], l = r + 5 or l = r + 10 is sufficient. From the theoretical point of view, r is a constant for a given physical system. If C is dense but rank deficient, the cost can be decreased to (mnlog(r) + r2(m + n)) by making use of randomness [28, 29]. Since the matrix to be decomposed in this work is always sufficiently rank deficient, rn. The time for the ID skeletonization is thus dominated by the term O(mn), which is negligible in comparison with the total solution time as shown in Section 4. Additionally, it increases only linearly with n. When applying ID to B, m is equal to 2N, n is identical to Nb and r is Nskel. The relationship is used by default throughout the paper if it is not pointed out specifically. The peak memory usage of ID (in byte) is about
MID=16(mn+2nm2+17m)16(2mn+17m),
where m2 is the greatest integer less than or equal to m, such that m2 is a positive integer power of 2. To remain the high accuracy of ID, double precision is generally employed. Although the memory required by ID can be released after skeleton beams are figured out, the peak memory consumption can be a bottle-neck as it may exceed what the computer can offer when both N and Nb are large. To fit the memory limit, a two-level scheme is developed.

3.2. The two-level scheme

It is not suggested to decrease MID by reducing Nb at the risk of violating the resolution. Here, a two-level scheme is proposed as the remedy. It is based on the multi-interval variation of the skeletonization algorithm, where n = Nb beams are usually uniformly distributed into multiple intervals. Suppose Nintv is the number of intervals, the number of beams in the i-th interval is ni = n/Nintv. According to (18), the peak memory usage of ID skeletonization decreases linearly with the growth of Nintv. Additionally, skeletonization is conducted independently for each interval, still in the error controllable fashion. Thus, the multi-interval strategy will give rise to no loss of accuracy. The total time used by the skeletonization (to say, O(mn)) does not increase with Nintv because n=iNintvni. However, its side-effect is that the total number of skeleton beams, Nskel, would increase since skeletonization for each single interval can not access the global information in B. Obviously, many intervals would be generated if mn is large. The growth of Nskel will degrade the efficiency of the proposed method. The side-effect can be eliminated by the proposed two-level scheme.

The two-level scheme consists of two stages as shown in Algorithm 1 and 2. The former describes how to carry out the skeletonization, while the latter details the procedure to recover the complete solution matrix X. In both Algorithm 1 and 2, κ denotes one of the parameters (κ = θ, φ, ro, wo) or their combinations. Algorithm 1 begins with dividing the whole parameter space into several intervals. The criterion for the division is usually the memory limit. And then, a loop is performed for the i-th interval to find out the Ni,skel skeleton beams { κiS} within the associated parameter sub-space, and to construct the skeletonized RHS matrix BiS and projection matrix Ri. Next, a new RHS matrix H is produced by collecting all BiS. Through ID skeletonization on H, a set of skeleton beams {κS} are generated and stored in HS. The skeletons {κS} are named as two-level skeletons because they are selected from { κiS}. Algorithm 2 starts from obtaining the skeletonized solution matrix XS by the standard MLFMA computation (other fast methods are also applicable). Next, the solution matrix XH corresponding to { κiS}(i = 1, 2,··· ,Nintv) is computed through the ID procedure. After the skeletonized solution matrix XiS is extracted from XH, it is used to recover Xi for the i-th interval. At last, the complete solution matrix is generated by assembling all Xi’s together.

Tables Icon

Algorithm 1:. the two-level multi-interval strategy–skeletonization stage

Tables Icon

Algorithm 2:. The two-level multi-interval strategy–recovering stage

4. Numerical experiments

The numerical experiments are performed on an IBM server configured by two 6-core X5650 CPUs and 64 gigabytes (GB) memory. In all computations, 12 OpenMP threads are forked to accelerate the MLFMA as one CPU has 6 cores. The generalized minimum residual (GMRES) iteration process is terminated when the L2-norm of the residual vector is reduced to 10−4. εID = 10−4 and κ is sampled uniformly. The Davis-Barton fifth-order approximation electromagnetic fields [40] are used. The wavelength in the background medium is noted by λ in the following. To investigate the error arising from skeletonization, we define the difference of RPF as δRPFp(κ)=|Fbfp(κ)Fskelp(κ)||Fbfp(κ)|, where Fbfp and Fskelp are, respectively, the RPF obtained from the brute-force approach and the proposed fast algorithm, and p denotes one of the Cartesian components. The speed-up is computed by Cspdp=TbfTskel, where Tskel and Tbf are, respectively, the time with and without the skeleton-based algorithm; if brute-force computation is not conducted, Tbf is estimated by assuming the time for the solution of each RHS identical. Tskel and Tbf do not include the time for computing RPF from (Je, Me). It should be noted that Cspdp heavily depends on Nb. In the computations, RPF and incident angle, respectively, is defined in terms of (x, y, z) and (ξ, γ, ζ) coordinate, as shown in Fig. 1 and 2. The two coordinates are such defined that x = −ξ, y = γ and z = −ζ.

 figure: Fig. 2:

Fig. 2: Comparison of z components of RPF by Gaussian beams (Nb = 401, λ = 514.5nm and wo = 2λ) on a prolate slightly volatile silicone oil particle (ε = 2.25) computed with and without the skeletonization. The transversal radius of the particle is b = 1μm while the radius a along z axis varies. The step of ζo is 0.5μm. ”BF” denotes brute-force.

Download Full Size | PDF

4.1. Accuracy and efficiency

Accuracy and efficiency of the proposed two-level scheme is validated by numerical experiments on a set of oil micro-particle models. Specifically, spheroidal particles of slightly volatile silicone oil (relative permittivity ε = 2.25) illuminated by Gaussian beams are considered, as shown in Fig. 2. Being same as those in [17], the ratio of a/b for the three oil particles are, respectively, 1.00, 1.05 and 1.10, where b is fixed at 1μm. All the three particles are discretized by 30924 triangle patches, leading to 92772 unknowns. That is, m in (18) is 92772. The center of the particle is fixed at the coordinate origin and λ = 514.5nm. By setting the incident angle q = 0° and varying zo from −100μm to 100μm with the step of 0.5μm (ξo = 0 and γo = 0), RPF’s are computed by the brute-force manner and by the proposed algorithm. In the brute-force computation, 401 RHS’s have to be solved separately. As a result, Tbf’s for the three cases are, respectively, 280.7, 401.0 and 441.1 minutes. The corresponding Tskel’s are 3.5, 5.0 and 5.5 minutes because only 5 skeleton beams are selected for all the three cases, with the Cspdp of about 80. To investigate the error caused by skeletonization, the relative errors are presented in Fig. 3. As it is shown, the maximum δRPFz is less than 2.0 × 10−4.

 figure: Fig. 3:

Fig. 3: The relative error of z components of RPF by Gaussian beams (Nb = 401, λ = 514.5nm and wo = 2λ) on a prolate slightly volatile silicone oil particle (ε = 2.25) computed with and without the skeletonization. The transversal radius of the particle is b = 1μm while the radius a along z axis varies. The step of ζo is 0.5μm.

Download Full Size | PDF

MID reaches 13986 MB for the computations above because only one interval is used to find the skeletons. In the following, the performance of the two-level scheme is revealed by calculations on the oil particle with a/b = 1.10 through manually setting Nintv to be 2, 3, 4 and 5. The associated statistics is listed in table 1. It can be seen that MID decreases linearly with Nintv at the expense of the monotonic growth of iNintvNi,skel. Definitely, the efficiency would degrade in the original multi-interval computations. However, the degradation is avoided when the two-level scheme is applied because Nskel remains a constant. Theoretically, the number of skeleton beams is a constant if the excitation matrix B delivers sufficient resolution. When multiple intervals are employed to figure out skeletons, Nskel=iNintvNi,skel increases with the number of intervals because the global information is not available in each interval. Since these RHS’s are required to be solved separately, the efficiency of the multi-interval variation decreases with the increasing of Nintv. In the two-level scheme, Nskel becomes independent of Nintv and remains a constant because the global information is successfully recovered during the second level skeletonization. The additional cost of the two-level skeletonization scheme is that the time used by the second level skeletonization. It is always negligible in comparison with those for the Nintv intervals because iNintvNi,skel is much less than Nb, as shown by the statistics. Consequently, the cost of the two-level strategy is insensitive to Nintv and the decrease of efficiency is avoided. The two-level scheme is still error controllable since the ID skeletonization is well manipulated. As shown in Fig. 4, the relative errors for most of the RHS’s are less than 10−4. The maximum δRPFz is only about 8.0×10−4. The two-level scheme makes the choice of Nb easy. The exact Nb is hardly estimated precisely for a prescribed resolution. In practice, it is chosen heuristically. If it is too small, the resolution requirement can hardly be satisfied. In contrast, the efficiency would become a problematic issue if it is unnecessarily large. This problem is not completely overcome by the multi-interval strategy because Nskel=iNintvNi,skel grows with Nintv. However, it is substantially solved because Nskel is independent of Nb as well as Nintv. To validate the analysis, we increase Nb to 801 for the computation on the particle with a/b = 1.10. The brute-force computation requires 884.3 minutes to solve all the RHS’s while the computation with the two-level scheme remains 5.5 minutes, the same as the case with Nb = 401, since Nskel is still 5. The accuracy is either under control as shown in Fig. 5.

Tables Icon

Table 1:. Statistics of computations on the spheroid oil particle with a/b = 1.10 when different number of intervals are employed in the two-level scheme.

 figure: Fig. 4:

Fig. 4: The relative error of z components of RPF when different number of intervals are employed to figure out the skeleton beams (b = 1.00μm and a = 1.10μm; the other parameters are the same as those for the computations in Fig. 2).

Download Full Size | PDF

 figure: Fig. 5:

Fig. 5: z components of RPF (Nb = 801, the parameters are the same as those for Fig. 2).

Download Full Size | PDF

4.2. Capability

As discussed in Section 1, the problem of evaluating RPF exerted on a moving particle by a single excitation beam can be converted into that of computing RPF’s exerted on a static particle by multiple beams. The latter is much easier to be implemented than the former. With the conversion, numerical experiments on a moving red blood cell (RBC) model in water is conducted to demonstrate the capability of the proposed algorithm, as shown in Fig. 1. An ordinary RBC is always modeled by a rotationally-symmetric biconcave surface defined similar to [20, 41],

r(Θ,Φ)=asinsΘ+b
where s = 5, a = 3.30μm, and b = 0.55μm. Different from the model depicted by (19), the dimensions along x and y of our model are, respectively, scaled by the factors of 1.8 and 0.5. Additionally, the angle Θc is equal to 10°, as shown in Fig. 1. The volume of the model remains 80.9μm3. The relative permittivity ε is 1.3 for the water and 1.4 for the RBC model. For all Gaussian beams, λ = 532nm, wo = 2λ, ξo = 0 and γo = 0. Multiple Gaussian beams are produced by varying θ, φ and ζo. To be specific, the ranges of θ and φ are [45°, 55°] and [85°, 95°]. The angle step is 0.2° along both θ and φ. ζo varies within the range of [−2λ, 2λ] with the step of 0.1λ. As a result, 106 641 = 51 × 51 × 41 RHS’s are involved in the computation. For such a large target, iterative solution of each RHS costs about half an hour for the MLFMA. Consequently, the brute-force computation requires more than 53 320 hours to complete the iteration. However, the time can be substantially decreased by the skeletonization algorithm because Nskel is only 63. The time used to figure out the skeletons is only about 12 hours for such a challengeable problem. Figure 6 and 7 presents the RPF varies with φ and ζo when θ is fixed at 45°, 50° and 55°. In the figure, the unit of force is 10−9N/W.

 figure: Fig. 6:

Fig. 6: x and y components of RPF (denoted by Fx and Fy) exerted on the RBC model when θ, φ and ζo of the beam vary within [45°, 55°], [85°, 95°] and [−2λ, 2λ]. The step of angle and distance is, respectively, 0.2° and 0.1λ. The unit of force is 10−9N/W.

Download Full Size | PDF

 figure: Fig. 7:

Fig. 7: z components of RPF (Fz) exerted on the RBC model (The other parameters are the same as those in 6).

Download Full Size | PDF

Due to the complex shape of the particle, x, y and z components of RPF behave quite differently. It is interesting to see that strong inverse RPF appears within some particular attitudes. With both positive and negative RPF’s, the complicated motion/rotation of the particle can be realized and manipulated. Theoretically, positive force is caused by reflected fields while negative force results from refracted fields. Inverse RPF is obtained if negative force is stronger than positive one [9,42]. However, the property of the fields and thus the RPF is dependent on many factors including the constitutive parameters of the particle and the background, the shape of the particle, the profile of the incident beam, the relative position between the particle and the beam, etc. Detailed analysis and explanation on RPF for moving particles with complex shapes require a large volume of numerical experiments. Indeed, it is one of our undergoing research work.

5. Conclusion

The proposed algorithm conducts low-rank decomposition on the excitation matrix consisting of all RHS’s to figure out the so-called skeleton light beams by interpolative decomposition (ID). A two-level skeletonization scheme is proposed to overcome the bottleneck associated with the peak memory usage during the ID skeletonization. Numerical experiments show that the proposed algorithm is efficient and error controllable. Although not demonstrated here, the performance of the algorithm is not sensitive to shape of moving particles or the excitation light beams because the ID skeletonization is a pure algebraic tool.

Acknowledgments

This work was partly supported by Program for New Century Excellent Talents in University under Grant NCET-12-0045, by the 973 Program under Grant 2012CB720702, by the NSFC under Grants 60901005 and 61371002, and by the Excellent Scholars Support Fund of Beijing under Grant 2012D009011000002.

References and links

1. K. C. Neuman and S. M. Block, “Optical trapping,” Rev. Sci. Instrum. 75, 2787–2809 (2004). [CrossRef]  

2. A. Ashkin, J. M. Dziedzic, J. E. Bjorkholm, and S. Chu, “Observation of a single-beam gradient force optical trap for dielectric particles,” Opt. Lett. 11, 288–290 (1986). [CrossRef]   [PubMed]  

3. A. Ashkin, J. M. Dziedzic, and T. Yamane, “Optical trapping and manipulation of single cells using infrared-laser beams,” Nature 330, 769–771 (1987). [CrossRef]   [PubMed]  

4. O. M. Marago, P. H. Jones, P. G. Gucciardi, G. Volpe, and A. C. Ferrari, “Optical trapping and manipulation of nanostructures,” Nature Nanotech. 8, 807–819 (2013). [CrossRef]  

5. H. Shpaisman, D. B. Ruffner, and D. G. Grier, “Light-driven three-dimensional rotational motion of dandelion-shaped microparticles,” Appl. Phys. Lett. 102, 071103 (2013). [CrossRef]  

6. G. Roosen and C. Imbert, “Optical levitation by means of two horizontal laser beams: A theoretical and experimental study,” Phys. Lett. A 59, 6–8 (1976). [CrossRef]  

7. J. S. Kim and S. S. Lee, “Radiation pressure on a dielectric sphere in a gaussian laser beam,” Opt. Acta 29, 801–806 (1982). [CrossRef]  

8. K. F. Ren, G. Greha, and G. Gouesbet, “Radiation pressure forces exerted on a particle arbitrarily located in a gaussian beam by using the generalized lorenz-mie theory, and associated resonance effects,” Opt. Commun. 108, 343–354 (1994). [CrossRef]  

9. K. F. Ren, G. Grehan, and G. Gouesbet, “Prediction of reverse radiation pressure by generalized lorenz-mie theory,” Appl. Opt. 35, 2702–2710 (1996). [CrossRef]   [PubMed]  

10. F. Xu, K.-F. Ren, G. Gouesbet, X.-S. Cai, and G. Grehan, “Theoretical prediction of radiation pressure force exerted on a spheroid by an arbitrarily shaped beam,” Phys. Rev. E 75, 026613 (2007). [CrossRef]  

11. B. T. Draine and P. J. Flatau, “Discrete-dipole approximation for scattering calculations,” J. Opt. Soc. Am. A 11, 1491–1499 (1994). [CrossRef]  

12. S. H. Simpson and S. Hanna, “Computational study of the optical trapping of ellipsoidal particles,” Phys. Rev. A 84, 053808 (2011). [CrossRef]  

13. C.-F. Kuo and S.-C. Chu, “Numerical study of the properties of optical vortex array laser tweezers,” Opt. Express 21, 26418–26431 (2013). [CrossRef]   [PubMed]  

14. M. I. Mishchenko, “Radiation force caused by scattering, absorption, and emission of light by nonspherical particles,” J. Quant. Spectrosc. Radiat. Transfer 70, 811–816 (2001). [CrossRef]  

15. F. Borghese, P. Denti, R. Saija, and A. l. Maria, “Optical trapping of nonspherical particles in the t-matrix formalism,” Opt. Express 15, 11984–11998 (2007). [CrossRef]   [PubMed]  

16. L. Bi and P. Yang, “Modeling of light scattering by biconcave and deformed red blood cells with the invariant imbedding T-matrix method,” J. Bio. Opt. 18, 055001 (2013). [CrossRef]  

17. M. L. Yang, K. F. Ren, M. J. Gou, and X. Q. Sheng, “Computation of radiation pressure force on arbitrary shaped homogenous particles by multilevel fast multipole algorithm,” Opt. Lett. 38, 1784–1786 (2013). [CrossRef]   [PubMed]  

18. R. Coifman, V. Rokhlin, and S. Wandzura, “The fast multipole method for the wave equation: a pedestrian prescription,” IEEE Antennas Propag. Mag. 35, 7–12 (1993). [CrossRef]  

19. X. M. Pan, W. Pi, M. L. Yang, Z. Peng, and X. Q. Sheng, “Solving problems with over one billion unknowns by the mlfma,” IEEE Trans. Antennas Propag. 60, 2571–2574 (2012). [CrossRef]  

20. O. Ergul, A. Arslan-Ergul, and L. Gurel, “Computational study of scattering from healthy and diseased red blood cells,” J. Bio. Opt. 15, 045004(2010). [CrossRef]  

21. X. Q. Sheng, J. M. Jin, J. Song, W. C. Chew, and C. C. Lu, “Solution of combined-field integral equation using multilevel fast multipole algorithm for scattering by homogeneous bodies,” IEEE Trans. Antennas Propag. 46, 1718–1726 (1998). [CrossRef]  

22. X. Wang and D. H. Werner, “Improved model-based parameter estimation approach for accelerated periodic method of moments solutions with application to the analysis of convoluted frequency selected surfaces and metamaterials,” IEEE Trans. Antennas Propag. 58, 122–131 (2010). [CrossRef]  

23. B.-Y. Wu and X. Q. Sheng, “Application of asymptotic waveform evaluation to hybrid FE-BI-MLFMA for fast RCS computation over a frequency band,” IEEE Trans. Antennas Propag. 61, 2597–2604 (2013). [CrossRef]  

24. A. Schroder, H. D. Bruxns, and C. Schuster, “A hybrid approach for rapid computation of two-dimensional monostatic radar cross section problems with the multilevel fast multipole algorithm,” IEEE Trans. Antennas Propag. 60, 6058–6061 (2012). [CrossRef]  

25. P. D. Ledger and K. Morgan, “An adjoint enhanced reduced-order model for monostatic RCS computation,” Electromagnetics 28, 54–76 (2008). [CrossRef]  

26. P. Zhen, M. B. Stephanson, and J. F. Lee, “Fast computation of angular responses of large-scale three-dimensional electromagnetic wave scattering,” IEEE Trans. Antennas Propag. 58, 3004–3012 (2010). [CrossRef]  

27. X. M. Pan and X. Q. Sheng, “Fast computation of two-dimensional spatial electromagnetic scattering from large-scale targets,” Computational Electromagnetics Workshop (CEM), 2013 pp. 1–3 (2013). [CrossRef]  

28. E. Liberty, F. Woolfe, P. G. Martinsson, V. Rokhlin, and M. Tygert, “Randomized algorithms for the low-rank approximation of matrices,” Proc. Natl. Acad. Sci. USA 104, 20167–20172 (2007). [CrossRef]   [PubMed]  

29. N. Halko, P. G. Martinsson, and J. A. Tropp, “Finding structure with randomness: Probabilistic algorithms for constructing approximate matrix decompositions,” SIAM Rev. 53, 72 (2011). [CrossRef]  

30. X. M. Pan and X. Q. Sheng, “Improved algebraic preconditioning for mom solutions of large-scale electromagnetic problems,” IEEE Antennas Wireless Propag. Lett. 13, 106–109 (2014). [CrossRef]  

31. K. L. Ho and L. Greengard, “A fast direct solver for structured linear systems by recursive skeletonization,” SIAM J. Sci. Comput. 34, A2507–A2532 (2012). [CrossRef]  

32. X. M. Pan and X. Q. Sheng, “Hierarchical interpolative decomposition multilevel fast multipole algorithm for dynamic electromagnetic simulations,” Progr. Electromagn. Res. 134, 79–94 (2013). [CrossRef]  

33. X. M. Pan and X. Q. Sheng, “Preconditioning technique in the interpolative decomposition multilevel fast multipole algorithm,” IEEE Trans. Antennas Propag. 61, 3373–3377 (2013). [CrossRef]  

34. X. M. Pan, J. G. Wei, Z. Peng, and X. Q. Sheng, “A fast algorithm for multiscale electromagnetic problems using interpolative decomposition and multilevel fast multipole algorithm,” Radio Sci. 47, RS1011 (2012). [CrossRef]  

35. M. G. Araujo, J. M. Taboada, D. M. Solis, J. Rivero, L. Landesa, and F. Obelleiro, “Comparison of surface integral equation formulations for electromagnetic analysis of plasmonic nanoscatterers,” Opt. Express 20, 9161–9171 (2012). [CrossRef]   [PubMed]  

36. L. Landesa, M. G. Araujo, J. M. Taboada, L. Bote, and F. Obelleiro, “Improving condition number and convergence of the surface integral-equation method of moments for penetrable bodies,” Opt. Express 20, 17237–17249 (2012). [CrossRef]  

37. S. M. Rao, D. R. Wilton, and A. W. Glisson, “Electromagnetic scattering by surfaces of arbitrary shape,” IEEE Trans. Antennas Propag. 30, 409–418 (1982). [CrossRef]  

38. P. Yla-Oijala, M. Taskinen, and S. Jarvenpaa, “Surface integral equation formulations for solving electromagnetic scattering problems with iterative methods,” Radio Sci. 40, RS6002 (2005). [CrossRef]  

39. O. Ergul and L. Gurel, “Comparison of integral-equation formulations for the fast and accurate solution of scattering problems involving dielectric objects with the multilevel fast multipole algorithm,” IEEE Trans. Antennas Propag. 57, 176–187 (2009). [CrossRef]  

40. J. P. Barton and D. R. Alexander, “Fifth-order corrected electromagnetic field components for a fundamental gaussian beam,” J. Appl. Phys. 66, 2800–2802 (1989). [CrossRef]  

41. J. Q. Lu, P. Yang, and X.-H. Hu, “Simulations of light scattering from a biconcave red blood cell using the finite-difference time-domain method,” J. Bio. Opt. 10, 024022 (2005). [CrossRef]  

42. T. C. B. Schut, G. Hesselink, B. G. De Grooth, and J. Greve, “Experimental and theoretical investigations on the validity of the geometrical optics model for calculating the stability of optical traps,” Cytometry 12, 479–485 (1991). [CrossRef]   [PubMed]  

Cited By

Optica participates in Crossref's Cited-By Linking service. Citing articles from Optica Publishing Group journals and other participating publishers are listed here.

Alert me when this article is cited.


Figures (7)

Fig. 1:
Fig. 1: The red blood cell (RBC) model.
Fig. 2:
Fig. 2: Comparison of z components of RPF by Gaussian beams (Nb = 401, λ = 514.5nm and wo = 2λ) on a prolate slightly volatile silicone oil particle (ε = 2.25) computed with and without the skeletonization. The transversal radius of the particle is b = 1μm while the radius a along z axis varies. The step of ζo is 0.5μm. ”BF” denotes brute-force.
Fig. 3:
Fig. 3: The relative error of z components of RPF by Gaussian beams (Nb = 401, λ = 514.5nm and wo = 2λ) on a prolate slightly volatile silicone oil particle (ε = 2.25) computed with and without the skeletonization. The transversal radius of the particle is b = 1μm while the radius a along z axis varies. The step of ζo is 0.5μm.
Fig. 4:
Fig. 4: The relative error of z components of RPF when different number of intervals are employed to figure out the skeleton beams (b = 1.00μm and a = 1.10μm; the other parameters are the same as those for the computations in Fig. 2).
Fig. 5:
Fig. 5: z components of RPF (Nb = 801, the parameters are the same as those for Fig. 2).
Fig. 6:
Fig. 6: x and y components of RPF (denoted by Fx and Fy) exerted on the RBC model when θ, φ and ζo of the beam vary within [45°, 55°], [85°, 95°] and [−2λ, 2λ]. The step of angle and distance is, respectively, 0.2° and 0.1λ. The unit of force is 10−9N/W.
Fig. 7:
Fig. 7: z components of RPF (Fz) exerted on the RBC model (The other parameters are the same as those in 6).

Tables (3)

Tables Icon

Algorithm 1: the two-level multi-interval strategy–skeletonization stage

Tables Icon

Algorithm 2: The two-level multi-interval strategy–recovering stage

Tables Icon

Table 1: Statistics of computations on the spheroid oil particle with a/b = 1.10 when different number of intervals are employed in the two-level scheme.

Equations (20)

Equations on this page are rendered with MathJax. Learn more.

E e sca ( r , J e , M e ) + E inc = { 0 if r Ω i E e tot , if r Ω e
H e sca ( r , J e , M e ) + H inc = { 0 if r Ω i H e tot , if r Ω e
E i sca ( r , J i , M i ) = { E i tot , if r Ω i 0 , if r Ω e
H i sca ( r , J i , M i ) = { H i tot , if r Ω i 0 if r Ω e
E q sca ( r , J q , M q ) = η q q ( J q ) 𝒦 q ( M q ) ,
H q sca ( r , J q , M q ) = η q 1 q ( M q ) + 𝒦 q ( J q ) ,
q { X } ( r ) = j k q S d S [ I + 1 k q 2 ] X ( r ) g q ( r , r ) ,
𝒦 q { X } ( r ) = Ω ( r ) 4 π X ( r ) + × S d S g q ( r , r ) X ( r ) ,
η e 1 ( T-EFIE ) e + η i 1 ( T-EFIE ) i ,
η e ( T-MFIE ) e + η i ( T-MFIE ) i .
Z x = b , or ( Z J , J Z J , M Z M , J Z M , M ) { x J x M } = { b J b M } ,
Z X ( κ ) = B ( κ ) ,
C m × n S m × r R r × n ,
B ( κ ) = B S ( κ ) R ( κ ) ,
Z X ( κ ) = B S ( κ ) R ( κ ) .
X S ( κ ) = Z 1 B S ( κ ) ,
X ( κ ) = X S ( κ ) R ( κ ) .
C ID = l C H + O ( r m + r l n )
M ID = 16 ( m n + 2 n m 2 + 17 m ) 16 ( 2 m n + 17 m ) ,
r ( Θ , Φ ) = a sin s Θ + b
Select as filters


Select Topics Cancel
© Copyright 2024 | Optica Publishing Group. All rights reserved, including rights for text and data mining and training of artificial technologies or similar technologies.