Recovering the phase and amplitude of X-ray FEL pulses using neural networks and differentiable models

D. Ratner; F. Christie; J. P. Cryan; J. P. Cryan; A. Edelen; A. Lutman; X. Zhang

doi:10.1364/OE.432488

1. Motivation

X-ray free-electron lasers (XFELs) produce femtosecond pulses of X-rays for a wide variety of photon science applications. The Linac coherent light source (LCLS) [1] uses the self-amplified spontaneous emission (SASE) process to generate a train of 120 independent pulses per second, with each pulse itself composed of statistically independent spikes. While most current XFEL applications assume only average properties of the pulses, in principle new science could emerge with pulse-to-pulse measurements of the individual X-ray spikes as a function of time. For example, probing ultrafast dynamics of linear X-ray-matter interactions requires only knowledge of the X-ray arrival times (delays) and possibly the X-ray spectrum. To this end, we previously investigated applying time-domain ghost imaging to linear pump-probe experiments, using the fluctuating separation of SASE spikes to recover dynamics of a sample [2]. However, if one hopes to exploit more powerful X-ray analogues of optical nonlinear spectroscopies [3–6] precise knowledge of the full electric field of the X-ray pulse is necessary, i.e., both the amplitude and the phase. For example, in the recently-demonstrated stimulated X-ray Raman scattering technique [7–9], the relative phase between the field at the pump and Stokes transitions can alter the population transfer dynamics. Beyond this basic non-linearity, higher-dimensional measurements can provide detailed information of time-dependent electronic structure with atomic site-specificity [10].

At present, direct measurement of full X-ray fields is not feasible in general, and even time-domain measurement of the power remains a challenge. Recent single-shot full-field measurements at FERMI used a variation of FROG [11], but this technique has yet to be demonstrated in the X-ray regime or for the higher number of modes present in SASE XFELs and requires a challenging experimental setup. At the shortest pulse lengths (sub-10fs) the power profile can be retrieved by THz or infrared streaking of photoelectrons generated by the X-ray interaction with a gas [12–16]. At longer pulse lengths, indirect measurement is possible by observing the pattern of energy loss in the electron beam that generated the X-rays using an X-band transverse cavity (XTCAV) [17–19]. The XTCAV is the dominant method used for pulse measurement at LCLS, but the resolution is not sufficient to observe individual SASE spikes except at the longest wavelengths, and the XTCAV measurement contains no phase information. On the other hand, frequency-domain measurements are relatively straightforward [20,21], and there is a long history in optics of applying iterative phase retrieval to spectral measurements to reconstruct ultrafast pulses (see e.g., FROG, SPIDER, and more recent variations [22–28]). The same concept can be applied to XFELS, combining corrupted measurements of the X-ray power in both the time and frequency domains to improve the resolution of the time-domain measurement [29,30]. In this paper we extend the work of [30] to reconstruct the full field, giving not only higher resolution of the time-domain power, but also revealing the relative phase between spikes. We do so with an efficient machine-learning implementation that can run at the full beam-rate of future MHz XFELs.

2. Pulse reconstruction task and proposed method

Formally, our pulse reconstruction scheme seeks to recover the X-ray field, $\boldsymbol {f}(t) \in \mathbb {C}^{p}$, given the time domain power, $\boldsymbol {P}(t)\in \mathbb {R}^{m_t}$, and the spectral power, $\boldsymbol {S}(\nu )\in \mathbb {R}^{m_\nu }$, where each measurement is divided into $m_t$ and $m_\nu$ discrete time and frequency measurements respectively and the reconstructed field has $p$ discrete points. In practice, we are only given a corrupted measurement of both domains, which we will denote $\tilde {\boldsymbol {P}}(t), \tilde {\boldsymbol {S}}(\nu )$, and our goal is to find a mapping

(1)$$\tilde{\boldsymbol{P}}(t),\tilde{\boldsymbol{S}}(\nu) \rightarrow \boldsymbol{f}(t) \,.$$

Note that recovery of a unique phase is not strictly guaranteed for one-dimensional functions. However, the correct solution can often be found given sufficient constraints from the time and frequency properties of the function, and it has been shown [31] that a one-dimensional function is nearly uniquely determined from its modulus and the modulus of its Fourier transform. For the case of pulse reconstruction, we must accept certain unavoidable ambiguities, for example the overall phase of the field or time-reversal of the phase for time-symmetric pulses. In the special case of a multi-spike SASE FEL pulse, which can be modeled as a sum of finite, independent Gaussian pulses, we will see that recovery is often feasible and unique.

Equation (1) is an inverse problem: while the mapping from power to field is difficult to learn, we can write down a ‘forward’ mapping $\boldsymbol {\Phi }: \boldsymbol {f}(t) \rightarrow \tilde {\boldsymbol {P}}(t),\tilde {\boldsymbol {S}}(\nu )$ that, if given the true field, can predict the associated corrupted measurements:

(2)$$\begin{aligned} \tilde{\boldsymbol{P}}(t) &= | \boldsymbol{f}(t) |^{2} \star {R_t} + \boldsymbol{\mathcal{N}}_t\\ \tilde{\boldsymbol{S}}(\nu) &= | \mathcal{FT}\{\boldsymbol{f}(t)\} |^{2} \star {R_\nu} + \boldsymbol{\mathcal{N}}_\nu , \end{aligned}$$

where ${R_t}$ and ${R_\nu }$ are the measurement resolution functions in the time and frequency domains respectively, $\boldsymbol {\mathcal {N}}_t$ and $\boldsymbol {\mathcal {N}}_\nu$ are measurement Gaussian noise in the time and frequency domains respectively, $\mathcal {FT}$ denotes a Fourier transform, and ‘$\star$’ denotes a convolution. Inverting Eq. (2) is difficult due to the loss of phase information. The existing state-of-art for X-ray FEL pulse recovery [30] works by guessing a solution for $\boldsymbol {f}(t)$, and iteratively updating that guess to improve consistency with Eq. (2). The problem is that the iterative task must be repeated for every pulse; with the LCLS-II upgrade operating at up to a MHz (generating 10 s of billions of pulses for a single experiment) and each iterative solution taking seconds to converge (see Appendix, Section C), recovering the field for every pulse in an experiment is unrealistic.

Instead, we will train a neural network (NN) inverse mapping, an approach that has found success for inverse problems in other areas of physics [32], including recent works for two-dimensional phase retrieval [33–36] and accelerators [37]. Though NN training is computationally expensive, inference requires only a single forward-pass through the network, potentially many orders of magnitude faster than the existing iterative solution. Neural networks also can leverage prior information contained in the training set (e.g., the FEL coherence length) which could both recover a higher resolution amplitude and also reveal the phase of the field. In our case, the NN’s input features are the power measurements, $\boldsymbol {x}\!\!: \tilde {\boldsymbol {P}}(t),\tilde {\boldsymbol {S}}(\nu )$, and the labels are the complex field, $\boldsymbol {y}\!\!: \boldsymbol {f}(t)$. The complex field labels, $\boldsymbol {y}$, consist of either the field’s real and imaginary components, $\mathcal {R}\{\boldsymbol {f}(t)\}, \mathcal {I}\{\boldsymbol {f}(t)\}$, or by writing the field as $\boldsymbol {f}(t) = \boldsymbol {A}(t) \exp (i\boldsymbol {\phi })$, the amplitude and phase, $\boldsymbol {A}, \boldsymbol {\phi }$; in the first case we form $\boldsymbol {y}$ by concatenating $\mathcal {R}\{\boldsymbol {f}(t)\}$ and $\mathcal {I}\{\boldsymbol {f}(t)\}$, and in the second case by concatenating $\boldsymbol {A}$ and $\boldsymbol {\phi }$, in either representation making a vector of $2p$ labels for each example. We then train the network by minimizing the difference between the predicted labels, $\boldsymbol {y}_{\mathrm {pred}}$, and the ground truth labels, $\boldsymbol {y}_{\mathrm {GT}}$, with a mean absolute error loss function

(3)$$\mathcal{L_{\mathrm{mae}}} = \frac{1}{2np} \sum_i^{n} \sum_j^{2p} {\bigg|} \boldsymbol{y}_{\mathrm{pred}}^{i}(j) - \boldsymbol{y}_{\mathrm{GT}}^{i}(j) {\bigg|} ,$$

where the sum is over the $n$ training examples and the $2p$ labels in each example. For the amplitude/phase representation we again use mean absolute error for the amplitudes, but use a modified loss function on the phase portion of the labels

(4)$$\begin{aligned} \mathcal{L}_{\mathrm{amp}} &= \frac{1}{np} \sum_i^{n} \sum_j^{p} {\bigg|} \boldsymbol{A}_{\mathrm{pred}}^{i} (j) - \boldsymbol{A}_{\mathrm{GT}}^{i} (j) {\bigg|}\\ \mathcal{L}_{\mathrm{sine}} &= \frac{1}{np} \sum_i^{n} \sum_j^{p} {\bigg|} \boldsymbol{A}_{\mathrm{GT}}^{i} (j) \,\sin\!{\bigg[}\frac{\phi_{\mathrm{pred}}^{i}(j) - \phi_{\mathrm{GT}}^{i}(j) }{2} {\bigg]}{\bigg|} \,. \end{aligned}$$

The sine loss was designed so that a phase error of $\pi$ has maximal penalty, and an offset of 2$\pi$ is not penalized. Including $\boldsymbol {A}_{\mathrm {GT}}$ ensures that phase errors are ignored in regions with zero amplitude where the phase is undefined. In the end, we will train with the real/imaginary representation, but the phase representation is useful for evaluating the performance separately on amplitude and phase components.

The difficulty in training a NN with Eq. (3) again lies in the phase recovery: because the absolute phase cannot be measured, each pulse corresponds to a family of solutions

(5)$$\boldsymbol{\phi} \in \{\boldsymbol{\phi}_0 + \boldsymbol{\xi}\} \,\,\, \mathrm{with} \,\,\,\boldsymbol{\xi} \, \mathrm{mod} \,2\pi = \mathrm{const.},$$

with $\boldsymbol {\xi }$ a manifold of solutions that are all equally valid. During training, there is no way for the network to know which particular solution from the manifold was chosen as the label, and the loss between the given label and another equally valid solution on the same manifold could be as large as that of an incorrect solution on a different manifold. Figures 1(c) and 1(d) illustrate an example: given the solid lines as labels, Eq. (3) would strongly penalize a NN for producing the equall valid dashed lines.

Fig. 1. We simulate power measurements in the time domain (a) and spectral domain (b) including noise and measurement resolution. Our goal is to recover the complex field, represented either as the real and imaginary components (c) or equivalently the phase and amplitude (d). The solid and dashed lines show fields with different phases that map to the identical power; both fields should be considered correct solutions to the inverse problem.

Download Full Size | PDF

In this paper, we employ two solutions to handle the arbitrary absolute phase. First, we select a single solution on the phase manifold so that each example corresponds to a well-defined label. We chose to define $\phi =0$ (or equivalently purely real field) in the center of the pulse, and then train using Eq. (3). Though the definition collapses the phase manifold to a single point, if the center of the pulse is a waist between spikes, the phase may be changing rapidly, which will amplify errors. We will call this first approach the ‘$\mathcal {L_{\mathrm {mae}}}$ NN.’

Second, rather than restricting the labels to a single point on the manifold, we instead map the entire manifold back onto the well-defined input space, and define the loss function on the input space. The loss function then becomes

(6)$$\mathcal{L}_{\boldsymbol{\Phi}} = \sum_i^{n} \sum_j^{m_t+m_\nu} {\bigg|} \boldsymbol{\Phi}(\boldsymbol{y}_{\mathrm{pred}}^{i} )(j) - \boldsymbol{x}^{i}(j) {\bigg|}^{2} ,$$

where $\boldsymbol {\Phi }$ is the forward model defined by Eq. (2). This second approach is functionally equivalent to the physics-informed neural network (PINN), which was previously introduced to solve inverse problems in the context of partial-differential equations [38]. We will use the PINN terminology going forward.

We can understand the PINN approach through the concept of generative NNs: rather than trying to predict a specific label, the NN learns to draw samples from the correct manifold. We start by considering a generative adversarial network (GAN) [39], which consists of two networks: a generator that creates new examples given a random ‘latent’ variable, and a discriminator that classifies examples as from the training set or from the generator; the training process rewards the discriminator for correctly classifying examples, and rewards the generator when it tricks the discriminator. In our case, the first network would need to be a “conditional” generator [40], i.e. it would take power measurements $\tilde {\boldsymbol {P}}(t),\tilde {\boldsymbol {S}}(\nu )$ as inputs, and then generate corresponding example fields $\mathcal {R}\{\boldsymbol {f}(t)\}, \mathcal {I}\{\boldsymbol {f}(t)\}$ as outputs. The generators task is to learn to produce outputs drawn from the manifold corresponding to the inputs.

In a typical GAN, the discriminator’s role is to identify examples drawn from an incorrect manifold. In our setup we can replace the learned NN discriminator with the known, physical forward model, Eq. (2), which maps the generator’s output back onto the measured input power space. This forward model loss function plays the same role as the discriminator, penalizing solutions from the generator that are drawn from the wrong manifold. Crucially, by writing the forward model in a differentiable language (in this case PyTorch), we can directly apply the forward model to the NN output and train with standard NN platforms.

Before continuing, we note that similar forward-model approaches have been implemented over the last few years. For example, automatic differentiation has been used to incorporate physical models into inverse problems [34,41–43], and is expected to play a growing role in machine learning for the sciences broadly [44]. We also note similarity of our approach to the recently published phaseGAN [36], which is in turn derived from the cycleGAN architecture [45]; both make use of untrained models to build loss functions without labels. In a similar spirit, recent works have used another type of generative model known as a variational autoencoder to resolve ambiguities in signal reconstruction [46,47]. Finally, we note that in our example we do not want to map out the full manifold, so there is no need to input a random latent variable. As such our architecture is not strictly-speaking a generative network, and instead becomes equivalent to a PINN.

We present both training architectures in Fig. 2. We highlight that Eq. (6) does not include $\boldsymbol {y}_{\mathrm {GT}}$, meaning that the PINN is trained without labeled data; the only constraint is that the predictions must be consistent with the measurements, as defined by $\boldsymbol {\Phi }$. We can further extend the PINN concept to also force the predictions to obey statistical properties. For example, FEL pulses have a correlation length given by the Pierce $\rho$-parameter, which can be observed through the first order correlation function [48]

(7)$$g_1(\tau) = \frac{\langle \boldsymbol{f}(t) \boldsymbol{f}(t-\tau)^{*} \rangle }{ \sqrt{\langle |\boldsymbol{f}(t)| \rangle \langle |\boldsymbol{f}(t-\tau)| \rangle}} ,$$

where the expectation values are taken over time, $t$, and $\tau$ is the time-domain delay. Just as Eq. (6) forces the solution to be consistent with the measurements, the $\mathcal {L}_{g1}$ loss penalizes solutions for not matching the expected $g_1(\tau )$ function:

(8)$$\mathcal{L}_{g1} = \sum_\tau^{d}{\bigg|} g_{1\mathrm{pred}}(\tau) - g_{1\mathrm{GT}}(\tau) {\bigg|}^{2} ,$$

where $d$ is the number of delays in the comparison. The ground truth correlation function, $g_{1\mathrm {GT}}$, can be approximated analytically from the measured FEL gain length (see e.g., Eq. 6.67 in [48]) or calculated numerically from a small number of simulations, so as in Eq. (6) there is no need for a full labeled dataset. Note that $g_1$ is a statistical property, and the expectation value in Eq. (7) is taken over all examples in each training batch, so there is no explicit summation over examples in Eq. (8).

Fig. 2. Schematic of the training flow. At left, the input to the NN inverse model is the power and spectrum from either measured or simulated pulses. The NN outputs are the real and imaginary field components (center, green). If using simulations, a standard labeled loss function (e.g., Eq. (3)) updates the NN parameters to minimize the difference between predicted and simulated fields. Alternatively, a physics forward model, $\boldsymbol {\Phi }$, maps the predicted outputs into predicted inputs (right, green), and an unlabeled loss function (Eq. (6)) updates the NN parameters by comparison to the true inputs. Additional loss terms can be imposed directly on the statistical properties of the predictions, for example the first-order correlation, $g_1$ (Eq. (7)). Combinations of all three loss functions are also possible.

Download Full Size | PDF

3. Results and discussion

We have implemented NNs using both a standard labeled loss function with $\mathcal {L_{\mathrm {mae}}}$, as well as a PINN trained with $\mathcal {L}_{\boldsymbol {\Phi }}$. The simulation parameters are taken from typical short-pulse, soft X-ray mode at LCLS, with 2.5 nm wavelength, 8-12 fs pulse length, and 2-5 spikes per pulse. The training set consists of 120k 1D FEL simulations [49], with a split of 100k/10k/10k for training, validation, and testing. The $\mathcal {L_{\mathrm {mae}}}$ was trained with both real/imaginary and amplitude/phase labels and both had similar performance, so we only present the real/imaginary-trained NN here. We run simulations with various diagnostic resolutions ${R_t}$ and ${R_\nu }$, and apply measurement noise of 2% during training and testing. Because the forward model may not be known perfectly, particularly for the measured ${R_t}$ resolution function of the time-domain measurements, we introduce errors during training to make the network less susceptible to ${R_t}$ errors during inference. In the examples given, we train with an average ${R_t}$ 10% higher than the value used in the test set. We then randomly vary the value of ${R_t}$ by 15% rms during creation of the training dataset for both methods, and additionally in the loss function (Eq. (2)) while training the PINN. Note that robustness to systematic errors has not been studied and requires more assessment.

We start by assuming FWHM resolutions of ${R_t}=3$ fs for the XTCAV and ${R_\nu }=80$ meV for the spectrometer, with results shown in Figs. 3 and 4. Table 1 gives summary statistics for other resolution values. To give a sense for the quality of predictions, we show additional pulse examples in the appendix, Figs. 5 and 6. We find that training with $\mathcal {L_{\mathrm {mae}}}$ gives the best performance. However, some users may still prefer the PINN because it takes only unlabeled inputs for training, and thus is trainable directly on experimental data; there is no need to simulate a new training dataset matching the exact experimental conditions, and the parameters of the training set are guaranteed to match those of the test data. The PINN is also more resistant to over-training, with equal losses for training and test sets. A combined training, using both $\mathcal {L}_{\boldsymbol {\Phi }}$ and $\mathcal {L}_{g1}$ loss functions and with $\mathcal {L_{\mathrm {mae}}}$ applied to the amplitude portion of the outputs, had the best performance predicting field amplitudes (see Appendix, Section A). The higher accuracy of the labeled NNs compared to the PINN is expected; because the labeled NNs are trained on pulses drawn from the same distribution as the test data, these models effectively have a strong prior influencing the field reconstruction. The poor temporal resolution of the power measurement increases the impact of the $\mathcal {L_{\mathrm {mae}}}$ NN’s prior. While using the $\mathcal {L}_{g1}$ loss gives the PINN weak prior information, additional physical constraints may be needed for the PINN’s performance to equal that of the $\mathcal {L_{\mathrm {mae}}}$ NN.

Fig. 3. Pulse example assuming ${R_t}=3$ fs and ${R_\nu }=80$ meV with error near the median score: $\mathcal {L_{\mathrm {mae}}}$ NN error = 0.16 (median 0.16) and PINN error = 0.52 (median 0.51). Top row shows the power and spectral power features used as inputs to the model (black dotted line), as well as the result of applying the forward model to the predicted fields (blue and orange). Middle and bottom rows show the ground truth and predictions in real/imaginary representation (middle row) or amplitude and phase (bottom row). The phases have no meaning at the edges of the pulse where the amplitude goes to zero. For visualization purposes, we run an optimization on the prediction to find the solution on the predicted manifold that is closest to the label; this would not be necessary during inference.

Download Full Size | PDF

Fig. 4. Error distribution for model with ${R_t}=3$ fs and ${R_\nu }=80$ meV. (A) Comparison of errors for the $\mathcal {L_{\mathrm {mae}}}$ and PINN models. Selecting shots with $\mathcal {L}_{\boldsymbol {\Phi }}$ below a threshold (red points) reduces prediction errors during inference. (b) $\mathcal {L_{\mathrm {mae}}}$ NN, field error as a function of the full-width pulse length shows a slight increase at longer pulses. Below, errors are shown separately for the amplitude (left) and phase (right). Black stars denote reconstruction errors for the pulse in Fig. 3.

Download Full Size | PDF

Fig. 5. Example plots of reconstructions using the $\mathcal {L_{\mathrm {mae}}}$ NN, with field errors of 0.11 for row 1 (median score for 5% cut on forward model error), 0.31 for row 2 (worst case for 5% cut on forward model error), and 0.55 for row 3 (99$^{\mathrm {th}}$ percentile worst score over all). All plots use ${R_t}=3$ fs and ${R_\nu }=80$ meV.

Download Full Size | PDF

Fig. 6. Example pulse reconstructions using the PINN, with field errors of 0.14 for row 1 (median score for ${R_t}={R_\nu }=0$), 0.28 for row 2 (median score for 5% cut on forward model error with ${R_t}=3$ fs, ${R_\nu }=80$ meV), and 1.03 for row 3 (90$^{\mathrm {th}}$ percentile worst score over all with ${R_t}=3$ fs, ${R_\nu }=80$ meV).

Download Full Size | PDF

Table 1. Summary of Performance for Different Values of Temporal and Spectral Resolution^a

View Table | View all tables in this article

The PINN’s predicted fields have larger errors than the $\mathcal {L_{\mathrm {mae}}}$ NN’s, but in many examples the corresponding forward-model errors are approximately the same. This observation could indicate inherent phase ambiguity, which may be present for fields with a high degree of symmetry. However, for SASE FEL pulses with at least a few spikes, the phase errors appear dominated by the poor time-resolution of the XTCAV diagnostic. As a check, we trained a PINN under the assumption of perfect XTCAV resolution (see final line Table 1). We find field errors decrease by a factor of 3.7 and phase errors decrease by a factor of 3.3 compared to the standard 3 fs case, confirming the phase reconstruction errors in Figs. 3 and 4 are dominated by diagnostic resolution rather than inherent ambiguities in the phase. (See Figs. 6(a) and (b) for an example pulse with median error.) Indeed in this idealized case, the PINN nearly matches the $\mathcal {L_{\mathrm {mae}}}$ NN performance. Even though the PINN is not able to fully resolve features at the time-scale of single spikes with realistic diagnostics, the predictions still provide more information about the amplitude and phase than is currently available to users.

With any machine learning algorithm, it is important to understand when the method fails. The PINN has the advantage that it is trained by the same metric as used in [30], so a user can easily compare the results through the residuals of the predicted inputs. On the other hand, in Fig. 4 the $\mathcal {L_{\mathrm {mae}}}$ NN’s field errors are correlated to $\mathcal {L}_{\boldsymbol {\Phi }}$, which can be calculated even for test data. We can use this correlation to throw away points that are more likely to have large field errors. For LCLS-II, when an experiment can collect a million shots per second, it may be feasible to throw away questionable data aggressively. For example, a cut retaining just 5% of the data reduces the $\mathcal {L_{\mathrm {mae}}}$ NN’s median field label error by 30% and the maximum error by more than a factor of three (see red points in Fig. 4). Similarly for the PINN, a 5% cut reduces the median field label error by 40% and the maximum field error by a factor of 2. Future work could also estimate the uncertainty of the networks themselves on a shot-by-shot basis, for example using Bayesian dropout during inference [50].

While the iterative method used in Refs. [29,30] does not provide estimates of the phases, we can compare the NN and iterative performance on the field amplitudes. The NN approaches achieve higher accuracy than the iterative method, with the $\mathcal {L_{\mathrm {mae}}}$ NN’s median amplitude errors more than a factor of 3 smaller, and the PINN’s errors 20% smaller than the iterative method. As expected, the NN is also many orders of magnitude faster, capable of processing millions of shots in a second (see Appendix, Section C). It may also be possible to combine the NN and iterative methods, using the NN as a starting guess for the iterative process; this should allow faster convergence than starting from a random guess, while providing strictly equal or better performance compared to the NN. Similar ideas have been implemented in other phase retrieval problems [35,51].

4. Conclusion and future outlook

We have presented a method to recover the full field, phase and amplitude, shot-by-shot for XFEL pulses, opening a path towards coherent-control experiments. We describe two possible architectures, one trained on standard labeled data, and one trained solely using a physical model, allowing that version to be learned directly from unlabeled experimental data. Both methods recover the amplitude with higher accuracy–by a factor of 3 for the labeled NN–than the existing iterative method. With sufficient diagnostic resolution, the proposed NNs also have the ability to recover the phase, which has yet to be demonstrated for multi-spike SASE XFEL pulses by any method. Finally, the NNs have potential to run in real time for MHz repetition rate machines.

Additional applications of the concept may be found for advanced operating modes, for example to resolve microbunching-instability sidebands in seeded FELs [52–56], which can then be accounted for during data analysis [57]. The examples given in this work are for soft X-ray parameters, and though there is no hard limit, the performance is expected to degrade as the number of modes increases and the XTCAV resolution decreases at shorter X-ray wavelengths. On the other hand, seeded FELs have long coherence lengths, and an XTCAV likely would have sufficient resolution for application to hard X-ray self-seeding [58–61].

One advantage of the NN approach is the computational efficiency, which makes it well-suited to high repetition-rate XFELs. However, LCLS time-domain measurements currently rely on an XTCAV which runs at only 120 Hz, limited in rate both by the need for strong X-band streaking and the damage threshold of the electron YAG screen. Taking full advantage of the NN speed will require development of new diagnostics. For example, replacing the X-band cavity with a passive dechirper would avoid the need for a high-rate, high-voltage X-band source [62]. Alternatively, bending radiation can reveal microbunching imprinted on the beam by the FEL process [63], avoiding need for an electron screen. As described in the introduction, direct measurement of X-rays is possible for short pulses [12–16]. Nearly all of these methods could also benefit from including the diagnostics analysis itself as part of the machine learning pipeline; for example, the NN could take XTCAV image data directly as input and treat the XTCAV analysis as part of the forward model [19], or for attosecond pulses the NN could take as inputs the X-ray diagnostics [15,16]. Ultimately, verification of any XFEL phase prediction will require development of phase-sensitive experiments.

Appendix A. Additional examples of reconstructions

To give a sense of the quality of reconstructions associated with various levels of field errors, we show additional examples using both models. Figure 5 shows three additional examples using the $\mathcal {L_{\mathrm {mae}}}$ NN: Row 1 (field error = 0.11) and row 2 (error = 0.31) give the median and worst case values respectively for the 5% of shots with the lowest forward model error [i.e., the red points in Fig. 4(a)]. Row 2 is also equivalent to the 90$^{\mathrm {th}}$ percentile worst case for all pulses. Row 3 (error = 0.6) is the 99$^{\mathrm {th}}$ percentile worst case for all pulses.

Figure 6 shows three examples using the PINN: Row 1 (error = 0.14) is the median value for a model trained with perfect diagnostic resolution (${R_t}={R_\nu } = 0$). Row 2 (error = 0.3) is the median error for the top 5% of shots with the lowest forward model error in Fig. 4(a). Row 3 (error = 1.0) is the 90$^{\mathrm {th}}$ percentile worst case for all pulses. The high-quality predictions for the model trained with perfect diagnostics [Table 1 and Fig. 6(b)] suggest that the phase errors in models with realistic diagnostic resolution [e.g., Fig. 6(f)] are driven by the resolution and not inherent phase ambiguity.

Appendix B. Neural network details

The models presented in the paper use the same inputs and outputs, as well as the same architecture, with the only differences in the loss function and regularization. Each input consists of the time-domain power ($n_p=180$) concatenated with the spectral power ($n_s=200$), for a total length of 380 points per example. The outputs consist of the real and imaginary components of field in the time domain, with a total 200 points ($p=100$) for each example. To improve the spectral resolution we zero-pad the simulated time-domain field to make the time window 5-fold larger. The networks contain four convolutional layers, each with 20 filters and a stride of two, and nine fully-connected layers, each with 200 neurons. All activations are rectified linear units (ReLU) except for the output layer, which has linear activation. The losses described in the main text (including the forward model) are implemented as custom loss functions written in PyTorch. See Table 2 for architecture details.

Table 2. Description of the NN Architecture Used for all Models (Given by the PyTorch ‘Summary’ Function)^a

View Table | View all tables in this article

To reduce over-fitting and match typical experimental noise, both inputs include a Gaussian noise layer of 2% rms noise. The standard NN also uses dropout of 0.05 on the first fully-connected layer. The PINN did not benefit from additional regularization. The training consisted of 3000 epochs with a mini-batch size of 1000 and learning rate of $3\times 10^{-4}$, followed by 3000 epochs with a mini-batch size of 5000 and learning rate of $1 \times 10^{-4}$. All training used the Adam optimizer with default PyTorch settings for beta of 0.9 and 0.999. We did only rough optimization of hyper-parameters (regularization, network width and depth, mini-batch size), and improved performance should be possible from additional optimization. The PINN loss improved by 10% after doubling samples from 50k to 100k, suggesting additional gains are possible from a larger test set.

For the PINN, we apply additional constraints on the coherence length ($g_1$) and smoothness of the solutions. The smoothness constraint penalizes the gradient of the prediction,

(9)$$\mathcal{L}_s = \frac{1}{2np} \sum_i^{n} \sum_t^{p} {\bigg|} \boldsymbol{y}^{i}(t) - \boldsymbol{y}^{i}(t-1) {\bigg|}^{2} ,$$

where $t$ is the time index and the sum is over all $n$ examples in the minibatch. The total loss function for the PINN is then

(10)$$\mathcal{L}_{\mathrm{PINN}} = \mathcal{L}_{\boldsymbol{\Phi}} + \lambda_{g1} \mathcal{L}_{g1} + \lambda_s \mathcal{L}_s \,.$$

The PINN results in the paper use $\lambda _{g1}=0.1$ and $\lambda _s=1$ for the first 3000 epochs, and either $\lambda _s=0.1$ for the final 3000 epochs or $\lambda _s=0.01$ (for the perfect resolution case). As with other hyperparameters, we expect performance gains are possible with careful optimization. For the combined model we use

(11)$$\mathcal{L}_{\mathrm{comb}} = (1-\lambda_{\mathrm{amp}}) \mathcal{L}_{\boldsymbol{\Phi}} + \lambda_{g1} \mathcal{L}_{g1} + \lambda_s \mathcal{L}_s + \lambda_{\mathrm{amp}}\mathcal{L}_{\mathrm{amp}} \,.$$

Using $\lambda _{\mathrm {mae}}=0.5$, we find the combined model has smaller errors on amplitudes (1.0 compared to 1.3 amplitude error), but worse performance on phase (2.1 compared to 0.8 sine error). The combined model outperforms the PINN on both metrics, but loses the benefit of unlabeled training.

Appendix C. Computation time comparison

The computation speed was calculated from the total time required for processing a batch of 10,000 examples on an Intel E5-2640 v3 CPU. The iterative approach required on average 20 sec/pulse, while the NN models required on average 20 $\mathrm{\mu}$sec/pulse for inference plus 30 $\mathrm{\mu}$sec/pulse data load time, nearly six orders of magnitude faster. Furthermore, a principal advantage of NNs is the ability to run efficiently on GPUs, and running on a GeForce RTX 2080 we found an average inference time of just 0.2 $\mathrm{\mu}$sec/pulse. While we expect the iterative method could be sped-up through both algorithm and hardware improvements, we do not see a path to handling the full beam rate of a MHz FEL with iterative solvers.

Funding

U.S. Department of Energy (DE-AC02-76SF00515); Office of Science; Basic Energy Sciences.

Acknowledgments

We would like to thank Andre Al Haddad, T.J. Lane, and Gabriel Marcus for helpful discussions and assistance with simulations. This work was supported by the U.S. Department of Energy, under DOE Contract No. DE-AC02-76SF00515 and the Office of Science, Office of Basic Energy Sciences.

Disclosures

The authors declare no conflicts of interest.

Data availability

Data underlying the results presented in this paper are not publicly available at this time but may be obtained from the authors upon reasonable request.

References

1. P. Emma, R. Akre, J. Arthur, R. Bionta, C. Bostedt, J. Bozek, A. Brachmann, P. Bucksbaum, R. Coffee, F.-J. Decker, Y. Ding, D. Dowell, S. Edstrom, A. Fisher, J. Frisch, S. Gilevich, J. Hastings, G. Hays, P. Hering, Z. Huang, R. Iverson, H. Loos, M. Messerschmidt, A. Miahnahri, S. Moeller, H.-D. Nuhn, G. Pile, D. Ratner, J. Rzepiela, D. Schultz, T. Smith, P. Stefan, H. Tompkins, J. Turner, J. Welch, W. White, J. Wu, G. Yocky, and J. Galayda, “First lasing and operation of an angstrom-wavelength free-electron laser,” Nat. Photonics 4(9), 641–647 (2010). [CrossRef]

2. D. Ratner, J. Cryan, T. Lane, S. Li, and G. Stupakov, “Pump-probe ghost imaging with sase fels,” Phys. Rev. X 9(1), 011045 (2019). [CrossRef]

3. J. D. Biggs, Y. Zhang, D. Healion, and S. Mukamel, “Two-dimensional stimulated resonance raman spectroscopy of molecules with broadband x-ray pulses,” J. Chem. Phys. 136(17), 174117 (2012). [CrossRef]

4. J. D. Biggs, Y. Zhang, D. Healion, and S. Mukamel, “Watching energy transfer in metalloporphyrin heterodimers using stimulated x-ray raman spectroscopy,” Proc. Natl. Acad. Sci. 110(39), 15597–15601 (2013). [CrossRef]

5. S. Mukamel, D. Healion, Y. Zhang, and J. D. Biggs, “Multidimensional attosecond resonant x-ray spectroscopy of molecules: Lessons from the optical regime,” Annu. Rev. Phys. Chem. 64(1), 101–127 (2013). [CrossRef]

6. I. V. Schweigert and S. Mukamel, “Coherent ultrafast core-hole correlation spectroscopy: X-ray analogues of multidimensional nmr,” Phys. Rev. Lett. 99(16), 163001 (2007). [CrossRef]

7. C. Weninger, M. Purvis, D. Ryan, R. A. London, J. D. Bozek, C. Bostedt, A. Graf, G. Brown, J. J. Rocca, and N. Rohringer, “Stimulated electronic x-ray raman scattering,” Phys. Rev. Lett. 111(23), 233902 (2013). [CrossRef]

8. J. T. O’Neal, E. G. Champenois, S. Oberli, R. Obaid, A. Al-Haddad, J. Barnard, N. Berrah, R. Coffee, J. Duris, G. Galinis, D. Garratt, J. M. Glownia, D. Haxton, P. Ho, S. Li, X. Li, J. MacArthur, J. P. Marangos, A. Natan, N. Shivaram, D. S. Slaughter, P. Walter, S. Wandel, L. Young, C. Bostedt, P. H. Bucksbaum, A. Picón, A. Marinelli, and J. P. Cryan, “Electronic population transfer via impulsive stimulated x-ray raman scattering with attosecond soft-x-ray pulses,” Phys. Rev. Lett. 125(7), 073203 (2020). [CrossRef]

9. U. Eichmann, H. Rottke, S. Meise, J.-E. Rubensson, J. Söderström, M. Agåker, C. Såthe, M. Meyer, T. M. Baumann, R. Boll, A. De Fanis, P. Grychtol, M. Ilchen, T. Mazza, J. Montano, V. Music, Y. Ovcharenko, D. E. Rivas, S. Serkez, R. Wagner, and S. Eisebitt, “Photon-recoil imaging: Expanding the view of nonlinear x-ray physics,” Science 369(6511), 1630–1633 (2020). [CrossRef]

10. I. V. Schweigert and S. Mukamel, “Probing interactions between core-electron transitions by ultrafast two-dimensional x-ray coherent correlation spectroscopy,” J. Chem. Phys. 128(18), 184307 (2008). [CrossRef]

11. W. K. Peters, T. Jones, A. Efimov, E. Pedersoli, L. Foglia, R. Mincigrucci, I. Nikolov, R. Trebino, M. B. Danailov, F. Capotondi, F. Bencivenga, and P. Bowlan, “All-optical single-shot complete electric field measurement of extreme ultraviolet free electron laser pulses,” Optica 8(4), 545–550 (2021). [CrossRef]

12. U. Frühling, M. Wieland, M. Gensch, T. Gebert, B. Schütte, M. Krikunova, R. Kalms, F. Budzyn, O. Grimm, J. Rossbach, E. Plönjes, and M. Drescher, “Single-shot terahertz-field-driven x-ray streak camera,” Nat. Photonics 3(9), 523–528 (2009). [CrossRef]

13. I. Grguraš, A. R. Maier, C. Behrens, T. Mazza, T. J. Kelly, P. Radcliffe, S. Düsterer, A. K. Kazansky, N. M. Kabachnik, T. Tschentscher, J. T. Costello, M. Meyer, M. C. Hoffmann, H. Schlarb, and A. L. Cavalieri, “Ultrafast x-ray pulse characterization at free-electron lasers,” Nat. Photonics 6(12), 852–857 (2012). [CrossRef]

14. S. Li, E. G. Champenois, R. Coffee, Z. Guo, K. Hegazy, A. Kamalov, A. Natan, J. O’Neal, T. Osipov, M. Owens, D. Ray, D. Rich, P. Walter, A. Marinelli, and J. P. Cryan, “A co-axial velocity map imaging spectrometer for electrons,” AIP Adv. 8(11), 115308 (2018). [CrossRef]

15. N. Hartmann, G. Hartmann, R. Heider, M. S. Wagner, M. Ilchen, J. Buck, A. O. Lindahl, C. Benko, J. Grünert, J. Krzywinski, J. Liu, A. A. Lutman, A. Marinelli, T. Maxwell, A. A. Miahnahri, S. P. Moeller, M. Planas, J. Robinson, A. K. Kazansky, N. M. Kabachnik, J. Viefhaus, T. Feurer, R. Kienberger, R. N. Coffee, and W. Helml, “Attosecond time–energy structure of x-ray free-electron laser pulses,” Nat. Photonics 12(4), 215–220 (2018). [CrossRef]

16. S. Li, Z. Guo, R. N. Coffee, K. Hegazy, Z. Huang, A. Natan, T. Osipov, D. Ray, A. Marinelli, and J. P. Cryan, “Characterizing isolated attosecond pulses with angular streaking,” Opt. Express 26(4), 4531–4547 (2018). [CrossRef]

17. Y. Ding, C. Behrens, P. Emma, J. Frisch, Z. Huang, H. Loos, P. Krejcik, and M. Wang, “Femtosecond x-ray pulse temporal characterization in free-electron lasers using a transverse deflector,” Phys. Rev. Accel. Beams 14(12), 120701 (2011). [CrossRef]

18. C. Behrens, F.-J. Decker, Y. Ding, V. A. Dolgashev, J. Frisch, Z. Huang, P. Krejcik, H. Loos, A. Lutman, T. J. Maxwell, J. Turner, J. Wang, M.-H. Wang, J. Welch, and J. Wu, “Few-femtosecond time-resolved measurements of x-ray free-electron lasers,” Nat. Commun. 5(1), 3762 (2014). [CrossRef]

19. X. Ren, A. Edelen, A. Lutman, G. Marcus, T. Maxwell, and D. Ratner, “Temporal power reconstruction for an x-ray free-electron laser using convolutional neural networks,” Phys. Rev. Accel. Beams 23(4), 040701 (2020). [CrossRef]

20. P. Heimann, O. Krupin, W. F. Schlotter, J. Turner, J. Krzywinski, F. Sorgenfrei, M. Messerschmidt, D. Bernstein, J. Chalupský, V. Hájková, S. Hau-Riege, M. Holmes, L. Juha, N. Kelez, J. Lüning, D. Nordlund, M. Fernandez Perea, A. Scherz, R. Soufli, W. Wurth, and M. Rowen, “Linac coherent light source soft x-ray materials science instrument optical design and monochromator commissioning,” Rev. Sci. Instrum. 82(9), 093104 (2011). [CrossRef]

21. D. Zhu, M. Cammarata, J. M. Feldkamp, D. M. Fritz, J. B. Hastings, S. Lee, H. T. Lemke, A. Robert, J. L. Turner, and Y. Feng, “A single-shot transmissive spectrometer for hard x-ray free electron lasers,” Appl. Phys. Lett. 101(3), 034103 (2012). [CrossRef]

22. K. W. DeLong, R. Trebino, J. Hunter, and W. E. White, “Frequency-resolved optical gating with the use of second-harmonic generation,” J. Opt. Soc. Am. B 11(11), 2206–2215 (1994). [CrossRef]

23. C. Iaconis and I. A. Walmsey, “Spectral phase interferometry for direct electric field reconstruction of ultrashort optical pulses,” Opt. Lett. 23(10), 792–794 (1998). [CrossRef]

24. Y. Mairesse and F. Quéré, “Frequency-resolved optical gating for complete reconstruction of attosecond bursts,” Phys. Rev. A 71(1), 011401 (2005). [CrossRef]

25. O. Raz, B. Leshem, J. Miao, B. Nadler, D. Oron, and N. Dudovich, “Direct phase retrieval in double blind fourier holography,” Opt. Express 22(21), 24935–24950 (2014). [CrossRef]

26. P. Sidorenko, O. Lahav, Z. Avnat, and O. Cohen, “Ptychographic reconstruction algorithm for frequency-resolved optical gating: super-resolution and supreme robustness,” Optica 3(12), 1320–1330 (2016). [CrossRef]

27. T. Gaumnitz, A. Jain, and H. J. Wörner, “Complete reconstruction of ultra-broadband isolated attosecond pulses including partial averaging over the angular distribution,” Opt. Express 26(11), 14719–14740 (2018). [CrossRef]

28. T. Schweizer, M. H. Brügmann, W. Helml, N. Hartmann, R. Coffee, and T. Feurer, “Attoclock ptychography,” Appl. Sci. 8(7), 1039 (2018). [CrossRef]

29. F. Christie, Y. Ding, Z. Huang, V. A. Jhalani, J. Krzywinski, A. A. Lutman, T. J. Maxwell, D. Ratner, J. Rönsch-Schulenburg, and M. Vogt, “Temporal X-ray reconstruction using temporal and spectral measurements,” Sci. Rep. 10(1), 9799 (2020). [CrossRef]

30. F. Christie, A. A. Lutman, Y. Ding, Z. Huang, V. A. Jhalani, J. Krzywinski, T. J. Maxwell, D. Ratner, J. Rönsch-Schulenburg, and M. Vogt, “Temporal x-ray reconstruction using temporal and spectral measurements at lcls,” J. Physics: Conf. Ser. 1067, 032011 (2020). [CrossRef]

31. J.-H. Chung and A. M. Weiner, “Ambiguity of ultrashort pulse shapes retrieved from the intensity autocorrelation and the power spectrum,” IEEE J. Sel. Top. Quantum Electron. 7(4), 656–666 (2001). [CrossRef]

32. Y. D. Hezaveh, L. P. Levasseur, and P. J. Marshall, “Fast automated analysis of strong gravitational lenses with convolutional neural networks,” Nature 548(7669), 555–557 (2017). [CrossRef]

33. M. Cherukara, Y. Nashed, and R. Harder, “Real-time coherent diffraction inversion using deep generative networks,” Sci. Rep. 8(1), 16520 (2018). [CrossRef]

34. M. J. Cherukara, T. Zhou, Y. Nashed, P. Enfedaque, A. Hexemer, R. J. Harder, and M. V. Holt, “Ai-enabled high-resolution scanning coherent diffraction imaging,” Appl. Phys. Lett. 117(4), 044103 (2020). [CrossRef]

35. H. Chan, Y. S. G. Nashed, S. Kandel, S. Hruszkewycz, S. Sankaranarayanan, R. J. Harder, and M. J. Cherukara, “Real-time 3d nanoscale coherent imaging via physics-aware deep learning,” (2020).

36. Y. Zhang, M. A. Noack, P. Vagovic, K. Fezzaa, F. Garcia-Moreno, T. Ritschel, and P. Villanueva-Perez, “Phasegan: A deep-learning phase-retrieval approach for unpaired datasets,” (2020).

37. A. L. Edelen, J. P. Edelen, S. G. Biedron, S. V. Milton, and P. J. van der Slot, “Using neural network control policies for rapid switching between beam parameters in a free electron laser,” in Workshop on Deep Learning for Physical Sciences (DLPS 2017), NeurIPS 2017, (Long Beach, CA, USA, 2017).

38. M. Raissi, P. Perdikaris, and G. E. Karniadakis, “Physics-informed neural networks: A deep learning framework for solving forward and inverse problems involving nonlinear partial differential equations,” J. Comput. Phys. 378, 686–707 (2019). [CrossRef]

39. I. Goodfellow, J. Pouget-Abadie, M. Mirza, B. Xu, D. Warde-Farley, S. Ozair, A. Courville, and Y. Bengio, “Generative adversarial networks,” Proceedings of the International Conference on Neural Information Processing Systems (NeurIPS 2014) p. 2672 (2014).

40. M. Mirza and S. Osindero, “Conditional generative adversarial nets,” arXiv:1411.1784 [cs.LG] (2014).

41. Y. S. G. Nashed, S. Kandel, M. Du, and C. Jacobsen, “Learning phase retrieval with backpropagation,” Microsc. Microanal. 25(S2), 62–63 (2019). [CrossRef]

42. S. Kandel, S. Maddali, M. Allain, S. O. Hruszkewycz, C. Jacobsen, and Y. S. G. Nashed, “Using automatic differentiation as a general framework for ptychographic reconstruction,” Opt. Express 27(13), 18653–18672 (2019). [CrossRef]

43. M. Du, S. Kandel, J. Deng, X. Huang, A. Demortiere, T. T. Nguyen, R. Tucoulou, V. D. Andrade, Q. Jin, and C. Jacobsen, “Adorym: a multi-platform generic x-ray image reconstruction framework based on automatic differentiation,” Opt. Express 29(7), 10000–10035 (2021). [CrossRef]

44. A. G. Baydin, B. A. Pearlmutter, A. A. Radul, and J. M. Siskind, “Automatic differentiation in machine learning: a survey,” J. Machine Learning Res. 18, 1–43 (2018). [CrossRef]

45. J. Zhu, T. Park, P. Isola, and A. A. Efros, “Unpaired image-to-image translation using cycle-consistent adversarial networks,” in 2017 IEEE International Conference on Computer Vision (ICCV), (2017), pp. 2242–2251.

46. F. Tonolini, J. Radford, A. Turpin, D. Faccio, and R. Murray-Smith, “Variational inference for computational imaging inverse problems,” J. Machine Learning Res. 21, 1–46 (2020).

47. Z. Zhu, Y. Sun, J. White, Z. Chang, and S. Pang, “Signal retrieval with measurement system knowledge using variational generative model,” IEEE Access 8, 47963–47972 (2020). [CrossRef]

48. E. Saldin, E. Schneidmiller, and M. Yurkov, The physics of free electron lasers (Springer, 2000).

49. D. F. code used courtesy of Zhirong Huang.

50. Y. Gal and Z. Ghahramani, “Dropout as a bayesian approximation: Representing model uncertainty in deep learning,” Proceedings of The 33rd International Conference on Machine Learning48, 1050 (2016).

51. A. Scheinker and R. Pokharel, “Adaptive 3d convolutional neural network-based reconstruction method for 3d coherent diffraction imaging,” J. Appl. Phys. 128(18), 184901 (2020). [CrossRef]

52. D. Ratner, R. Abela, J. Amann, C. Behrens, D. Bohler, G. Bouchard, C. Bostedt, M. Boyes, K. Chow, D. Cocco, F. J. Decker, Y. Ding, C. Eckman, P. Emma, D. Fairley, Y. Feng, C. Field, U. Flechsig, G. Gassner, J. Hastings, P. Heimann, Z. Huang, N. Kelez, J. Krzywinski, H. Loos, A. Lutman, A. Marinelli, G. Marcus, T. Maxwell, P. Montanez, S. Moeller, D. Morton, H. D. Nuhn, N. Rodes, W. Schlotter, S. Serkez, T. Stevens, J. Turner, D. Walz, J. Welch, and J. Wu, “Experimental demonstration of a soft x-ray self-seeded free-electron laser,” Phys. Rev. Lett. 114(5), 054801 (2015). [CrossRef]

53. J. Bödewadt, S. Ackermann, R. Aßmann, N. Ekanayake, B. Faatz, G. Feng, I. Hartl, R. Ivanov, T. Laarmann, J. Mueller, T. Tanikawa, P. Amstutz, A. Azima, M. Drescher, L. Lazzarino, C. Lechner, T. Maltezopoulos, V. Miltchev, T. Plath, J. Roßbach, K. Hacker, S. Khan, and R. Molo, “Recent results from fel seeding at flash,” in Proceedings of IPAC2015, (Richmond, VA, USA, 2015), p. TUBC3.

54. P. Rebernik Ribič, A. Abrami, L. Badano, M. Bossi, H.-H. Braun, N. Bruchon, F. Capotondi, D. Castronovo, M. Cautero, P. Cinquegrana, M. Coreno, M. E. Couprie, I. Cudin, M. Boyanov Danailov, G. De Ninno, A. Demidovich, S. Di Mitri, B. Diviacco, W. M. Fawley, C. Feng, M. Ferianis, E. Ferrari, L. Foglia, F. Frassetto, G. Gaio, D. Garzella, A. Ghaith, F. Giacuzzo, L. Giannessi, V. Grattoni, S. Grulja, E. Hemsing, F. Iazzourene, G. Kurdi, M. Lonza, N. Mahne, M. Malvestuto, M. Manfredda, C. Masciovecchio, P. Miotti, N. S. Mirian, I. Petrov Nikolov, G. M. Penco, G. Penn, L. Poletto, M. Pop, E. Prat, E. Principi, L. Raimondi, S. Reiche, E. Roussel, R. Sauro, C. Scafuri, P. Sigalotti, S. Spampinati, C. Spezzani, L. Sturari, M. Svandrlik, T. Tanikawa, M. Trovó, M. Veronese, D. Vivoda, D. Xiang, M. Zaccaria, D. Zangrando, M. Zangrando, and E. M. Allaria, “Coherent soft x-ray pulses from an echo-enabled harmonic generation free-electron laser,” Nat. Photonics 13(8), 555–561 (2019). [CrossRef]

55. G. Marcus, W. M. Fawley, D. Bohler, Y. Ding, Y. Feng, E. Hemsing, Z. Huang, J. Krzywinski, A. Lutman, and D. Ratner, “Experimental observations of seed growth and accompanying pedestal contamination in a self-seeded, soft x-ray free-electron laser,” Phys. Rev. Accel. Beams 22(8), 080702 (2019). [CrossRef]

56. Z. Zhang, G. Marcus, E. Hemsing, W. M. Fawley, Z. Huang, and A. Lutman, “Statistical analysis of a self-seeded x-ray free-electron laser in the presence of the microbunching instability,” Phys. Rev. Accel. Beams 23(1), 010704 (2020). [CrossRef]

57. T. J. Lane and D. Ratner, “What are the advantages of ghost imaging? multiplexing for x-ray and electron imaging,” Opt. Express 28(5), 5898–5918 (2020). [CrossRef]

58. J. Amann, W. Berg, V. Blank, F. J. Decker, Y. Ding, P. Emma, Y. Feng, J. Frisch, D. Fritz, J. Hastings, Z. Huang, J. Krzywinski, R. Lindberg, H. Loos, A. Lutman, H. D. Nuhn, D. Ratner, J. Rzepiela, D. Shu, Y. Shvyd’ko, S. Spampinati, S. Stoupin, S. Terentyev, E. Trakhtenberg, D. Walz, J. Welch, J. Wu, A. Zholents, and D. Zhu, “Demonstration of self-seeding in a hard-x-ray free-electron laser,” Nat. Photonics 6(10), 693–698 (2012). [CrossRef]

59. I. Inoue, T. Osaka, T. Hara, T. Tanaka, T. Inagaki, T. Fukui, S. Goto, Y. Inubushi, H. Kimura, R. Kinjo, H. Ohashi, K. Togawa, K. Tono, M. Yamaga, H. Tanaka, T. Ishikawa, and M. Yabashi, “Generation of narrow-band x-ray free-electron laser via reflection self-seeding,” Nat. Photonics 13(5), 319–322 (2019). [CrossRef]

60. C.-K. Min, I. Nam, H. Yang, G. Kim, C. H. Shim, J. H. Ko, M.-H. Cho, H. Heo, B. Oh, Y. J. Suh, M. J. Kim, D. Na, C. Kim, Y. Kim, S. H. Chun, J. H. Lee, J. Kim, S. Kim, I. Eom, S. N. Kim, T.-Y. Koo, S. Rah, Y. Shvyd’ko, D. Shu, K.-J. Kim, S. Terentyev, V. Blank, and H.-S. Kang, “Hard X-ray self-seeding commissioning at PAL-XFEL,” J. Synchrotron Radiat. 26(4), 1101–1109 (2019). [CrossRef]

61. I. Nam, C.-K. Min, B. Oh, G. Kim, D. Na, Y. J. Suh, H. Yang, M. H. Cho, C. Kim, M.-J. Kim, C. H. Shim, J. H. Ko, H. Heo, J. Park, J. Kim, S. Park, G. Park, S. Kim, S. H. Chun, H. Hyun, J. H. Lee, K. S. Kim, I. Eom, S. Rah, D. Shu, K.-J. Kim, S. Terentyev, V. Blank, Y. Shvyd’ko, S. J. Lee, and H.-S. Kang, “High-brightness self-seeded x-ray free-electron laser covering the 3.5 kev to 14.6 kev range,” Nat. Photonics 15(6), 435–441 (2021). [CrossRef]

62. A. A. Lutman, T. J. Maxwell, J. P. MacArthur, M. W. Guetg, N. Berrah, R. N. Coffee, Y. Ding, Z. Huang, A. Marinelli, S. Moeller, and J. C. U. Zemella, “Fresh-slice multicolour x-ray free-electron lasers,” Nat. Photonics 10(11), 745–750 (2016). [CrossRef]

63. E. L. Saldin, E. A. Schneidmiller, and M. V. Yurkov, “Optical afterburner for an x-ray free electron laser as a tool for pump-probe experiments,” Phys. Rev. Spec. Top.--Accel. Beams 13(3), 030701 (2010). [CrossRef]

$R_{t}$	$R_{ν}$	Field err $L_{m a e} / L_{Φ}$	Amplitude err $L_{m a e} / L_{Φ}$	Phase err $L_{m a e} / L_{Φ}$
1.5 fs	80 meV	0.13/0.40	0.90/2.66	0.69/2.14
3.0 fs	80 meV	0.16/0.51	1.27/3.71	0.79/ 2.75
5.0 fs	80 meV	0.18/0.60	1.49/4.90	0.86/2.91
0.6 fs	400 meV	0.14/0.46	0.76/1.62	0.82/3.04
0.0 fs	0 meV	0.10/0.14	0.65/0.65	0.57/0.84

Layer	Output Shape	Parameters
Conv1d	[-1, 20, 366]	320
Conv1d	[-1, 20, 176]	6,020
Conv1d	[-1, 20, 81]	6,020
Conv1d	[-1, 20, 34]	6,020
Flatten	[-1, 680]	0
Dense	[-1, 200]	136,200
Dropout	[-1, 200]	0
Dense	[-1, 200]	40,200
Dense	[-1, 200]	40,200
Dense	[-1, 200]	40,200
Dense	[-1, 200]	40,200
Dense	[-1, 200]	40,200
Dense	[-1, 200]	40,200
Dense	[-1, 200]	40,200
Dense	[-1, 200]	40,200

$R_{t}$	$R_{ν}$	Field err $L_{m a e} / L_{Φ}$	Amplitude err $L_{m a e} / L_{Φ}$	Phase err $L_{m a e} / L_{Φ}$
1.5 fs	80 meV	0.13/0.40	0.90/2.66	0.69/2.14
3.0 fs	80 meV	0.16/0.51	1.27/3.71	0.79/ 2.75
5.0 fs	80 meV	0.18/0.60	1.49/4.90	0.86/2.91
0.6 fs	400 meV	0.14/0.46	0.76/1.62	0.82/3.04
0.0 fs	0 meV	0.10/0.14	0.65/0.65	0.57/0.84

Layer	Output Shape	Parameters
Conv1d	[-1, 20, 366]	320
Conv1d	[-1, 20, 176]	6,020
Conv1d	[-1, 20, 81]	6,020
Conv1d	[-1, 20, 34]	6,020
Flatten	[-1, 680]	0
Dense	[-1, 200]	136,200
Dropout	[-1, 200]	0
Dense	[-1, 200]	40,200
Dense	[-1, 200]	40,200
Dense	[-1, 200]	40,200
Dense	[-1, 200]	40,200
Dense	[-1, 200]	40,200
Dense	[-1, 200]	40,200
Dense	[-1, 200]	40,200
Dense	[-1, 200]	40,200

Recovering the phase and amplitude of X-ray FEL pulses using neural networks and differentiable models

Abstract

1. Motivation

2. Pulse reconstruction task and proposed method

3. Results and discussion

4. Conclusion and future outlook

Appendix A. Additional examples of reconstructions

Appendix B. Neural network details

Appendix C. Computation time comparison

Funding

Acknowledgments

Disclosures

Data availability

References

Data availability

Cited By

Figures (6)

Tables (2)

Equations (11)

Optics Express