Automated machine learning for secure key rate in discrete-modulated continuous-variable quantum key distribution


Abstract

Continuous-variable quantum key distribution (CV QKD) with discrete modulation has attracted increasing attention due to its experimental simplicity, lower-cost implementation and compatibility with classical optical communication. Correspondingly, novel numerical methods have been proposed to analyze the security of these protocols against collective attacks, which promotes key rates over one hundred kilometers of fiber distance. However, numerical methods are limited by their calculation time and resource consumption, which prevents them from playing a larger role on mobile platforms in quantum networks. To address this issue, a neural network model that predicts key rates in nearly real time was proposed previously. Here, we go further and present a neural network model combined with Bayesian optimization. This model automatically designs the best architecture of the neural network computing key rates in real time. We demonstrate our model with two variants of CV QKD protocols with quaternary modulation. The results show high reliability, with a secure probability as high as 99.15%–99.59%, considerable tightness and high efficiency, with a speedup of approximately $10^{7}$ in both cases. This inspiring model enables more automatic and efficient real-time computation of the key rates of unstructured quantum key distribution protocols, meeting the growing need to implement QKD protocols on moving platforms.

© 2022 Optica Publishing Group under the terms of the Optica Open Access Publishing Agreement

1. Introduction

In recent decades, machine learning (ML) has achieved impressive breakthroughs that deeply impact both industry and academia, including autonomous driving [1,2], natural language processing [3,4], protein structure prediction [5] and even proving mathematical conjectures [6]. ML aims to recognize patterns in data, especially multidimensional data, and generalize them to new instances, which helps automate tasks and reveals hidden patterns beyond human intuition. This modern information-processing technology also benefits the solution of intractable quantum tasks, since quantum tasks are usually counterintuitive and involve high dimensions. Several significant advances have been made by applying ML to quantum physics, from classifying quantum states [7–9] and quantum control [10–12] to quantum metrology [13].

Quantum key distribution (QKD) enables unconditional security between two legitimate users (Alice and Bob) against any eavesdropper, called Eve [14,15], guaranteed by the laws of quantum mechanics [16]. According to the detection method, QKD is currently divided into two categories: discrete-variable (DV) QKD [17,18] and continuous-variable (CV) QKD [19–22]. Of the two, CV QKD offers a higher secret key rate and excellent compatibility with standard communication components [23–25], which makes it competitive at metropolitan distances [26]. To enhance the practicality of CV QKD, several works have introduced machine-learning methodologies to the CV QKD area, such as developing a novel CV QKD scheme [27,28], parameter prediction [29] and detecting quantum attacks [30].

CV QKD protocols with discrete modulation have attracted increasing attention. Their appealing advantages include easier experimental implementation and higher error-correction efficiencies, which promote CV QKD over longer distances [31–34]. These properties bring potential advantages for large-scale deployment in quantum-secured networks [35]. However, the security analysis of discrete-modulated CV QKD protocols is more complicated owing to the lack of symmetry [36]. Recently, novel numerical approaches [37,38] have been proposed to analyze the security of discrete-modulation protocols against collective attacks, where the key rate calculation involves minimizing a convex function over all eavesdropping attacks consistent with the experimental data. These numerical approaches achieve much higher key rates over significantly longer distances than previous security analyses. Based on them, a neural network model was presented to quickly predict the secure key rate of discrete-modulated CV QKD with high reliability (secure probability as high as $99.2\%$). This neural network model learns the mapping between input parameters and key rates from datasets generated by numerical methods, which supports the computation of secure key rates in real time [39]. However, the complexity of the mapping between input parameters and key rates depends on the complexity of solving for the key rates of discrete-modulated protocols through numerical approaches [40]. Selecting architectures and hyperparameters plays a critical role in the performance of a neural network. Therefore, to learn different mappings from different protocols, the architectures of neural networks and the corresponding hyperparameters must be adjusted carefully by humans, which comes at a great price [41].

Here, we propose a more flexible and automatic neural network model combined with Bayesian optimization [42], which maintains extremely high reliability and efficiency and reduces complicated manual adjustment. Our method is universal for a variety of unstructured QKD protocols that lack analytical tools and rely on numerical methods. We apply our model to two variants of discrete-modulated CV QKD protocols and acquire high secure key rates with considerable tightness in both cases. We then compare the time consumption of our model with the numerical method proposed in Ref. [37], which shows a great speedup of approximately $10^{7}$.

This paper is organized as follows. In Section 2, we introduce the numerical method for CV QKD with discrete modulation proposed in Ref. [37], on which we rely to collect the datasets used to train and test the model. In Section 3, we introduce the Bayesian optimization used in this paper in more detail. In Section 4, we describe our method. Section 5 demonstrates the main results, and Section 6 provides a discussion and concludes this paper.

2. Numerical method for CV QKD with discrete modulation

In this work, we apply the model to two discrete-modulated CV QKD protocols with different detection techniques to demonstrate its generalizability. One is the quadrature phase-shift-keying (QPSK) heterodyne detection protocol [37], and the other is an improved QPSK homodyne detection protocol [43]. To collect a dataset for training neural networks, we generate secure key rates for both protocols by applying the same numerical method [37,40]. In the following, we briefly introduce how computing key rates can be transformed into a convex optimization problem suitable for numerical solution. A more detailed description can be found in Ref. [37].

Here, we consider a CV QKD protocol with quaternary modulation that involves two parties: a sender Alice and a receiver Bob. In each of the $N$ rounds, Alice randomly prepares one of the four coherent states $\left|\alpha_{k}\right\rangle=\bigl||\alpha| e^{i(2 k \pi / 4+\pi / 4)}\bigr\rangle$, where $k \in \{0,1,2,3\}$, and sends it to Bob via an untrusted quantum channel. Then, Bob uses either homodyne or heterodyne detection to estimate $k$. The secret key rate under collective attacks in the asymptotic limit is given by the Devetak-Winter formula [44]:

$$R^{\infty}=p_{\text{pass }}[\min _{\rho \in \mathbf{S}} H(\mathbf{Z} \mid E)-\delta_{\mathrm{EC}}]$$
where $H\left(\mathbf{Z} \mid E\right)$ is the conditional von Neumann entropy, which describes the uncertainty of the string $\mathbf{Z}$ from Eve's view. Eve's maximal knowledge of Bob's string $\mathbf{Z}$ corresponds to the minimum uncertainty of $\mathbf{Z}$ over density matrices; therefore, we need to find the optimum $\rho^{*}$ in the feasible domain $\mathbf{S}$ that minimizes $H\left(\mathbf{Z} \mid E\right)$. Here, $p_{\text{pass}}$ is the sifting probability, and $\delta_{\mathrm{EC}}$ is the actual amount of information leakage per signal in the error-correction step. To turn this into a convex optimization problem, the above expression can be reformulated as
$$R^{\infty}=\min _{\rho_{A B} \in \mathrm{S}} D\left(\mathcal{G}\left(\rho_{A B}\right) \| \mathcal{Z}\left[\mathcal{G}\left(\rho_{A B}\right)\right]\right)-p_{\text{pass }} \delta_{\mathrm{EC}}$$
in which $D(\rho \| \sigma )=\operatorname {Tr}\left (\rho \log _{2} \rho \right )-\operatorname {Tr}\left (\rho \log _{2} \sigma \right )$. As shown in Ref. [38], $\mathcal {G}$ is a completely positive and trace nonincreasing map that describes the postprocessing of different quadratures. $\mathcal {Z}$ is a pinching quantum channel that reads out the key information.
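
For concreteness, the relative entropy $D(\rho \| \sigma)$ can be evaluated numerically by diagonalization. The following NumPy sketch is illustrative only (it is not the implementation of Refs. [37,38]) and assumes $\operatorname{supp}(\rho) \subseteq \operatorname{supp}(\sigma)$ so that the result is finite:

```python
import numpy as np

def quantum_relative_entropy(rho: np.ndarray, sigma: np.ndarray) -> float:
    """D(rho || sigma) = Tr(rho log2 rho) - Tr(rho log2 sigma).

    Assumes Hermitian density matrices with supp(rho) contained in
    supp(sigma); eigenvalues below `tol` are treated as zero (0 log 0 := 0).
    """
    tol = 1e-12
    w_r, v_r = np.linalg.eigh(rho)
    w_s, v_s = np.linalg.eigh(sigma)
    # Matrix log2 via eigendecomposition, zeroing the kernel contributions.
    log_r = v_r @ np.diag([np.log2(x) if x > tol else 0.0 for x in w_r]) @ v_r.conj().T
    log_s = v_s @ np.diag([np.log2(x) if x > tol else 0.0 for x in w_s]) @ v_s.conj().T
    return float(np.real(np.trace(rho @ (log_r - log_s))))
```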

Since the term $p_{\text{pass}} \delta_{\mathrm{EC}}$ in Eq. (2) is easy to compute, we need only consider the following relevant optimization problem:

$$\begin{aligned}
\mathrm{minimize} \quad & D\left(\mathcal{G}\left(\rho_{AB}\right) \| \mathcal{Z}\left[\mathcal{G}\left(\rho_{AB}\right)\right]\right) \\
\text{subject to} \quad & \mathrm{Tr}\left[\rho_{AB}\left(|k\rangle\langle k|_{A} \otimes \hat{q}\right)\right]=p_{k}\langle\hat{q}\rangle_{k} \\
& \mathrm{Tr}\left[\rho_{AB}\left(|k\rangle\langle k|_{A} \otimes \hat{p}\right)\right]=p_{k}\langle\hat{p}\rangle_{k} \\
& \mathrm{Tr}\left[\rho_{AB}\left(|k\rangle\langle k|_{A} \otimes \hat{n}\right)\right]=p_{k}\langle\hat{n}\rangle_{k} \\
& \mathrm{Tr}\left[\rho_{AB}\left(|k\rangle\langle k|_{A} \otimes \hat{d}\right)\right]=p_{k}\langle\hat{d}\rangle_{k} \\
& \mathrm{Tr}\left[\rho_{AB}\right]=1 \\
& \rho_{AB} \geq 0 \\
& \mathrm{Tr}_{B}\left[\rho_{AB}\right]=\sum_{i, j=0}^{3} \sqrt{p_{i} p_{j}}\left\langle\alpha_{j} \mid \alpha_{i}\right\rangle |i\rangle\langle j|_{A}
\end{aligned}$$
where $k \in \left\{0,1,2,3\right\}$, and $\langle\hat{q}\rangle_{k}$, $\langle\hat{p}\rangle_{k}$, $\langle\hat{n}\rangle_{k}$ and $\langle\hat{d}\rangle_{k}$ denote the expectation values of the corresponding operators when Bob measures states labeled by $k$. These expectation values can be obtained through homodyne or heterodyne measurements. The first four constraints come from experimental outcomes. The next two constraints are natural requirements since $\rho_{AB}$ is a density matrix. The last constraint, on the partial trace of system B, comes from the fact that the quantum channel cannot influence Alice's system A. After imposing the photon-number cutoff assumption [37,45], the density matrix and operators can be handled in a finite dimension $N_c$, and the problem can be solved numerically. Eventually, we solve this minimization problem by the numerical method proposed in Ref. [38]; the specific implementation used in our work can be found in Ref. [43]. This method involves two steps (a small illustration of the last constraint follows the list below):

1. Find a solution that is close to optimal, which gives an upper bound on the key rate.

2. Convert this upper bound to a lower bound on the key rate by considering its dual problem.
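
As an illustration of the last constraint of Eq. (3), the overlaps $\left\langle\alpha_{j} \mid \alpha_{i}\right\rangle$ and Alice's fixed reduced state can be computed directly in a truncated Fock basis. The following is a minimal sketch, assuming equal preparation probabilities $p_k = 1/4$ and example values for the amplitude and cutoff (both chosen here only for illustration):

```python
import numpy as np
from math import factorial

def coherent_state(alpha: complex, n_cutoff: int) -> np.ndarray:
    """|alpha> in the Fock basis truncated at n_cutoff photons."""
    n = np.arange(n_cutoff + 1)
    norms = np.sqrt(np.array([factorial(k) for k in n], dtype=float))
    return np.exp(-abs(alpha) ** 2 / 2) * alpha ** n / norms

amp, n_cutoff = 0.66, 12   # example signal amplitude and photon-number cutoff
# Four QPSK signal states |alpha_k> with phases 2k*pi/4 + pi/4.
states = [coherent_state(amp * np.exp(1j * (2 * k * np.pi / 4 + np.pi / 4)), n_cutoff)
          for k in range(4)]
p = np.full(4, 0.25)       # equal preparation probabilities

# Alice's reduced state from the last constraint:
# Tr_B[rho_AB] = sum_ij sqrt(p_i p_j) <alpha_j|alpha_i> |i><j|_A.
rho_A = np.array([[np.sqrt(p[i] * p[j]) * np.vdot(states[j], states[i])
                   for j in range(4)] for i in range(4)])
print(np.trace(rho_A).real)  # ~1, since the truncated states are near-normalized
```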

3. Bayesian optimization

In this section, we present a brief introduction to Bayesian optimization. Bayesian optimization is a powerful strategy for global optimization of objective functions that are expensive to evaluate [42,46]. This method is gaining great popularity in hyperparameter optimization. In particular, hyperparameter optimization in machine learning can be represented as follows:

$$ x^{{\star}}=\arg \min _{x \in \mathcal{X}} f(x),$$
where $f(x): \mathcal{X} \rightarrow \mathbb{R}$ is the objective function to minimize, $x^{\star}$ is the hyperparameter vector yielding the lowest value of $f$, and the dimension of the domain $\mathcal{X}$ equals the number of hyperparameters considered. In practice, evaluating the objective function is extremely costly, which makes selecting proper hyperparameters by hand intractable. Beyond manual tuning, grid search and random search [47] are two common methods that perform slightly better. However, these methods still waste a large amount of time evaluating poor hyperparameters across the entire search space, which is relatively inefficient. In contrast, Bayesian optimization estimates the true objective function with a probability model. It then uses Bayes' theorem to update this model based on previous results and chooses the next promising hyperparameters. In practice, this method can find better hyperparameters in less time. Figure 1 illustrates the Bayesian optimization procedure.

Fig. 1. Illustration of the Bayesian optimization procedure. Bayesian optimization estimates the true objective function with a probability model called a surrogate. The solid blue curve represents the true objective function $f(x)$. The red crosses are sampling points of $f(x)$. The black dotted curve and purple shading represent the mean and confidence interval estimated by the surrogate of the objective function.

Sequential model-based optimization (SMBO) algorithms are formalizations of Bayesian optimization [46]. These algorithms have two key ingredients:

1. A probabilistic surrogate model $S$. SMBO approximates the objective function $f$ with a probabilistic model called a surrogate, which is cheaper to evaluate. This surrogate contains a prior distribution capturing beliefs about the behavior of the objective function and is then updated sequentially after each new trial.

2. An acquisition function $\alpha : \mathcal{X} \rightarrow \mathbb{R}$. The acquisition function is the criterion by which the next vector of hyperparameters is chosen based on the surrogate model.

For an SMBO algorithm at iteration $n$, the next location $x_{n+1}$ is selected by optimizing the acquisition function $\alpha_n$, and the true objective $f$ is then evaluated there to obtain the result $y_{n+1} = f(x_{n+1})$. The new tuple $(x_{n+1},y_{n+1})$ is appended to the historical set $\mathcal{H}$. Then, the surrogate model $S$ is updated to incorporate the new result, meaning the prior is updated to a more informative posterior distribution over the space of objective functions. The pseudocode of this framework is summarized in Algorithm 1; a sketch of the loop it formalizes is shown below.
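
Algorithm 1 is not reproduced here, but the SMBO loop just described can be sketched in Python-style pseudocode as follows; `surrogate.fit` and `acquisition.argmax` are hypothetical interfaces standing in for the surrogate update and acquisition maximization:

```python
def smbo(f, space, surrogate, acquisition, n_trials: int):
    """Generic sequential model-based optimization loop.

    f           -- expensive true objective
    space       -- hyperparameter search space X
    surrogate   -- probabilistic model S approximating f
    acquisition -- criterion (e.g., expected improvement) over the surrogate
    """
    history = []                                       # the set H of (x, y) tuples
    for _ in range(n_trials):
        surrogate.fit(history)                         # update posterior over f
        x_next = acquisition.argmax(surrogate, space)  # next promising point
        y_next = f(x_next)                             # one expensive evaluation
        history.append((x_next, y_next))
    return min(history, key=lambda t: t[1])            # best (x, y) found
```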

The most common choice of acquisition function is expected improvement (EI):

$$\mathrm{EI}_{y^{*}}(x):=\int_{-\infty}^{\infty} \max \left(y^{*}-y, 0\right) p_{S}\left(y \mid x\right) d y$$

Here, $y^{*}$ is a threshold value of the objective function $f$, and $p_{S}\left(y \mid x\right)$ represents the surrogate probability model. If this expectation is positive, then the vector of hyperparameters $x$ is expected to produce a better result than $y^{*}$. There are several strategies for constructing the surrogate model: a Gaussian process approach [48], random forests [49] and the tree-structured Parzen estimator (TPE) [46]. In this work, the TPE approach is adopted, which supports continuous, categorical and conditional parameters, as well as priors over which values of each hyperparameter are expected to perform best [50]. In contrast, the Gaussian process approach and random forests support only one or two of these parameter types and thus cannot handle our task, which covers continuous, categorical and conditional parameters. Instead of directly modeling $p\left(y \mid x\right)$, this method models $p\left(x \mid y\right)$ using two densities over the configuration space $\mathcal{X}$:

$$p\left(x \mid y\right)= \left\{\begin{array}{cc}\ell(x) & \text{ if } y<y^{*} \\ g(x) & \text{ if } y \geq y^{*}\end{array}\right.$$

This algorithm chooses $y^{*}$ to be some quantile $\gamma$ of the observed $y$ values, so that $p\left(y<y^{*}\right) = \gamma$. Then $\mathrm{EI}_{y^{*}}(x)=\frac{\gamma y^{*} \ell(x)-\ell(x) \int_{-\infty}^{y^{*}} p(y) d y}{\gamma \ell(x)+(1-\gamma) g(x)} \propto \left(\gamma +\frac{g(x)}{\ell(x)}(1-\gamma)\right)^{-1}$. The tree-structured form of $\ell$ and $g$ makes it easy to draw many candidates and rank them according to $g(x) / \ell(x)$. On each iteration, the algorithm returns the candidate $x$ with the greatest EI. We implement this algorithm for hyperparameter optimization of the neural networks predicting CV QKD key rates using the Python library Hyperopt [51].
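
A minimal Hyperopt sketch in the spirit of our setup is shown below. The search space here is illustrative rather than the full spaces of Tables 1 and 2, and the objective body is a synthetic stand-in: in practice, this is where a network would be built from the sampled hyperparameters, trained with Adam, and scored by its validation loss.

```python
from hyperopt import fmin, tpe, hp, Trials, STATUS_OK

# Illustrative search space mixing categorical and continuous hyperparameters.
space = {
    "n_units": hp.choice("n_units", [128, 256, 512]),
    "dropout": hp.uniform("dropout", 0.0, 0.5),
    "activation": hp.choice("activation", ["relu", "tanh", "elu"]),
    "batch_size": hp.choice("batch_size", [256, 512, 1024]),
    "gamma": hp.uniform("gamma", 0.5, 1.0),
}

def objective(params):
    # Synthetic stand-in for "train a network, return validation loss",
    # so this example runs end-to-end.
    val_loss = (params["dropout"] - 0.1) ** 2 + abs(params["gamma"] - 0.9)
    return {"loss": val_loss, "status": STATUS_OK}

trials = Trials()
best = fmin(fn=objective, space=space, algo=tpe.suggest,
            max_evals=10, trials=trials)   # 10 TPE iterations, as in Sec. 4
print(best)
```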

4. Method

Artificial neural networks can approximate arbitrary bounded continuous mappings on a given domain, according to the universal approximation theorem [52]. Therefore, we expect that a neural network can learn the mapping between the input variables defined in the constraints of Eq. (3) and the output key rates, which avoids solving the time-consuming optimization problem and computes key rates with low latency. We demonstrated the possibility of using a neural network to predict the key rates of discrete-modulated CV QKD in previous work [39]. In that work, we built a four-layer fully connected forward neural network with a loss function designed specifically for predicting the key rates of discrete-modulated CV QKD with homodyne detection. This loss function is the key ingredient keeping the output key rates reliable and tight. We retain it in this work but use the TPE algorithm to search the other parts of the neural network to improve its overall performance. The loss function is as follows:

$$\mathcal{L}=\frac{1}{n} \sum_{i=1}^{n}\left[\gamma\left(e_{i}^{* 2}+\max \left(e_{i}^{*},-\log _{10}(\varepsilon)\right)\right)-(1-\gamma) \min \left(e_{i}^{*}, 0\right)\right]$$

For training inputs $\left\{\vec{x}_{i}\right\}$ and corresponding labels $\left\{y_{i}\right\}$, $n$ is the size of $\left\{\vec{x}_{i}\right\}$, and $e_i^{*}=y_{i}^{*p}-y_{i}^{*}$ is the residual between the network output $y_{i}^{*p}$ and the preprocessed label $y_{i}^{*}=-\log_{10}\left(y_{i}\right)$. The loss contains two significant hyperparameters, $\gamma$ and $\varepsilon$, whose choices are crucial to the model's performance, as we showed in Ref. [39]. The hyperparameter $\gamma$ forces the predicted key rates to be as information-theoretically secure as possible, while $\varepsilon$ forces them to be as close to the numerical results as possible. Here, apart from the input layer and output layer, we do not fix the structure of the neural network but use the TPE algorithm to search it efficiently within a set configuration space. An illustration of our model is shown in Fig. 2.
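
A direct PyTorch rendering of this loss, as a sketch under our reading of Eq. (10) (predictions and labels are assumed to already be in the preprocessed $-\log_{10}$ domain):

```python
import math
import torch

def key_rate_loss(pred: torch.Tensor, target: torch.Tensor,
                  gamma: float, eps: float) -> torch.Tensor:
    """Loss of Eq. (10). In the -log10 domain, e < 0 means the predicted
    key rate exceeds the true one (insecure), which the second term
    penalizes; the first term keeps secure predictions tight."""
    e = pred - target                        # residual e* = y*_pred - y*
    c = -math.log10(eps)                     # threshold -log10(epsilon)
    secure_term = gamma * (e ** 2 + torch.clamp(e, min=c))    # max(e, c)
    insecure_term = (1.0 - gamma) * torch.clamp(e, max=0.0)   # min(e, 0)
    return (secure_term - insecure_term).mean()
```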

Fig. 2. Schematic diagram of our neural network model combined with Bayesian optimization. The dataset used to train the neural network to predict key rates is generated by a numerical approach. The hyperparameters related to the neural network's architecture are not determined by humans but updated by Bayesian optimization, which first establishes a probability model relating the neural network hyperparameters to performance. Then, according to the evaluation metric produced in each trial, such as the validation loss, Bayesian optimization updates the probabilistic surrogate model and suggests the next choice of hyperparameters. After several trials, we automatically obtain the best-performing neural network.

Before training the neural network under the TPE method, we generate datasets for the two protocols by the aforementioned numerical approach. To obtain diverse datasets for the QPSK heterodyne detection protocol, we generate $36$ sets of data covering excess noise $\xi$ from $0.0045$ to $0.0405$. Each set contains $80$ random samplings of $\xi$ from an interval of length $0.001$, for example $\left[0.0045,0.0055\right]$. For each sampling, we generate data every $5$ km for transmission distances $L$ up to $200$ km. At each distance, we generate data for amplitudes $\alpha$ from $0.62$ to $0.72$ in steps of $0.01$. The total dataset contains $809,600$ input instances $\left\{\vec{x}_{i}\right\}$ and corresponding labels $\left\{y_{i}\right\}$. For the QPSK homodyne detection protocol, the excess noise is sampled randomly from $\xi = 0.014$ to $0.042$ with sampling intervals of length $0.002$, for example $\left[0.014,0.016\right]$, and the amplitude $\alpha$ is sampled from $\left[0.60,1.05\right]$. The total dataset size is $368,116$. For both protocols, each $\vec{x}_{i} \in \left\{\vec{x}_{i}\right\}$ is a vector of $29$ variables: $16$ are the right-hand sides of the first four constraints of Eq. (3), $12$ are the off-diagonal elements of the matrix on the right-hand side of the last constraint of Eq. (3), and the remaining one is the excess noise $\xi$. The label $y_i$ is the corresponding key rate; a sketch of assembling one training pair is given below. Therefore, we fix the number of neurons in the network's input layer at $29$ and in the output layer at $1$; the search space of the other hyperparameters can be found in Tables 1 and 2.
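
The preprocessing from Ref. [39] amounts to a log transform of the labels. A minimal sketch of assembling one $(\vec{x}_i, y_i^{*})$ pair (variable names are illustrative, and the exact real-number encoding of the 12 off-diagonal parameters is an assumption here):

```python
import numpy as np

def make_instance(expectations, rho_a_offdiag, xi, key_rate):
    """Build one (x, y*) training pair for the network.

    expectations  -- 16 values p_k<q>_k, p_k<p>_k, p_k<n>_k, p_k<d>_k, k=0..3
    rho_a_offdiag -- 12 real parameters encoding the off-diagonal part of
                     Tr_B[rho_AB] from the last constraint of Eq. (3)
    xi            -- excess noise
    key_rate      -- numerical key rate (label before preprocessing)
    """
    x = np.concatenate([np.asarray(expectations, dtype=float),
                        np.asarray(rho_a_offdiag, dtype=float),
                        [float(xi)]])        # 16 + 12 + 1 = 29 input features
    y_star = -np.log10(key_rate)             # preprocessed label y* = -log10(y)
    return x, y_star
```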

Table 1. Hyperparameter search space of the neural network under the TPE algorithm for the QPSK heterodyne detection protocol [37]. The neural network model used here is a fully connected forward network. Fixing the number of neurons in the input and output layers, we search the architecture of the hidden layers. For each of the three hidden layers, we determine the number of neurons, the activation function and the ratio of the dropout layer following it, where the dropout technique [53] is used to prevent overfitting. The batch size of the training process and the two essential hyperparameters $\gamma$ and $\varepsilon$ are searched as well. Braces $\{\}$ denote a finite set containing all possible discrete values; parentheses $()$ denote a continuous range.

Table 2. Hyperparameter search space of the neural network under the TPE algorithm for the QPSK homodyne detection protocol [43]. Unlike for the QPSK heterodyne detection protocol [37], here we also search over the number of hidden layers ($3$ or $4$).

Before feeding data into the neural networks, we split the data into a training set and a test set and apply the data preprocessing of Ref. [39]. For the QPSK heterodyne detection protocol, the training set contains $769,120$ instances and the test set contains $40,480$. For the QPSK homodyne detection protocol, the training set contains $327,636$ instances and the test set contains $17,244$. In both cases, $10\%$ of the training data is split off as a validation set. We generated the dataset on the blade cluster system of the High Performance Computing Center of Nanjing University, consuming over $250,000$ core hours on nodes with four Intel Xeon Gold 6248 CPUs each, which represents immense computational power. Under the TPE algorithm with a maximum of $10$ iterations, the Adam algorithm [54] is used to train the neural networks for $200$ epochs with an initial learning rate of $0.001$; a sketch of assembling one candidate network from a TPE sample is shown below. The search takes roughly $53$ hours for the QPSK heterodyne detection protocol and $23$ hours for the QPSK homodyne detection protocol on an Nvidia A100 GPU.
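
A sketch of how one candidate network might be assembled from a TPE sample (PyTorch; the layer options mirror the search spaces of Tables 1 and 2, but the dictionary keys and defaults are illustrative):

```python
import torch.nn as nn

ACTS = {"relu": nn.ReLU, "tanh": nn.Tanh, "elu": nn.ELU}

def build_network(params: dict, n_in: int = 29, n_out: int = 1) -> nn.Sequential:
    """Fully connected forward network from a sampled hyperparameter dict.

    params["hidden"] is a list of per-layer dicts, e.g.
    [{"units": 256, "act": "relu", "dropout": 0.1}, ...], one per hidden layer.
    """
    layers, width = [], n_in
    for h in params["hidden"]:
        layers += [nn.Linear(width, h["units"]),   # hidden linear layer
                   ACTS[h["act"]](),               # sampled activation
                   nn.Dropout(h["dropout"])]       # dropout against overfitting
        width = h["units"]
    layers.append(nn.Linear(width, n_out))         # single preprocessed key rate
    return nn.Sequential(*layers)
```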

5. Result

After the TPE search and training are complete, we obtain the resulting neural network structures for both cases, shown in Tables 3 and 4. We then use the selected and trained networks to predict key rates on the test set for both protocols. The fraction of predicted key rates that are secure reaches $99.15\%$ for the QPSK heterodyne detection protocol and $99.59\%$ for the QPSK homodyne detection protocol, which suggests that our method combining a neural network with Bayesian optimization is highly reliable. For the securely predicted key rates, namely, those whose predicted values are lower than the true values, we plot the relative deviation distributions for both protocols in Fig. 3, which suggests that our method has good tightness.

Fig. 3. Relative deviation distribution of securely predicted key rates on the respective test set for the two protocols: (a) Protocol 1 (with heterodyne detection) [37]. Among these secure results, the relative deviations falling in $\left[-20\%,0\right]$ account for $92.49\%$ and those falling in $\left[-40\%,0\right]$ account for $96.48\%$. (b) Protocol 2 (with improved homodyne detection) [43]. The relative deviations falling in $\left[-20\%,0\right]$ account for $90.26\%$ and those falling in $\left[-40\%,0\right]$ account for $98.39\%$.

Table 3. Resulting structure of the neural network for the QPSK heterodyne detection protocol [37]

Table 4. Resulting structure of the neural network for the QPSK homodyne detection protocol [43]

Here, we also compare the predicted and numerical key rates versus transmission distance for the two protocols; the comparison is shown in Fig. 4. For this plot, we use the same numerical approach to compute the best key rates of the two protocols for different excess noises by optimizing the amplitude $\alpha$ of the signal states in the ranges $\left[0.62,0.72\right]$ and $\left[0.62,1.03\right]$, respectively, with a step of $0.01$. The choice of the excess noise range is consistent with the sampling intervals of the training data. The photon-number cutoff is $N_c = 12$, and the maximal iteration number of the first step in the numerical approach is $N_i=300$. We record the $29$ variables producing the best key rates as the neural networks' inputs to predict key rates. As shown in Figs. 4(c) and (d), the predicted results are all secure and remain tight, with relative deviations within $10\%$ to $20\%$ when the transmission distance is below $150$ km for both protocols.

Fig. 4. Comparison between NN-predicted results and numerical results in key rates for the two protocols. (a) Predicted key rates vs. transmission distance for the QPSK heterodyne detection protocol [37] for different values of the excess noise, from top to bottom, $\xi =0.005,0.010,0.015,0.020,0.025,0.030,0.035,0.040$. The signal state amplitude of the QPSK heterodyne detection protocol is optimized in the range $\left[0.62,0.72\right]$ with a step of $0.01$. (b) Predicted key rates vs. transmission distance for the QPSK homodyne detection protocol [43] for different values of excess noise, from top to bottom, $\xi =0.015,0.020,0.025,0.030,0.035,0.040$. The amplitude of the QPSK homodyne detection protocol is optimized in the range $\left[0.62,1.03\right]$ with a step of $0.01$. (c),(d) Relative deviation between NN-predicted results and numerical results for the QPSK heterodyne detection protocol and the QPSK homodyne detection protocol, respectively; the corresponding NN-predicted results are shown in (a) and (b). For all points, we set reconciliation efficiency $\beta =0.95$, postselection $\Delta =0$, transmittance $\eta =10^{-\frac{a L}{10}}$ at distance $L$ with $a=0.2$ dB/km, photon-number cutoff $N_c=12$ and maximal iteration number $N_i=300$ for the first step of the numerical method.

To show the efficiency of our method, we compare the running time of the neural network method and the numerical method on a high-performance personal computer with a 3.3 GHz AMD Ryzen 9 4900H and 16 GB of RAM, as shown in Fig. 5. The results suggest that the neural network method is generally 6–8 orders of magnitude faster than the numerical method. For example, when $\xi =0.025$, the numerical method consumes approximately $850$ seconds to calculate the key rate at $50$ km for the QPSK heterodyne detection protocol, and when $\xi =0.035$, it consumes approximately $1260$ seconds at $25$ km for the QPSK homodyne detection protocol. In contrast, the trained neural network obtains results in approximately $0.0001$ seconds, which is almost real time; such latencies can be measured as in the sketch below.
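
The per-point inference latency quoted above can be measured with a simple loop; this is a sketch, where `model` is a trained network as in the earlier sketches and the measured time naturally varies with hardware:

```python
import time
import torch

def mean_inference_time(model, x: torch.Tensor, n_repeats: int = 1000) -> float:
    """Average wall-clock time of one key-rate prediction, in seconds."""
    model.eval()
    with torch.no_grad():
        model(x)                          # warm-up pass
        t0 = time.perf_counter()
        for _ in range(n_repeats):
            model(x)                      # one forward pass = one key rate
        return (time.perf_counter() - t0) / n_repeats

# Example: x = torch.randn(1, 29) for a single 29-feature input instance.
```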

Fig. 5. Running time comparison between the neural-network method and the numerical method. (a) Time comparison of protocol 1 for excess noise $\xi =0.005,0.015,0.025$, shown as circles, triangles and diamonds, respectively. (b) Time comparison of protocol 2 for excess noise $\xi =0.015,0.025,0.035$, shown as circles, triangles and diamonds, respectively. Each point is the ratio of the time consumed by the numerical method to that consumed by the neural-network method. The values of $\beta$, $\Delta$, $\eta$, $N_c$ and $N_i$ are the same as those in Fig. 4.

6. Discussion and conclusion

To summarize, we develop a neural network model combined with Bayesian optimization to directly extract key rates with high reliability, considerable tightness and great efficiency. Instead of having humans design the neural network architecture and perform troublesome manual tuning of hyperparameters, we utilize a special Bayesian optimization method, the TPE algorithm, to automatically search for the structure and hyperparameters that best fit a given dataset. We exemplify our method on two promising discrete-modulated CV QKD protocols with different detection techniques across a large range of excess noise and transmission distances. For both protocols, the neural networks selected by the TPE algorithm predict information-theoretically secure key rates with high probability (up to $99.15\%$ for the QPSK heterodyne detection protocol and $99.59\%$ for the QPSK homodyne detection protocol), and the results present considerable tightness.

We show that our method is approximately $10^{7}$ times faster than the numerical method, which completely satisfies the requirements of QKD systems in practice. In contrast, the numerical method takes several minutes to calculate a single key rate point, which is intolerable since many free-space sessions, such as satellite-ground or handheld QKD, might have a window of only minutes. While collecting enough data based on the numerical method to train the model consumes a large amount of computing power, these large computations can be done offline. Once we obtain the trained neural network, it can be deployed on a device to infer key rates online in milliseconds given new inputs from the experiment. Ref. [55] demonstrated that a neural network method for parameter optimization of QKD can be deployed on various mobile low-power systems, which brings the advantages of greater power efficiency and low latency. We forecast that our neural network method combined with Bayesian optimization will play an essential role in free-space QKD scenarios such as handheld [56], drone-based [57] or satellite-ground QKD [58]. Several works have focused on machine learning for optimal parameters in QKD [29,55,59,60]. However, our work predicts secure key rates directly with automatically designed neural networks, which goes further than our previous work [39].

Based on our model, there are several directions worth investigating in future work. So far, we have only covered the computation of asymptotic key rates. However, finite-size effects are practical issues in discrete-modulated CV QKD [61]. Note that a recent work has analyzed the security and performance of discrete-modulated CV QKD in a finite-size scenario [62], which inspires us to improve our model. We also consider applying our model to other protocols in future work. Moreover, the issue of post-processing (notably the error-correction part) still limits the overall time acceleration of a discrete-modulated CV QKD system. Note that error correction involving binary or quaternary error-correcting codes is less complex than in the Gaussian-modulation case. Therefore, we also consider developing an effective error-correction protocol for CV QKD with discrete modulation using machine learning techniques in the future.

Funding

Natural Science Foundation of Jiangsu Province (BK20211145); Fundamental Research Funds for the Central Universities (020414380182); Key Research and Development Program of Nanjing Jiangbei New Area (ZDYD20210101); Program for Innovative Talents and Entrepreneurs in Jiangsu (JSSCRC2021484); Key-Area Research and Development Program of Guangdong Province (2020B0303040001).

Disclosures

The authors declare no conflicts of interest.

Data availability

Data underlying the results presented in this paper are not publicly available at this time but may be obtained from the authors upon reasonable request.

References

1. S. Grigorescu, B. Trasnea, T. Cocias, and G. Macesanu, “A survey of deep learning techniques for autonomous driving,” J. Field Robotics 37(3), 362–386 (2020). [CrossRef]  

2. J. Levinson, J. Askeland, J. Becker, J. Dolson, D. Held, S. Kammel, J. Z. Kolter, D. Langer, O. Pink, V. Pratt, M. Sokolsky, G. Stanek, D. Stavens, A. Teichman, M. Werling, and S. Thrun, “Towards fully autonomous driving: Systems and algorithms,” in 2011 IEEE intelligent vehicles symposium (IV), (IEEE, 2011), pp. 163–168.

3. L. Deng, G. Hinton, and B. Kingsbury, “New types of deep neural network learning for speech recognition and related applications: An overview,” in 2013 IEEE international conference on acoustics, speech and signal processing, (IEEE, 2013), pp. 8599–8603.

4. T. Young, D. Hazarika, S. Poria, and E. Cambria, “Recent trends in deep learning based natural language processing,” IEEE Comput. Intell. Mag. 13(3), 55–75 (2018). [CrossRef]  

5. J. Jumper, R. Evans, A. Pritzel, T. Green, M. Figurnov, O. Ronneberger, K. Tunyasuvunakool, R. Bates, A. Žídek, A. Potapenko, A. Bridgland, C. Meyer, S. A. A. Kohl, A. J. Ballard, A. Cowie, B. Romera-Paredes, S. Nikolov, R. Jain, J. Adler, T. Back, S. Petersen, D. Reiman, E. Clancy, M. Zielinski, M. Steinegger, M. Pacholska, T. Berghammer, S. Bodenstein, D. Silver, O. Vinyals, A. W. Senior, K. Kavukcuoglu, P. Kohli, and D. Hassabis, “Highly accurate protein structure prediction with alphafold,” Nature 596(7873), 583–589 (2021). [CrossRef]  

6. A. Davies, P. Veličković, L. Buesing, S. Blackwell, D. Zheng, N. Tomašev, R. Tanburn, P. Battaglia, C. Blundell, A. Juhász, M. Lackenby, G. Williamson, D. Hassabis, and P. Kohli, “Advancing mathematics by guiding human intuition with AI,” Nature 600(7887), 70–74 (2021). [CrossRef]  

7. J. Gao, L.-F. Qiao, Z.-Q. Jiao, Y.-C. Ma, C.-Q. Hu, R.-J. Ren, A.-L. Yang, H. Tang, M.-H. Yung, and X.-M. Jin, “Experimental machine learning of quantum states,” Phys. Rev. Lett. 120(24), 240501 (2018). [CrossRef]  

8. M. Yang, C.-l. Ren, Y.-c. Ma, Y. Xiao, X.-J. Ye, L.-L. Song, J.-S. Xu, M.-H. Yung, C.-F. Li, and G.-C. Guo, “Experimental simultaneous learning of multiple nonclassical correlations,” Phys. Rev. Lett. 123(19), 190401 (2019). [CrossRef]  

9. S. Ahmed, C. S. Muñoz, F. Nori, and A. F. Kockum, “Classification and reconstruction of optical quantum states with deep neural networks,” Phys. Rev. Research 3(3), 033278 (2021). [CrossRef]  

10. M. Bukov, A. G. Day, D. Sels, P. Weinberg, A. Polkovnikov, and P. Mehta, “Reinforcement learning in different phases of quantum control,” Phys. Rev. X 8, 031086 (2018). [CrossRef]  

11. A. Lumino, E. Polino, A. S. Rab, G. Milani, N. Spagnolo, N. Wiebe, and F. Sciarrino, “Experimental phase estimation enhanced by machine learning,” Phys. Rev. Appl. 10(4), 044033 (2018). [CrossRef]  

12. M. Y. Niu, S. Boixo, V. N. Smelyanskiy, and H. Neven, “Universal quantum control through deep reinforcement learning,” npj Quantum Inf. 5(1), 33 (2019). [CrossRef]  

13. A. Hentschel and B. C. Sanders, “Efficient algorithm for optimizing adaptive quantum metrology processes,” Phys. Rev. Lett. 107(23), 233601 (2011). [CrossRef]  

14. C. H. Bennett and G. Brassard, “Quantum cryptography: Public key distribution and coin tossing,” in Proceedings of the IEEE International Conference on Computers, Systems and Signal Processing (Bangalore, India, 1984), pp. 175–179.

15. A. K. Ekert, “Quantum cryptography based on Bell’s theorem,” Phys. Rev. Lett. 67(6), 661–663 (1991). [CrossRef]  

16. P. W. Shor and J. Preskill, “Simple proof of security of the BB84 quantum key distribution protocol,” Phys. Rev. Lett. 85(2), 441–444 (2000). [CrossRef]  

17. H.-K. Lo, M. Curty, and K. Tamaki, “Secure quantum key distribution,” Nat. Photonics 8(8), 595–604 (2014). [CrossRef]  

18. N. Gisin, G. Ribordy, W. Tittel, and H. Zbinden, “Quantum cryptography,” Rev. Mod. Phys. 74(1), 145–195 (2002). [CrossRef]  

19. F. Grosshans and P. Grangier, “Continuous variable quantum cryptography using coherent states,” Phys. Rev. Lett. 88(5), 057902 (2002). [CrossRef]  

20. A. M. Lance, T. Symul, V. Sharma, C. Weedbrook, T. C. Ralph, and P. K. Lam, “No-switching quantum key distribution using broadband modulated coherent light,” Phys. Rev. Lett. 95(18), 180503 (2005). [CrossRef]  

21. D. Huang, P. Huang, D. Lin, C. Wang, and G. Zeng, “High-speed continuous-variable quantum key distribution without sending a local oscillator,” Opt. Lett. 40(16), 3695–3698 (2015). [CrossRef]  

22. H.-L. Yin, W. Zhu, and Y. Fu, “Phase self-aligned continuous-variable measurement-device-independent quantum key distribution,” Sci. Rep. 9, 49 (2019). [CrossRef]  

23. S. Fossier, E. Diamanti, T. Debuisschert, A. Villing, R. Tualle-Brouri, and P. Grangier, “Field test of a continuous-variable quantum key distribution prototype,” New J. Phys. 11(4), 045023 (2009). [CrossRef]  

24. D. Huang, P. Huang, H. Li, T. Wang, Y. Zhou, and G. Zeng, “Field demonstration of a continuous-variable quantum key distribution network,” Opt. Lett. 41(15), 3511–3514 (2016). [CrossRef]  

25. P. Jouguet, S. Kunz-Jacques, T. Debuisschert, S. Fossier, E. Diamanti, R. Alléaume, R. Tualle-Brouri, P. Grangier, A. Leverrier, P. Pache, and P. Painchault, “Field test of classical symmetric encryption with continuous variables quantum key distribution,” Opt. Express 20(13), 14030–14041 (2012). [CrossRef]  

26. S. Pirandola, U. L. Andersen, L. Banchi, M. Berta, D. Bunandar, R. Colbeck, D. Englund, T. Gehring, C. Lupo, C. Ottaviani, J. L. Pereira, M. Razavi, J. Shamsul Shaari, M. Tomamichel, V. C. Usenko, G. Vallone, P. Villoresi, and P. Wallden, “Advances in quantum cryptography,” Adv. Opt. Photonics 12(4), 1012–1236 (2020). [CrossRef]  

27. D. Jin, Y. Guo, Y. Wang, Y. Li, and D. Huang, “Key-sifting algorithms for continuous-variable quantum key distribution,” Phys. Rev. A 104(1), 012616 (2021). [CrossRef]  

28. Q. Liao, G. Xiao, H. Zhong, and Y. Guo, “Multi-label learning for improving discretely-modulated continuous-variable quantum key distribution,” New J. Phys. 22(8), 083086 (2020). [CrossRef]  

29. W. Liu, P. Huang, J. Peng, J. Fan, and G. Zeng, “Integrating machine learning to achieve an automatic parameter prediction for practical continuous-variable quantum key distribution,” Phys. Rev. A 97(2), 022316 (2018). [CrossRef]  

30. Y. Mao, W. Huang, H. Zhong, Y. Wang, H. Qin, Y. Guo, and D. Huang, “Detecting quantum attacks: a machine learning based defense strategy for practical continuous-variable quantum key distribution,” New J. Phys. 22(8), 083073 (2020). [CrossRef]  

31. F. Xu, X. Ma, Q. Zhang, H.-K. Lo, and J.-W. Pan, “Secure quantum key distribution with realistic devices,” Rev. Mod. Phys. 92(2), 025002 (2020). [CrossRef]  

32. A. Leverrier and P. Grangier, “Unconditional security proof of long-distance continuous-variable quantum key distribution with discrete modulation,” Phys. Rev. Lett. 102(18), 180504 (2009). [CrossRef]  

33. A. Leverrier and P. Grangier, “Continuous-variable quantum-key-distribution protocols with a non-gaussian modulation,” Phys. Rev. A 83(4), 042312 (2011). [CrossRef]  

34. Y.-B. Zhao, M. Heid, J. Rigas, and N. Lütkenhaus, “Asymptotic security of binary modulated continuous-variable quantum key distribution under collective attacks,” Phys. Rev. A 79(1), 012307 (2009). [CrossRef]  

35. C. Simon, “Towards a global quantum network,” Nat. Photonics 11(11), 678–680 (2017). [CrossRef]  

36. P. J. Coles, E. M. Metodiev, and N. Lütkenhaus, “Numerical approach for unstructured quantum key distribution,” Nat. Commun. 7(1), 11712 (2016). [CrossRef]  

37. J. Lin, T. Upadhyaya, and N. Lütkenhaus, “Asymptotic security analysis of discrete-modulated continuous-variable quantum key distribution,” Phys. Rev. X 9, 041064 (2019). [CrossRef]  

38. A. Winick, N. Lütkenhaus, and P. J. Coles, “Reliable numerical key rates for quantum key distribution,” Quantum 2, 77 (2018). [CrossRef]  

39. M.-G. Zhou, Z.-P. Liu, W.-B. Liu, C.-L. Li, J.-L. Bai, Y.-R. Xue, Y. Fu, H.-L. Yin, and Z.-B. Chen, “Machine learning for secure key rate in continuous-variable quantum key distribution,” arXiv preprint arXiv:2108.02578 (2021).

40. H. Hu, J. Im, J. Lin, N. Lütkenhaus, and H. Wolkowicz, “Robust interior point method for quantum key distribution rate computation,” arXiv preprint arXiv:2104.03847 (2021).

41. T. Yu and H. Zhu, “Hyper-parameter optimization: A review of algorithms and applications,” arXiv preprint arXiv:2003.05689 (2020).

42. B. Shahriari, K. Swersky, Z. Wang, R. P. Adams, and N. De Freitas, “Taking the human out of the loop: A review of bayesian optimization,” Proc. IEEE 104(1), 148–175 (2015). [CrossRef]  

43. W.-B. Liu, C.-L. Li, Y.-M. Xie, C.-X. Weng, J. Gu, X.-Y. Cao, Y.-S. Lu, B.-H. Li, H.-L. Yin, and Z.-B. Chen, “Homodyne detection quadrature phase shift keying continuous-variable quantum key distribution with high excess noise tolerance,” PRX Quantum 2(4), 040334 (2021). [CrossRef]  

44. I. Devetak and A. Winter, “Distillation of secret key and entanglement from quantum states,” Proc. R. Soc. A 461(2053), 207–235 (2005). [CrossRef]  

45. S. Ghorai, P. Grangier, E. Diamanti, and A. Leverrier, “Asymptotic security of continuous-variable quantum key distribution with a discrete modulation,” Phys. Rev. X 9, 021059 (2019). [CrossRef]  

46. J. Bergstra, R. Bardenet, Y. Bengio, and B. Kégl, “Algorithms for hyper-parameter optimization,” Advances in Neural Information Processing Systems 24 (NIPS 2011) (2011).

47. J. Bergstra and Y. Bengio, “Random search for hyper-parameter optimization,” J. Mach. Learn. Res. 13, 281–305 (2012).

48. C. K. Williams and C. E. Rasmussen, Gaussian processes for machine learning, vol. 2 (MIT press Cambridge, MA, 2006).

49. L. Breiman, “Random forests,” Mach. learning 45(1), 5–32 (2001). [CrossRef]  

50. F. Hutter, J. Lücke, and L. Schmidt-Thieme, “Beyond manual tuning of hyperparameters,” KI-Künstliche Intelligenz 29(4), 329–337 (2015). [CrossRef]  

51. J. Bergstra, D. Yamins, and D. D. Cox, “Hyperopt: A python library for optimizing the hyperparameters of machine learning algorithms,” in Proceedings of the 12th Python in science conference, vol. 13 (Citeseer, 2013), p. 20.

52. K. Hornik, M. Stinchcombe, and H. White, “Multilayer feedforward networks are universal approximators,” Neural networks 2(5), 359–366 (1989). [CrossRef]  

53. N. Srivastava, G. Hinton, A. Krizhevsky, I. Sutskever, and R. Salakhutdinov, “Dropout: a simple way to prevent neural networks from overfitting,” J. Mach. Learn. Res. 15, 1929–1958 (2014).

54. D. P. Kingma and J. Ba, “Adam: A method for stochastic optimization,” arXiv preprint arXiv:1412.6980 (2014).

55. W. Wang and H.-K. Lo, “Machine learning for optimal parameter prediction in quantum key distribution,” Phys. Rev. A 100(6), 062334 (2019). [CrossRef]  

56. G. Mélen, P. Freiwang, J. Luhn, T. Vogl, M. Rau, C. Sonnleitner, W. Rosenfeld, and H. Weinfurter, “Handheld quantum key distribution,” in Quantum Information and Measurement (Optical Society of America, 2017), pp. QT6A–57.

57. A. D. Hill, J. Chapman, K. Herndon, C. Chopp, D. J. Gauthier, and P. Kwiat, “Drone-based quantum key distribution,” Urbana 51, 61801–63003 (2017).

58. S.-K. Liao, W.-Q. Cai, W.-Y. Liu, L. Zhang, Y. Li, J.-G. Ren, J. Yin, Q. Shen, Y. Cao, Z.-P. Li, F.-Z. Li, X.-W. Chen, L.-H. Sun, J.-J. Jia, J.-C. Wu, X.-J. Jiang, J.-F. Wang, Y.-M. Huang, Q. Wang, Y.-L. Zhou, L. Deng, T. Xi, L. Ma, T. Hu, Q. Zhang, Y.-A. Chen, N.-L. Liu, X.-B. Wang, Z.-C. Zhu, C.-Y. Lu, R. Shu, C.-Z. Peng, J.-Y. Wang, and J.-W. Pan, “Satellite-to-ground quantum key distribution,” Nature 549(7670), 43–47 (2017). [CrossRef]  

59. F.-Y. Lu, Z.-Q. Yin, C. Wang, C.-H. Cui, J. Teng, S. Wang, W. Chen, W. Huang, B.-J. Xu, G.-C. Guo, and Z.-F. Han, “Parameter optimization and real-time calibration of a measurement-device-independent quantum key distribution network based on a back propagation artificial neural network,” J. Opt. Soc. Am. B 36(3), B92–B98 (2019). [CrossRef]  

60. H.-J. Ding, J.-Y. Liu, C.-M. Zhang, and Q. Wang, “Predicting optimal parameters with random forest for quantum key distribution,” Quantum Inf. Process. 19(2), 60 (2020). [CrossRef]  

61. A. Leverrier, F. Grosshans, and P. Grangier, “Finite-size analysis of a continuous-variable quantum key distribution,” Phys. Rev. A 81(6), 062343 (2010). [CrossRef]  

62. M. Almeida, D. Pereira, N. J. Muga, M. Facão, A. N. Pinto, and N. A. Silva, “Secret key rate of multi-ring M-APSK continuous variable quantum key distribution,” Opt. Express 29(23), 38669–38682 (2021). [CrossRef]  
