Quantum sensing represents one of the most promising applications of quantum technologies, with the aim of using quantum resources to improve measurement sensitivity. In particular, sensing of optical phases is one of the most investigated problems, considered key to developing mass-produced technological devices.
Optimal usage of quantum sensors requires regular characterization and calibration. In general, such calibration is an extremely complex and resource-intensive task, especially for systems that estimate multiple parameters, because of the sheer volume of measurements required and the computational time needed to analyze them. Machine-learning algorithms are a powerful tool for addressing that complexity. Finding suitable protocols for applying these algorithms is vital for developing sensors capable of precise quantum-enhanced measurements.
A particular type of machine-learning algorithm known as “reinforcement learning” (RL) relies on an intelligent agent guided by rewards: depending on the rewards it receives, it learns to perform the actions that achieve the desired optimization. The first experimental realizations using RL algorithms for the optimization of quantum problems have been reported only very recently. Most of them still rely on prior knowledge of the model describing the system. A completely model-free approach is preferable, and it becomes possible when the agent’s reward does not depend on an explicit model of the system.
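The reward-guided learning described above can be illustrated with a toy example. The following is a minimal sketch of an epsilon-greedy agent, a deliberately simple stand-in and not the deep RL algorithm used in the reported work. The names `run_bandit` and `black_box_reward`, the four actions, and the reward values are all hypothetical. The point is that the agent sees only the rewards and never the rule that generates them.

```python
import random

def run_bandit(reward_fn, n_actions, n_steps=2000, epsilon=0.1, seed=0):
    """Epsilon-greedy agent: learns action values from rewards alone."""
    rng = random.Random(seed)
    values = [0.0] * n_actions  # running estimate of each action's mean reward
    counts = [0] * n_actions    # how often each action has been tried
    for _ in range(n_steps):
        if rng.random() < epsilon:
            action = rng.randrange(n_actions)  # explore a random action
        else:
            action = max(range(n_actions), key=values.__getitem__)  # exploit
        reward = reward_fn(action, rng)
        counts[action] += 1
        # Incremental mean update of the chosen action's value estimate
        values[action] += (reward - values[action]) / counts[action]
    return values

def black_box_reward(action, rng):
    # Hypothetical environment, hidden from the agent:
    # action 2 has the highest mean reward.
    means = [0.2, 0.5, 0.8, 0.4]
    return means[action] + rng.gauss(0.0, 0.1)

values = run_bandit(black_box_reward, n_actions=4)
best_action = max(range(4), key=values.__getitem__)
print("learned values:", [round(v, 2) for v in values])
print("best action:", best_action)
```

In this sketch the agent settles on the highest-reward action without ever being given the table of mean rewards, which is the sense in which a reward-driven agent can be model-free.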
As reported in Advanced Photonics, a team of researchers from the Physics Department of Sapienza University of Rome and the Institute for Photonics and Nanotechnologies (IFN-CNR) recently developed a model-free approach that extends the range of possible applications to adaptive multiphase estimation. They demonstrated its effectiveness on a highly reconfigurable integrated photonic platform. In their experiment, they used the RL algorithm to optimize the estimation of multiple parameters and combined it with a deep neural network that updates the Bayesian posterior probability distribution after each measurement.
The protocol treats the quantum multiparameter sensor as a complete black box: at no step is a model of the system's functioning required. Importantly, the researchers demonstrated the enhanced performance of their protocol on experimental data in a resource-limited regime and compared it with that of nonadaptive strategies, achieving significantly better estimates.
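To make the idea of updating a Bayesian posterior after each measurement concrete, here is a minimal single-phase sketch. It is not the team's method: in the reported protocol a deep neural network performs the update without an explicit system model, whereas this sketch assumes, purely for illustration, a known two-outcome interferometer likelihood P(0) = (1 + cos(φ − θ))/2 and a discretized phase grid. The function names, the grid size, and the randomly chosen control phase θ (standing in for the feedback an adaptive strategy would select) are all assumptions.

```python
import math
import random

def likelihood(outcome, phi, theta):
    # Assumed two-outcome interferometer model (illustrative only):
    # P(outcome 0) = (1 + cos(phi - theta)) / 2
    p0 = 0.5 * (1.0 + math.cos(phi - theta))
    return p0 if outcome == 0 else 1.0 - p0

def bayes_update(posterior, grid, outcome, theta):
    # Bayes' rule on a grid: multiply by the likelihood, then renormalize
    weighted = [p * likelihood(outcome, phi, theta)
                for p, phi in zip(posterior, grid)]
    total = sum(weighted)
    return [w / total for w in weighted]

def estimate_phase(true_phi, n_measurements=200, n_grid=400, seed=1):
    rng = random.Random(seed)
    # Candidate phases in (0, pi), starting from a flat prior
    grid = [math.pi * (i + 0.5) / n_grid for i in range(n_grid)]
    posterior = [1.0 / n_grid] * n_grid
    for _ in range(n_measurements):
        # Random control phase: a nonadaptive placeholder for agent feedback
        theta = rng.uniform(0.0, math.pi)
        # Simulate one measurement outcome from the assumed model
        outcome = 0 if rng.random() < likelihood(0, true_phi, theta) else 1
        posterior = bayes_update(posterior, grid, outcome, theta)
    # Report the posterior mean as the phase estimate
    return sum(p * phi for p, phi in zip(posterior, grid))

estimate = estimate_phase(true_phi=1.0)
print("estimated phase:", round(estimate, 3))
```

Each simulated measurement narrows the posterior around the true phase. An adaptive strategy would replace the random choice of θ with one selected from the current posterior, which is the role the article assigns to the RL agent.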
According to corresponding author Fabio Sciarrino, head of the Quantum Lab, “The protocol developed by our team provides a significant step toward fully artificial-intelligence-based quantum sensors.”
Read the Gold Open Access article by Cimini et al., “Deep reinforcement learning for quantum multiparameter estimation,” Adv. Photon. 5(1), 016005 (2023), doi 10.1117/1.AP.5.1.016005.