| Title: |
Reinforcement Learning Control of Quantum Error Correction |
| Authors: |
Sivak, Volodymyr; Morvan, Alexis; Broughton, Michael; Cortiñas, Rodrigo G.; Bausch, Johannes; Senior, Andrew W.; Neeley, Matthew; Eickbusch, Alec; Shutty, Noah; Beni, Laleh Aghababaie; Spencer, James S.; Heras, Francisco J. H; Edlich, Thomas; Abanin, Dmitry; Abbas, Amira; Acharya, Rajeev; Aigeldinger, Georg; Alcaraz, Ross; Alcaraz, Sayra; Andersen, Trond I.; Ansmann, Markus; Arute, Frank; Arya, Kunal; Askew, Walt; Astrakhantsev, Nikita; Atalaya, Juan; Ballard, Brian; Bardin, Joseph C.; Bates, Hector; Bengtsson, Andreas; Karimi, Majid Bigdeli; Bilmes, Alexander; Bilodeau, Simon; Borjans, Felix; Bourassa, Alexandre; Bovaird, Jenna; Bowers, Dylan; Brill, Leon; Brooks, Peter; Browne, David A.; Buchea, Brett; Buckley, Bob B.; Burger, Tim; Burkett, Brian; Bushnell, Nicholas; Busnaina, Jamal; Cabrera, Anthony; Campero, Juan; Chang, Hung-Shen; Chen, Silas; Chiaro, Ben; Chih, Liang-Ying; Cleland, Agnetta Y.; Cochrane, Bryan; Cockrell, Matt; Cogan, Josh; Collins, Roberto; Conner, Paul; Cook, Harold; Courtney, William; Crook, Alexander L.; Curtin, Ben; Damyanov, Martin; Das, Sayan; Debroy, Dripto M.; Demura, Sean; Donohoe, Paul; Drozdov, Ilya; Dunsworth, Andrew; Ehimhen, Valerie; Elbag, Aviv Moshe; Ella, Lior; Elzouka, Mahmoud; Enriquez, David; Erickson, Catherine; Ferreira, Vinicius S.; Flores, Marcos; Burgos, Leslie Flores; Forati, Ebrahim; Ford, Jeremiah; Fowler, Austin G.; Foxen, Brooks; Fukami, Masaya; Fung, Alan Wing Lun; Fuste, Lenny; Ganjam, Suhas; Garcia, Gonzalo; Garrick, Christopher; Gasca, Robert; Gehring, Helge; Geiger, Robert; Genois, Élie; Giang, William; Gilboa, Dar; Goeders, James E.; Gonzales, Edward C.; Gosula, Raja; de Graaf, Stijn J.; Dau, Alejandro Grajales; Graumann, Dietrich; Grebel, Joel; Greene, Alex; Gross, Jonathan A.; Guerrero, Jose; Guevel, Loïck Le; Ha, Tan; Habegger, Steve; Hadick, Tanner; Hadjikhani, Ali; Hamilton, Michael C.; Harrigan, Matthew P.; Harrington, Sean D.; Hartshorn, Jeanne; Heslin, Stephen; Heu, Paula; Higgott, Oscar; Hiltermann, Reno; Huang, Hsin-Yuan; Hucka, Mike; Hudspeth, Christopher; Huff, Ashley; Huggins, William J.; Jeffrey, Evan; Jevons, Shaun; Jiang, Zhang; Jin, Xiaoxuan; Joshi, Chaitali; Juhas, Pavol; Kabel, Andreas; Kafri, Dvir; Kang, Hui; Kang, Kiseo; Karamlou, Amir H.; Kaufman, Ryan; Kechedzhi, Kostyantyn; Khattar, Tanuj; Khezri, Mostafa; Kim, Seon; Knaut, Can M.; Kobrin, Bryce; Kostritsa, Fedor; Kreikebaum, John Mark; Kudo, Ryuho; Kueffler, Ben; Kumar, Arun; Kurilovich, Vladislav D.; Kutsko, Vitali; Lacroix, Nathan; Landhuis, David; Lange-Dei, Tiano; Langley, Brandon W.; Laptev, Pavel; Lau, Kim-Ming; Ledford, Justin; Lee, Joy; Lee, Kenny; Lester, Brian J.; Leung, Wendy; Li, Lily; Li, Wing Yan; Li, Ming; Lill, Alexander T.; Livingston, William P.; Lloyd, Matthew T.; Locharla, Aditya; De Lorenzo, Laura; Lundahl, Daniel; Lunt, Aaron; Madhuk, Sid; Maiti, Aniket; Maloney, Ashley; Mandrà, Salvatore; Martin, Leigh S.; Martin, Orion; Mascot, Eric; Das, Paul Masih; Maslov, Dmitri; Mathews, Melvin; Maxfield, Cameron; McClean, Jarrod R.; McEwen, Matt; Meeks, Seneca; Miao, Kevin C.; Minev, Zlatko K.; Molavi, Reza; Molina, Sebastian; Montazeri, Shirin; Neill, Charles; Newman, Michael; Nguyen, Anthony; Nguyen, Murray; Ni, Chia-Hung; Niu, Murphy Yuezhen; Oas, Logan; Orosco, Raymond; Ottosson, Kristoffer; Pagano, Alice; Di Paolo, Agustin; Peek, Sherman; Peterson, David; Pizzuto, Alex; Portoles, Elias; Potter, Rebecca; Pritchard, Orion; Qian, Michael; Quintana, Chris; Ranadive, Arpit; Reagor, Matthew J.; Resnick, Rachel; Rhodes, David M.; Riley, Daniel; Roberts, Gabrielle; Rodriguez, Roberto; Ropes, Emma; De Rose, Lucia B.; Rosenberg, Eliott; Rosenfeld, Emma; Rosenstock, Dario; Rossi, Elizabeth; Roushan, Pedram; Rower, David A.; Salazar, Robert; Sankaragomathi, Kannan; Sarihan, Murat Can; Satzinger, Kevin J.; Schaefer, Max; Schroeder, Sebastian; Schurkus, Henry F.; Shahingohar, Aria; Shearn, Michael J.; Shorter, Aaron; Shvarts, Vladimir; Small, Spencer; Smith, W. Clarke; Sobel, David A.; Spells, Barrett; Springer, Sofia; Sterling, George; Suchard, Jordan; Szasz, Aaron; Sztein, Alexander; Taylor, Madeline; Thiruraman, Jothi Priyanka; Thor, Douglas; Timucin, Dogan; Tomita, Eifu; Torres, Alfredo; Torunbalci, M. Mert; Tran, Hao; Vaishnav, Abeer; Vargas, Justin; Vdovichev, Sergey; Vidal, Guifre; Heidweiller, Catherine Vollgraff; Voorhees, Meghan; Waltman, Steven; Waltz, Jonathan; Wang, Shannon X.; Ware, Brayden; Watson, James D.; Wei, Yonghua; Weidel, Travis; White, Theodore; Wong, Kristi; Woo, Bryan W. K.; Wood, Christopher J.; Woodson, Maddy; Xing, Cheng; Yao, Z. Jamie; Yeh, Ping; Ying, Bicheng; Yoo, Juhwan; Yosri, Noureldin; Young, Elliot; Young, Grayson; Zalcman, Adam; Zhang, Ran; Zhang, Yaxing; Zhu, Ningfeng; Zobrist, Nicholas; Zou, Zhenjie; Babbush, Ryan; Bacon, Dave; Boixo, Sergio; Chen, Yu; Chen, Zijun; Devoret, Michel; Hansen, Monica; Hilton, Jeremy; Jones, Cody; Kelly, Julian; Korotkov, Alexander N.; Lucero, Erik; Megrant, Anthony; Neven, Hartmut; Oliver, William D.; Ramachandran, Ganesh; Smelyanskiy, Vadim; Klimov, Paul V. |
| Publication Year: |
2025 |
| Collection: |
ArXiv.org (Cornell University Library) |
| Subject Terms: |
Quantum Physics |
| Description: |
The promise of fault-tolerant quantum computing is challenged by environmental drift that relentlessly degrades the quality of quantum operations. The contemporary solution, halting the entire quantum computation for recalibration, is unsustainable for the long runtimes of the future algorithms. We address this challenge by unifying calibration with computation, granting the quantum error correction process a dual role: its error detection events are not only used to correct the logical quantum state, but are also repurposed as a learning signal, teaching a reinforcement learning (RL) agent to continuously steer the physical control parameters and stabilize the quantum system during the computation. We experimentally demonstrate this framework on a Willow superconducting processor, improving the logical stability of the surface code 3.5-fold against injected drift. By synthesizing our full suite of technological advances, including RL fine-tuning of the entire system and near-optimal decoding, we achieve record performance of the surface and color codes, with average logical error per cycle of $\varepsilon_L=7.72(9)\times10^{-4}$ and $\varepsilon_L=8.19(14)\times10^{-3}$ respectively. Simulations of surface codes up to distance-15 with tens of thousands control parameters confirm the scalability of our RL framework, revealing an optimization speed that is independent of the system size. This work thus enables a new paradigm: a quantum computer that learns from its errors and never stops computing. |
| Document Type: |
text |
| Language: |
unknown |
| Relation: |
http://arxiv.org/abs/2511.08493 |
| Availability: |
http://arxiv.org/abs/2511.08493 |
| Accession Number: |
edsbas.7D78295F |
| Database: |
BASE |