Katalog Plus
Bibliothek der Frankfurt UAS
Bald neuer Katalog: sichern Sie sich schon vorab Ihre persönlichen Merklisten im Nutzerkonto: Anleitung.
Dieses Ergebnis aus BASE kann Gästen nicht angezeigt werden.  Login für vollen Zugriff.

Voxtral Realtime

Title: Voxtral Realtime
Authors: Mistral-AI; Liu, Alexander H.; Ehrenberg, Andy; Lo, Andy; Sun, Chen-Yo; Lample, Guillaume; Delignon, Jean-Malo; Chandu, Khyathi Raghavi; von Platen, Patrick; Muddireddy, Pavankumar Reddy; Arora, Rohin; Gandhi, Sanchit; Subramanian, Sandeep; Ghosh, Soham; Mishra, Srijan; Rastogi, Abhinav; Sadé, Adrien; Jeffares, Alan; Jiang, Albert; Cahill, Alexandre; Gavaudan, Alexandre; Sablayrolles, Alexandre; Héliou, Amélie; You, Amos; Bai, Andrew; Lenglemetz, Angele; Agarwal, Anmol; Eliseev, Anton; Calvi, Antonia; Majumdar, Arjun; Sooriyarachchi, Avi; Bout, Baptiste; Rozière, Baptiste; De Monicault, Baudouin; Tibi, Benjamin; Cronjäger, Charlotte; Lanfranchi, Clémence; Chen, Connor; Barreau, Corentin; Sautier, Corentin; Courtot, Cyprien; Dabert, Darius; Casas, Diego de las; Demyanenko, Elizaveta; Chane-Sane, Elliot; Paquin, Enguerrand; Goffinet, Etienne; Niel, Fabien; Ahmed, Faruk; Baldassarre, Federico; Berrada, Gabrielle; Ecrepont, Gaëtan; Guinet, Gauthier; Hayes, Genevieve; Novikov, Georgii; Pistilli, Giada; Kunsch, Guillaume; Martin, Guillaume; Raille, Guillaume; Dhanuka, Gunjan; Gupta, Gunshi; Zhou, Han; Shah, Harshil; McGovern, Hope; Thimonier, Hugo; Mukherjee, Indraneel; Zhang, Irene; Kim, Jaeyoung; Ludziejewski, Jan; Rute, Jason; Studnia, Joachim; Harvill, John; Amar, Jonas; Delas, Joséphine; Roberts, Josselin Somerville; Tauran, Julien; Yadav, Karmesh; Khandelwal, Kartik; Tep, Kilian; Jain, Kush; Aitchison, Laurence; Fainsin, Laurent; Blier, Léonard; Zhao, Lingxiao; Martin, Louis; Saulnier, Lucile; Gao, Luyu; Buyl, Maarten; Sharma, Manan; Jennings, Margaret; Pellat, Marie; Prins, Mark; Alexandre, Martin; Poirée, Mathieu; Guillaumin, Mathilde; Dinot, Matthieu; Futeral, Matthieu; Darrin, Maxime; Augustin, Maximilian; Unsal, Mert; Chiquier, Mia; Pham, Minh-Quang; Grinsztajn, Nathan; Gupta, Neha; Bousquet, Olivier; Duchenne, Olivier; Wang, Patricia; Jacob, Paul; Wambergue, Paul; Kurylowicz, Paula; Pinel, Philippe; Chagniot, Philomène; Stock, Pierre; Miłoś, Piotr; Gupta, Prateek; Agrawal, Pravesh; Torroba, Quentin; Ramrakhya, Ram; Shah, Rishi; Sauvestre, Romain; Soletskyi, Roman; Millner, Rosalie; Menneer, Rupert; Vaze, Sagar; Barry, Samuel; Humeau, Samuel; Cha, Sean; Verma, Shashwat; Waghjale, Siddhant; Gandhi, Siddharth; Lepage, Simon; Aithal, Sumukh; Antoniak, Szymon; Scao, Teven Le; Cachet, Théo; Sorg, Theo Simon; Lavril, Thibaut; Chabal, Thomas; Foubert, Thomas; Robert, Thomas; Wang, Thomas; Lawson, Tim; Bewley, Tom; Edwards, Tom; Wang, Tyler; Jamil, Umar; Tomasini, Umberto; Nemychnikova, Valeriia; Phung, Van; Nanda, Vedant; Jouault, Victor; Maladière, Vincent; Richard, Virgile; Bataev, Vladislav; Bouaziz, Wassim; Li, Wen-Ding; Havard, William; Marshall, William; Li, Xinghui; Guo, Xingran; Yang, Xinyu; Neuhaus, Yannic; Ouahidi, Yassine El; Bendou, Yassir; Wang, Yihan; Pan, Yimu; Ramzi, Zaccharie; Xu, Zhenlin
Publication Year: 2026
Collection: ArXiv.org (Cornell University Library)
Subject Terms: Artificial Intelligence
Description: We introduce Voxtral Realtime, a natively streaming automatic speech recognition model that matches offline transcription quality at sub-second latency. Unlike approaches that adapt offline models through chunking or sliding windows, Voxtral Realtime is trained end-to-end for streaming, with explicit alignment between audio and text streams. Our architecture builds on the Delayed Streams Modeling framework, introducing a new causal audio encoder and Ada RMS-Norm for improved delay conditioning. We scale pretraining to a large-scale dataset spanning 13 languages. At a delay of 480ms, Voxtral Realtime achieves performance on par with Whisper, the most widely deployed offline transcription system. We release the model weights under the Apache 2.0 license.
Document Type: text
Language: unknown
Relation: http://arxiv.org/abs/2602.11298
Availability: http://arxiv.org/abs/2602.11298
Accession Number: edsbas.6A80CC99
Database: BASE