Katalog Plus
Bibliothek der Frankfurt UAS
Bald neuer Katalog: sichern Sie sich schon vorab Ihre persönlichen Merklisten im Nutzerkonto: Anleitung.
Dieses Ergebnis aus BASE kann Gästen nicht angezeigt werden.  Login für vollen Zugriff.

Tails tell tales: chapter-wide manga transcriptions with character names

Title: Tails tell tales: chapter-wide manga transcriptions with character names
Authors: Sachdeva, R; Shin, G; Zisserman, A
Publisher Information: Springer
Publication Year: 2025
Collection: Oxford University Research Archive (ORA)
Description: Enabling engagement of manga by visually impaired individuals presents a significant challenge due to its inherently visual nature. With the goal of fostering accessibility, this paper aims to generate a dialogue transcript of a complete manga chapter, entirely automatically, with a particular emphasis on ensuring narrative consistency. This entails identifying (i) what is being said, i.e., detecting the texts on each page and classifying them into essential vs non-essential, and (ii) who is saying it, i.e., attributing each dialogue to its speaker, while ensuring the same characters are named consistently throughout the chapter. To this end, we introduce: (i) Magiv2, a model that is capable of generating high-quality chapter-wide manga transcripts with named characters and significantly higher precision in speaker diarisation over prior works; (ii) an extension of the PopManga evaluation dataset, which now includes annotations for speech-bubble tail boxes, associations of text to corresponding tails, classifications of text as essential or non-essential, and the identity for each character box; and (iii) a new character bank dataset, which comprises over 11K characters from 76 manga series, featuring 11.5K exemplar character images in total, as well as a list of chapters in which they appear. The code, trained model, and both datasets can be found at: https://github.com/ragavsachdeva/magi
Document Type: conference object
Language: English
DOI: 10.1007/978-981-96-0908-6_4
Availability: https://doi.org/10.1007/978-981-96-0908-6_4; https://ora.ox.ac.uk/objects/uuid:fe1c6116-d79d-45ea-bda1-29f7624a867a
Rights: info:eu-repo/semantics/openAccess
Accession Number: edsbas.2FE5E79
Database: BASE