Katalog Plus
Bibliothek der Frankfurt UAS
Bald neuer Katalog: sichern Sie sich schon vorab Ihre persönlichen Merklisten im Nutzerkonto: Anleitung.
Dieses Ergebnis aus BASE kann Gästen nicht angezeigt werden.  Login für vollen Zugriff.

The manga whisperer: automatically generating transcriptions for comics

Title: The manga whisperer: automatically generating transcriptions for comics
Authors: Sachdeva, R; Zisserman, A
Publisher Information: IEEE
Publication Year: 2024
Collection: Oxford University Research Archive (ORA)
Description: In the past few decades, Japanese comics, commonly referred to as Manga, have transcended both cultural and linguistic boundaries to become a true worldwide sensation. Yet, the inherent reliance on visual cues and illustration within manga renders it largely inaccessible to individuals with visual impairments. In this work, we seek to address this substantial barrier, with the aim of ensuring that manga can be appreciated and actively engaged by everyone. Specifically, we tackle the problem of diarisation i.e. generating a transcription of who said what and when, in a fully automatic way. To this end, we make the following contributions: (1) we present a unified model, Magi, that is able to (a) detect panels, text boxes and character boxes, (b) cluster characters by identity (without knowing the number of clusters apriori), and (c) associate dialogues to their speakers; (2) we propose a novel approach that is able to sort the detected text boxes in their reading order and generate a dialogue transcript; (3) we annotate an evaluation benchmark for this task using publicly available [English] manga pages. The code, evaluation datasets and the pretrained model can be found at: https://github. com/ragavsachdeva/magi.
Document Type: conference object
Language: English
Relation: https://doi.org/10.1109/CVPR52733.2024.01232
DOI: 10.1109/CVPR52733.2024.01232
Availability: https://doi.org/10.1109/CVPR52733.2024.01232; https://ora.ox.ac.uk/objects/uuid:c8f9a65e-0853-40ce-b1b5-81ea69f8b849
Rights: info:eu-repo/semantics/openAccess
Accession Number: edsbas.495327A9
Database: BASE