Katalog Plus
Bibliothek der Frankfurt UAS
Bald neuer Katalog: sichern Sie sich schon vorab Ihre persönlichen Merklisten im Nutzerkonto: Anleitung.
Dieses Ergebnis aus BASE kann Gästen nicht angezeigt werden.  Login für vollen Zugriff.

Fast Scene Text Detection with RT-LoG Operator and CNN

Title: Fast Scene Text Detection with RT-LoG Operator and CNN
Authors: Nguyen, Dinh Cong; Delalandre, Mathieu; Conte, Donatello; Pham, The Anh
Contributors: Laboratoire d'Informatique Fondamentale et Appliquée de Tours (LIFAT); Université de Tours (UT)-Institut National des Sciences Appliquées - Centre Val de Loire (INSA CVL); Institut National des Sciences Appliquées (INSA)-Institut National des Sciences Appliquées (INSA); Hong Duc University
Source: 15th International Conference on Computer Vision Theory and Applications; https://hal.science/hal-02867205; 15th International Conference on Computer Vision Theory and Applications, Feb 2020, Valletta, Malta
Publisher Information: CCSD
Publication Year: 2020
Collection: Université François-Rabelais de Tours: HAL
Subject Terms: character pattern; RT-LoG; keypoint grouping; Scene text detection; [INFO.INFO-TI]Computer Science [cs]/Image Processing [eess.IV]; [INFO.INFO-CV]Computer Science [cs]/Computer Vision and Pattern Recognition [cs.CV]; [SCCO.COMP]Cognitive science/Computer science; [INFO.INFO-TS]Computer Science [cs]/Signal and Image Processing
Subject Geographic: Valletta; Malta
Description: International audience ; Text detection in scene images is of particular importance for the computer-based applications. The text detection methods must be robust against variabilities and deformations of text entities. In addition, to be embedded into mobile devices, the methods have to be time efficient. In this paper, the keypoint grouping method is proposed by first applying the real-time Laplacian of Gaussian operator (RT-LoG) to detect keypoints. These keypoints will be grouped to produce the character patterns. The patterns will be filtered out by using a CNN model before aggregating into words. Performance evaluation is discussed on the ICDAR2017 RRC-MLT and the Challenge 4 of ICDAR2015 datasets. The results are given in terms of detection accuracy and time processing against different end-to-end systems in the literature. Our system performs as one of the strongest detection accuracy while supporting at approximately 15.6 frames per second to the HD resolution on a regular CPU architecture. It is one of the best candidates to guarantee the trade-off between accuracy and speed in the literature.
Document Type: conference object
Language: English
Availability: https://hal.science/hal-02867205; https://hal.science/hal-02867205v1/document; https://hal.science/hal-02867205v1/file/2020VISAPP_RTLog.pdf
Rights: info:eu-repo/semantics/OpenAccess
Accession Number: edsbas.7FD4DDD2
Database: BASE