派博傳思國際中心

Title: Titlebook: Document Analysis and Recognition – ICDAR 2024 Workshops; Athens, Greece, Augu Harold Mouchère, Anna Zhu Conference proceedings 2024 The Edi

Author: postpartum    Time: 2025-3-21 18:51
Document Analysis and Recognition – ICDAR 2024 Workshops: Impact Factor (Influence)
Document Analysis and Recognition – ICDAR 2024 Workshops: Impact Factor (Influence), Subject Ranking
Document Analysis and Recognition – ICDAR 2024 Workshops: Online Visibility
Document Analysis and Recognition – ICDAR 2024 Workshops: Online Visibility, Subject Ranking
Document Analysis and Recognition – ICDAR 2024 Workshops: Citation Count
Document Analysis and Recognition – ICDAR 2024 Workshops: Citation Count, Subject Ranking
Document Analysis and Recognition – ICDAR 2024 Workshops: Annual Citations
Document Analysis and Recognition – ICDAR 2024 Workshops: Annual Citations, Subject Ranking
Document Analysis and Recognition – ICDAR 2024 Workshops: Reader Feedback
Document Analysis and Recognition – ICDAR 2024 Workshops: Reader Feedback, Subject Ranking

Author: 基因組    Time: 2025-3-21 21:01
TrOCR Meets Language Models: An End-to-End Post-correction Approach
model with a language model (LM) that serves as a corrector. This integration addresses three principal challenges: over-correction, which compromises text authenticity; poor domain adaptation; and the scarcity of annotated images. We explore the synergy between TrOCR, a state-of-the-art OCR model,
Author: mettlesome    Time: 2025-3-22 02:17
: Domain Adaptive Document Restoration with a Layer Separation Approach
aditional methods often falter with variable document types, leading to poor performance. To overcome these limitations, this paper introduces a text-graphic layer separation approach that enhances domain adaptability in document image restoration (DIR) systems. We propose ., which utilizes two laye
Author: anticipate    Time: 2025-3-22 05:36
Normalized vs Diplomatic Annotation: A Case Study of Automatic Information Extraction from Handwritt
ndwritten in Spanish. We investigate two annotation strategies for automatically transcribing handwritten documents, fine-tuning DAN with minimal training data and annotation effort. Experiments were conducted on two datasets containing the same images (201 scans of birth certificates written by mor
Author: 分離    Time: 2025-3-22 09:41
Diminutives in Political Discourse – The Case of Serbian and Slovenian
his paper draws comparisons between the use of diminutives in the ParlaMint-RS 4.0 (Serbian parliament) and ParlaMint-SI 4.0 (Slovenian parliament) corpora [.]. Our findings reveal a distinctive pattern within political discussions: the employment of diminutives, particularly when referring to entit
Author: 戰勝    Time: 2025-3-22 14:55

Author: 戰勝    Time: 2025-3-22 20:57

Author: 把…比做    Time: 2025-3-22 23:16

Author: adjacent    Time: 2025-3-23 02:37
Retrieving and Analyzing Translations of American Newspaper Comics with Visual Evidence
n comics’ text features, thereby largely ignoring comics’ heavily visual dimension. Image classification applications for comics focus primarily on genre and artist attribution. This paper bridges the gap between these areas by investigating image classification model accuracy for identifying transl
Author: 狗窩    Time: 2025-3-23 07:17

Author: carbohydrate    Time: 2025-3-23 12:21
Comics Datasets Framework: Mix of Comics Datasets for Detection Benchmarking
arch on comics has evolved from basic object detection to more sophisticated tasks. However, the field faces persistent challenges such as small datasets, inconsistent annotations, inaccessible model weights, and results that cannot be directly compared due to varying train/test splits and metrics.
Author: Pigeon    Time: 2025-3-23 17:03
A Comprehensive Gold Standard and Benchmark for Comics Text Detection and Recognition
from comic books. To do this, we developed a pipeline for OCR processing and labeling of comic books and created the first text detection and recognition datasets for Western comics, called . and .. We evaluated the performance of fine-tuned state-of-the-art text detection and recognition models on
Author: stroke    Time: 2025-3-23 21:13
Toward Accessible Comics for Blind and Low Vision Readers
ext description of the full story, ready to be forwarded to off-the-shelf speech synthesis tools. We propose to use existing computer vision and optical character recognition techniques to build a grounded context from the comic strip image content, such as panels, characters, text, reading order a
Author: 和音    Time: 2025-3-24 01:20

Author: Colonoscopy    Time: 2025-3-24 03:40
Spatially Augmented Speech Bubble to Character Association via Comic Multi-task Learning
g increased attention as it enhances the accessibility and analyzability of this rapidly growing medium. Current methods often struggle with the complex spatial relationships within comic panels, which lead to inconsistent associations. To address these shortcomings, we developed a robust machine le
Author: Solace    Time: 2025-3-24 08:37

Author: 痛打    Time: 2025-3-24 14:30

Author: Relinquish    Time: 2025-3-24 17:34

Author: 本能    Time: 2025-3-24 21:05
Author: 閑逛    Time: 2025-3-25 00:52

Author: 輕觸    Time: 2025-3-25 04:32

Author: 完成才會征服    Time: 2025-3-25 09:08

Author: 嫻熟    Time: 2025-3-25 13:58
TrOCR Meets Language Models: An End-to-End Post-correction Approach
ances visual and linguistic information, preserving the authenticity of the original texts. Furthermore, the model is able to adapt to historical data even when the recogniser is trained solely on contemporary data, mitigating the need for a large number of annotated historical handwritten images.
Author: Costume    Time: 2025-3-25 17:50
: Domain Adaptive Document Restoration with a Layer Separation Approach
qualitatively and quantitatively using a new real-world dataset, ., developed for this study. Initially trained on a synthetically generated dataset, our model demonstrates strong generalization capabilities for the DIR task, offering a promising solution for handling variability in real-world data. Our code is accessible on this GitHub(.).
Author: prosthesis    Time: 2025-3-25 20:12
Investigating Neural Networks and Transformer Models for Enhanced Comic Decoding
(eBDtheque, DCM772, Manga109) and using different metrics (Precision, Recall, Average Precision), we conclude that pre-trained self-supervised transformer models can competently outperform state-of-the-art approaches, which often require further fine-tuning to achieve comparable results.
Author: 即席    Time: 2025-3-26 01:09

Author: 不可接觸    Time: 2025-3-26 04:28
Normalized vs Diplomatic Annotation: A Case Study of Automatic Information Extraction from Handwritt
e than 15 different writers) but with different annotation methods. Our findings indicate that normalized annotation is more effective for fields that can be standardized, such as dates and places of birth, whereas diplomatic annotation performs much better for fields containing names and surnames, which cannot be standardized.
Author: Itinerant    Time: 2025-3-26 09:31

Author: Infinitesimal    Time: 2025-3-26 15:23
Author: 碳水化合物    Time: 2025-3-26 17:50
Author: 真    Time: 2025-3-26 23:42

Author: genuine    Time: 2025-3-27 01:35
Open Parliamentary Data as a Tool for Linguistic Research: Exploring the ‘Greek Language Question’ i
ion process is provided, in order to render the materials in question machine-readable, while in the second part the potential for linguistic research is highlighted, through a case-study exploring aspects of the ‘Greek language question’, as discussed in the parliamentary context, within the wider framework of language policy making.
Author: 縮減了    Time: 2025-3-27 07:36
Toward Accessible Comics for Blind and Low Vision Readers
on including character’s appearance, posture, mood, dialogues etc. We believe that such enriched content description can be easily used to produce audiobook and eBook with various voices for characters, captions and playing sound effects.
Author: Diuretic    Time: 2025-3-27 11:08

Author: 微枝末節    Time: 2025-3-27 14:54
Conference proceedings 2024
ment Analysis and Recognition, ICDAR 2024, held in Athens, Greece, during August 30–31, 2024. A total of 30 regular papers presented in these proceedings were carefully selected from 46 submissions. Part I contains 16 regular papers that stem from the following workshops: ICDAR 2024 Workshop on A
Author: ventilate    Time: 2025-3-27 21:34

Author: Pelvic-Floor    Time: 2025-3-28 00:23
978-3-031-70644-8
The Editor(s) (if applicable) and The Author(s), under exclusive license to Springer Nature Switzerl
Author: 漫不經心    Time: 2025-3-28 04:32
Lecture Notes in Computer Science
http://image.papertrans.cn/e/image/284817.jpg
Author: MEEK    Time: 2025-3-28 07:31

Author: 細胞學    Time: 2025-3-28 12:15
s on a digital pen equipped with kinematic sensors, allowing users to write on any surface while simultaneously preserving a digital trajectory of handwriting. This technology holds significant potential as a valuable educational tool, particularly in classrooms where it can facilitate the process o
Author: 阻止    Time: 2025-3-28 17:48

Author: dithiolethione    Time: 2025-3-28 20:56
Author: 笨拙處理    Time: 2025-3-29 01:25

Author: 除草劑    Time: 2025-3-29 03:27

Author: 顛簸下上    Time: 2025-3-29 10:51

Author: 平躺    Time: 2025-3-29 12:24
nctioning as dynamic information and knowledge hubs, through the production, management and availability of open diachronic and synchronic data; in this light, this paper presents the Hellenic Parliament Library experience, focusing on a subcategory of historical parliamentary data, included in the
Author: 整理    Time: 2025-3-29 19:03
–1977) in the Hellenic Parliament. A collaborative pilot project involving parliament, academia, and a research center facilitated the conversion of printed material to open data. The main tasks of the project include capturing digital images, a custom Optical Character Recognition (OCR) software so
Author: Pulmonary-Veins    Time: 2025-3-29 20:28

Author: 不給啤    Time: 2025-3-30 01:28
of visual storytelling across decades. Comic image segmentation is a pivotal aspect in the digital transformation of comics. Leveraging heuristic approaches, neural network-based model (YOLO), and innovative transformer-based architectures (GroundingDINO, SAM), our research aims to autonomously seg
Author: 有權    Time: 2025-3-30 04:02
Author: nonchalance    Time: 2025-3-30 11:27
Author: 斥責    Time: 2025-3-30 15:15
Author: allude    Time: 2025-3-30 17:44

Author: 或者發神韻    Time: 2025-3-30 22:59




