DOI: 10.1145/1860559.1860607
poster

Interactive layout analysis and transcription systems for historic handwritten documents

Published: 21 September 2010

Abstract

The number of digitized legacy documents has risen dramatically in recent years, mainly because a growing number of online digital libraries publish such documents, which then wait to be classified and eventually transcribed into an electronic text format (such as ASCII or PDF). However, most available fully automatic applications for this task are far from perfect, and heavy, inefficient human intervention is often required to check and correct their results. In contrast, multimodal interactive-predictive approaches allow users to take part in the process and help the system improve its overall performance. With this in mind, this work introduces two sets of recent advances: a novel interactive method for text block detection, and two multimodal interactive handwritten text transcription systems that use active learning and interactive-predictive technologies in the recognition process.
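
The interactive-predictive idea mentioned in the abstract can be illustrated with a minimal sketch. It is not the authors' system: it assumes a toy recognizer that is just a scored n-best list of word sequences, and a simulated user who corrects the first wrong word of each proposal. The names `best_completion` and `interactive_transcription` are hypothetical and chosen only for this illustration.

```python
from typing import List, Sequence, Tuple

# Assumption: a hypothesis is (score, words), where a higher score is better.
Hypothesis = Tuple[float, List[str]]


def best_completion(nbest: Sequence[Hypothesis], prefix: List[str]) -> List[str]:
    """Return the best-scoring hypothesis that starts with the validated prefix.

    If no hypothesis is consistent with the prefix, fall back to the prefix.
    """
    matching = [h for h in nbest if h[1][:len(prefix)] == prefix]
    if not matching:
        return list(prefix)
    return list(max(matching, key=lambda h: h[0])[1])


def interactive_transcription(
    nbest: Sequence[Hypothesis], reference: List[str]
) -> Tuple[List[str], int]:
    """Simulate an interactive-predictive session against a known reference.

    Returns the final transcription and the number of user corrections.
    """
    prefix: List[str] = []
    corrections = 0
    while True:
        hypothesis = best_completion(nbest, prefix)
        if hypothesis == reference:
            return hypothesis, corrections
        # Simulated user: locate the first word that differs from the reference.
        k = 0
        while (k < len(hypothesis) and k < len(reference)
               and hypothesis[k] == reference[k]):
            k += 1
        corrections += 1
        if k >= len(reference):
            # The proposal only has extra trailing words; the user deletes them.
            return list(reference), corrections
        # The user validates the words before position k and fixes word k;
        # the system then re-predicts the rest given this longer prefix.
        prefix = list(reference[:k + 1])


if __name__ == "__main__":
    nbest = [
        (0.5, "a old letter was written in madrid".split()),
        (0.3, "the old letter was written in march".split()),
        (0.2, "the old letter is written in march".split()),
    ]
    reference = "the old letter was written in march".split()
    print(interactive_transcription(nbest, reference))
```

In this toy session, one correction of the first word lets the system recover the remaining words itself, whereas post-editing the top-scoring hypothesis would need two corrections. A real system would re-run recognition conditioned on the validated prefix rather than filter a fixed list.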

Cited By

  • (2018) An Intelligent User Interface for Efficient Semi-automatic Transcription of Historical Handwritten Documents. Companion Proceedings of the 23rd International Conference on Intelligent User Interfaces, pages 1-2. DOI: 10.1145/3180308.3180357. Online publication date: 5-Mar-2018
  • (2012) Natural Language Processing for Historical Texts. Synthesis Lectures on Human Language Technologies, 5:2, pages 1-157. DOI: 10.2200/S00436ED1V01Y201207HLT017. Online publication date: 24-Sep-2012

Published In

DocEng '10: Proceedings of the 10th ACM symposium on Document engineering
September 2010
298 pages
ISBN:9781450302319
DOI:10.1145/1860559

Publisher

Association for Computing Machinery

New York, NY, United States

Author Tags

  1. handwriting recognition
  2. interactive layout analysis
  3. interactive predictive processing
  4. partial supervision

Qualifiers

  • Poster

Conference

DocEng2010: ACM Symposium on Document Engineering
September 21 - 24, 2010
Manchester, United Kingdom

Acceptance Rates

DocEng '10 Paper Acceptance Rate 13 of 42 submissions, 31%;
Overall Acceptance Rate 194 of 564 submissions, 34%
