- Handwritten Text Recognition Techniques
- Natural Language Processing Techniques
- Image Processing and 3D Reconstruction
- Topic Modeling
- Image Retrieval and Classification Techniques
- Hand Gesture Recognition Systems
- Music and Audio Processing
- Algorithms and Data Compression
- Advanced Image and Video Retrieval Techniques
- Vehicle License Plate Recognition
- Speech and dialogue systems
- Interactive and Immersive Displays
- Video Analysis and Summarization
- Digital Humanities and Scholarship
- Digital Media Forensic Detection
- semigroups and automata theory
- Machine Learning in Healthcare
- Speech Recognition and Synthesis
- E-Learning and Knowledge Management
- Machine Learning and Algorithms
- Usability and User Interface Design
- Mobile Crowdsensing and Crowdsourcing
- Innovative Human-Technology Interaction
- DNA and Biological Computing
- Persona Design and Applications
Universitat de València
2021-2024
Universitat Politècnica de València
2012-2021
Salisbury University
2018
Télécom Paris
2018
John Wiley & Sons (United States)
2018
Hudson Institute
2018
University of Alberta
2018
University of New Brunswick
2018
Maritime Administration of Latvia
2018
Dalhousie University
2018
This paper describes the Handwritten Text Recognition (HTR) competition on READ dataset that has been held in context of International Conference Frontiers Handwriting 2016. aims to bring together researchers working off-line HTR and provide them a suitable benchmark compare their techniques task transcribing typical historical handwritten documents. Two tracks with different conditions use training data were proposed. Ten research groups registered but finally five submitted results. The...
Purpose An overview of the current use handwritten text recognition (HTR) on archival manuscript material, as provided by EU H2020 funded Transkribus platform. It explains HTR, demonstrates , gives examples cases, highlights affect HTR may have scholarship, and evidences this turning point advanced digitised heritage content. The paper aims to discuss these issues. Design/methodology/approach This adopts a case study approach, using development delivery one openly available platform for...
A contest on Handwritten Text Recognition organised in the context of ICFHR 2014 conference is described. Two tracks with increased freedom use training data were proposed and three research groups participated these two tracks. The handwritten images for this drawn from an English set which currently being considered Tran scriptorium project. goal project to develop innovative, efficient cost-effective solutions transcription historical document images, focusing four languages: English,...
This paper describes the fourth edition of Handwritten Text Recognition (HTR) competition that was prepared this time in context International Conference on Document Analysis and (ICDAR) 2017. Previous editions were conducted, first, with datasets from tranScriptorium project ICFHR 2014, ICDAR 2015, then, "Recognition Enrichment Archival Documents (READ)" European 2016. aims to bring together researchers working off-line HTR provides them a suitable benchmark compare their techniques task...
Named Entity Recoginiton (NER) consists of tagging parts an unstructured text containing particular semantic information. When applied to handwritten documents, it is possible do as a two-step approach in which Handwritten Text Recognition (HTR) performed prior the automatic transcription. However, also both tasks simultaneously by using HTR model that learns output transcription and symbols. In this paper, we focus on improving one-step introducing auxiliary task predicting Pyramidal...
Tran Scriptorium is a 3-years project that aims to develop innovative, cost-effective solutions for the indexing, search and full transcription of historical handwritten document images, using Handwritten Text Recognition (HTR) technology. The production ground-truth (GT) dataset images among first tasks. We address novel approaches faster this GT based on crowd-sourcing prior-knowledge methods. also here low-cost semi-supervised procedure obtaining pairs correct line-level aligned...
The extraction of relevant information from historical handwritten document collections is one the key steps in order to make these manuscripts available for access and searches. In this competition, goal detect named entities assign each them a semantic category, therefore, simulate filling knowledge database. This paper describes dataset, tasks, evaluation metrics, participants methods results.
This paper describes the second edition of Handwritten Text Recognition (HTR) contest on tranScriptorium datasets that has been held in context International Conference Document Analysis and 2015. Two tracks with different conditions use training data were proposed. Nine research groups registered but finally three submitted results. The handwritten images for this drawn from English "Bentham collection" dataset used project. A small subset collection chosen present HTR competition. selected...
Abstract The proper use of protective equipment is very important to avoid fatalities. One sector in which this has a great impact that construction sites, where large number workers die each year. In as others, employers are responsible for providing their employees with equipment. addition, must monitor and ensure its correct use. These tasks usually performed using manual procedures. Existing tools automate process unreliable present scalability issues. paper, we research the benefits...
To date, automatic handwriting recognition systems are far from being perfect and often they need a post editing where human intervention is required to check correct the results of such systems. We propose have new interactive, on-line framework which, rather than full automation, aims at assisting in proper recognition- transcription process; that is, facilitate speed up their task handwritten texts. This combines efficiency with accuracy transcriptor. The best result cost-effective text images.
Transcription of historical handwritten documents is a crucial problem for making easier the access to these general public. Currently, huge amount are being made available by on-line portals worldwide. It not realistic obtain transcription manually, and therefore automatic techniques has be used. tranScriptorium project that aims at researching on modern Handwritten Text Recognition (HTR) technology transcribing documents. The HTR used in based models learnt automatically from examples....
Text line segmentation is the process by which text lines in a document image are localized and extracted. It an important step off-line Handwritten Recognition (HTR) given that input of these systems to be transcribed. A myriad solutions problem have been proposed literature. Although may differ greatly on what actually applied perform segmentation, they can classified level precision detail final extracted lines. In this paper we study influence real needs different levels HTR task. We...
This paper introduces a new corpus of multilingual medieval handwritten charter images, annotated with full transcription and named entities. The is used to compare two approaches for entity recognition in historical document images several languages: on the one hand, sequential approach, more commonly used, that sequentially applies text (HTR) (NER), other combined approach simultaneously transcribes image line extracts Experiments conducted Latin, early high German old Czech name, date...
The main aim of the Carabela project was to develop and apply techniques that allow textual searching on massive Spanish collections 15th-19th century manuscripts. focused a relatively small subset 125 000 images interest underwater archaeology. For this type manuscripts, state-of-the-art automatic transcription techniques, generally fail achieve usable accuracy. Therefore, rather than insisting in actual transcription, methodologies for probabilistic indexing handwritten text have been...
Textual access to large collections of digitized images remains unfeasible because usually they lack transcripts. Transcribing such is in turn typically unattainable terms costs. However, the use probabilistic indices can facilitate textual accessing with only moderate demands resources. Besides allowing effortless information retrieval, it will be shown that also used estimate features indexed but otherwise untranscribed collections, as running words and Zipf's curves. Complete have been...
This document introduces a web based demo of an interactive framework for transcription handwritten text, where the user feedback is provided by means pen strokes on touchscreen. Here, automatic handwriting text recognition system and both cooperate to generate final transcription.