- Semantic Web and Ontologies
- Natural Language Processing Techniques
- linguistics and terminology studies
- Service-Oriented Architecture and Web Services
- Biomedical Text Mining and Ontologies
- Artificial Intelligence in Law
- Topic Modeling
- Library Science and Information Systems
- Spanish Linguistics and Language Studies
- Data Quality and Management
- Literary and Cultural Studies
- Law, logistics, and international trade
- Interpreting and Communication in Healthcare
- Business Process Modeling and Analysis
- Language, Metaphor, and Cognition
- Advanced Database Systems and Queries
- International Labor and Employment Law
- Cultural and political discourse analysis
- European Criminal Justice and Data Protection
- Digital Economy and Work Transformation
- Discrimination and Equality Law
- Translation Studies and Practices
- Lexicography and Language Studies
- Wildlife Ecology and Conservation
- Bat Biology and Ecology Studies
Universidad Politécnica de Madrid
2013-2024
University of Coimbra
2017
Universitat Pompeu Fabra
2016-2017
Thomson Reuters (United States)
2017
Bridge University
2017
University of Cambridge
2017
National Student Clearinghouse Research Center
2016
Universitat Politècnica de Catalunya
2016
Next Interactions (United States)
2016
Hamad bin Khalifa University
2016
Abstract This paper presents a novel approach to ontology localization with the objective of obtaining multilingual ontologies. Within development process, has been defined as activity adapting an concrete linguistic and cultural community. Depending on layers – terminological and/or conceptual involved in activity, three heterogeneous metamodels have identified, which we propose one them. Our proposal consists associating metamodel external model for representing structuring lexical data...
Legal texts, characterized by complex and specialized terminology, present a significant challenge for Language Models. Adding an underrepresented language, such as Spanish, to the mix makes it even more challenging. While pre-trained models like XLM-RoBERTa have shown capabilities in handling multilingual corpora, their performance on domain specific documents remains underexplored. This paper presents development evaluation of MEL, legal language model based XLM-RoBERTa-large, fine-tuned...
Legal corpora for Natural Language Processing (NLP) are valuable and scarce resources in languages like Spanish due to two main reasons: data accessibility legal expert knowledge availability. INESData 2024 is a European Union funded project lead by the Universidad Polit\'ecnica de Madrid (UPM) developed Instituto Ingenier\'ia del Conocimiento (IIC) create series of state-of-the-art NLP applied legal/administrative domain Spanish. The goal this paper present Corpus Contract Clauses (3CEL),...
International organizations (e.g., FAO, WHO, etc.) are increasingly expressing the need for multilingual ontologies different purposes, e.g., ontology-based machine translation, information retrieval. However, most of built so far have mainly English or another natural language as basis. Since ontology building is a very expensive and time-consuming undertaking, we propose methods guiding users in localization ontologies, provide tools supporting process. The main contributions this paper...
We revisit the notion of ontology localization, propose a new definition and clearly specify layers an that can be affected by process localizing it. also work out number dimensions allow to characterize type localization performed predict will affected. Overall our aim is contribute better understanding task ontology.
In this article, we argue that there is a growing number of linked datasets in different natural languages, and need for guidelines mechanisms to ensure the quality organic growth emerging multilingual data network. However, have little knowledge regarding actual state network, its current practices, open challenges it poses. Questions distribution links are established across or how linguistic features represented, remain mostly unanswered. Addressing these other language-related issues can...
The EU-funded project Lynx focuses on the creation of a knowledge graph for legal domain (Legal Knowledge Graph, LKG) and its use semantic processing, analysis enrichment documents from domain. This article describes cases covered in project, entire developed platform services that operate documents.
Abstract As the interest of Semantic Web and computational linguistics communities in linguistic linked data (LLD) keeps increasing number contributions that dwell on LLD rapidly grows, scholars (and linguists particular) interested development resources sometimes find it difficult to determine which mechanism is suitable for their needs challenges have already been addressed. This review seeks present state art models, ontologies extensions represent language as by focusing nature content...