Montserrat Marimon

ORCID: 0000-0002-0211-8681
Publications
Citations
Views
---
Saved
---
About
Contact & Profiles
Research Areas
  • Natural Language Processing Techniques
  • Topic Modeling
  • Speech and dialogue systems
  • Semantic Web and Ontologies
  • Biomedical Text Mining and Ontologies
  • Text Readability and Simplification
  • Syntax, Semantics, Linguistic Variation
  • Gender and Feminist Studies
  • Social Sciences and Policies
  • Spanish Linguistics and Language Studies
  • Cancer, Hypoxia, and Metabolism
  • Advanced Software Engineering Methodologies
  • Occupational Health and Safety in Workplaces
  • Ethics in Business and Education
  • Educational theories and practices
  • Translation Studies and Practices
  • Ethics and bioethics in healthcare
  • Physical Education and Sports Studies
  • Advanced Text Analysis Techniques
  • Model-Driven Software Engineering Techniques
  • linguistics and terminology studies
  • Wikis in Education and Collaboration
  • Arts and Performance Studies
  • Silkworms and Sericulture Research
  • Geography and Education Methods

Universitat Politècnica de Catalunya
2018-2023

Barcelona Supercomputing Center
2018-2023

Hospital General de Catalunya
2022

Universitat Pompeu Fabra
2007-2022

Spanish National Cancer Research Centre
2019

Chartered Institute of Management Accountants
2019

Universidad de Navarra
2019

Universitat de Barcelona
1998-2018

University of Stuttgart
2017

University of Groningen
2017

One of the biomedical entity types relevance for medicine or biosciences are chemical compounds and drugs. The correct detection these entities is critical other text mining applications building on them, such as adverse drug-reaction detection, medication-related fake news drug-target extraction. Although a significant effort was made to detect mentions drugs/chemicals in English texts, so far only very limited attempts were recognize them medical documents languages. Taking into account...

10.18653/v1/d19-5701 article EN cc-by 2019-01-01

This work introduces Salamandra, a suite of open-source decoder-only large language models available in three different sizes: 2, 7, and 40 billion parameters. The were trained from scratch on highly multilingual data that comprises text 35 European languages code. Our carefully curated corpus is made exclusively open-access compiled wide variety sources. Along with the base models, supplementary checkpoints fine-tuned public-domain instruction are also released for chat applications....

10.48550/arxiv.2502.08489 preprint EN arXiv (Cornell University) 2025-02-12

Purpose represents a unique opportunity for identifying and analyzing the complexity of human reasoning, considering that its constitution brings together cognitive, affective social elements. In this article, we use Theory Organizing Models Thinking (OMT), an epistemological methodological approach based on developmental psychologist Jean Piaget's work, to present different perspective how analyze youth purpose explain cognitive-emotional dynamics reasoning in everyday thinking. We...

10.1080/03057240.2017.1345725 article EN Journal of Moral Education 2017-07-03

This paper presents the IULA Spanish Clinical Record Corpus, a corpus of 3,194 sentences extracted from anonymized clinical records and manually annotated with negation markers their scope. The was conceived as resource to support text-mining systems, but it is also useful for other Natural Language Processing systems handling texts: automatic encoding records, diagnosis support, term extraction, among others, well study texts. publicly available CC-BY-SA 3.0 license.

10.18653/v1/w17-1807 article EN cc-by 2017-01-01

Automatically detecting mentions of pharmaceutical drugs and chemical substances is key for the subsequent extraction relations chemicals with other biomedical entities such as genes, proteins, diseases, adverse reactions or symptoms. The identification drug also a prior step complex event types dosage recognition, duration medical treatments repurposing. Formally, this task known named entity recognition (NER), meaning automatically identifying predefined interest in running text. In domain...

10.5808/gi.2019.17.2.e15 article EN Genomics & Informatics 2019-06-28

Nowadays, vast amounts of multimedia content are being produced, archived, and digitized, resulting in great troves data interest. Examples include user-generated content, such as images, videos, text, audio posted by users on social media wikis, or provided through official publishers distributors, digital libraries, organizations, online museums. This can serve a valuable source inspiration to the creative industries, architecture gaming, produce new innovative assets enhance (re-)use...

10.1109/jsyst.2022.3217655 article EN IEEE Systems Journal 2022-11-10

The work we present here is concerned with the acquisition of deep grammatical information for nouns in Spanish. aim to build a learner that can handle noise, but, more interestingly, able overcome problem sparse data, especially important case nouns. We have based our on two main points. Firstly, used distributional evidences as features. Secondly, made deal all occurrences word single complex unit. obtained results show features level generalization be successfully approached Decision Tree learner.

10.3115/1614108.1614110 article EN 2007-01-01

This paper describes work on the development of an open-source HPSG grammar for Spanish implemented within LKB system. Following a brief description main features grammar, we present our approach pre-processing and ongoing research automatic lexical acquisition.

10.3115/1608912.1608929 article EN 2007-01-01

10.1007/s10579-012-9199-7 article EN Language Resources and Evaluation 2012-09-21

This article presents an ensemble parse approach to detecting and selecting high-quality linguistic analyses output by a hand-crafted HPSG grammar of Spanish implemented in the LKB system. The uses full agreement (i.e., exact syntactic match) along with MaxEnt selection model statistical dependency parser trained on same data. ultimate goal is develop hybrid corpus annotation methodology that combines fully automatic manual selection, order make task more efficient while maintaining high...

10.1162/coli_a_00190 article EN Computational Linguistics 2014-03-28

En los trabajos de Carol Gilligan aparece la ética del cuidad y responsabilidad que tiene en cuenta aspectos diferenciales las necesidades particulares personas. Esta se contrapone a justicia descrita por Kholberg, cuyas características son el principio igualdad no consideración específicas cada ser humano. Ambas éticas parecen, sin embargo, complementarias. El trabajo presenta incluye ambos enfoques éticos, una situación experimental cual pide sujetos diferentes edades (desde 6 años hasta...

10.1590/s1517-97022000000200009 article ES Educação e Pesquisa 2000-12-01

Two didactic approaches are considered in this paper: the direct transmission of knowledge and constructivist model, with an analysis conceptions they imply on intelligence how it works. The contribution that evolution science itself may bring to process individual learning is autlined.

10.5565/rev/ensciencias.5192 article EN cc-by Enseñanza de las Ciencias Revista de investigación y experiencias didácticas 2006-10-25

El text es basa en l’anàlisi del raonament moral a partir marc dels models organitzadors. A de la determinació les diferents representacions que una sèrie subjectes va realitzar sobre un dilema basat conflicte entre iguals d’edat, i incloure tres nuclis (justícia, felicitat sol·licitud) detectar existeixen quatre grans organitzadors presenten línies evolutives. La tesi defensa és inclusió sentiments pot guiar el nivells més elevats integrar perspectives justícia solidaritat l’hora resoldre...

10.5565/rev/educar.353 article CA cc-by-nc Educar 1998-02-01
Coming Soon ...