Sophie Aubin

ORCID: 0000-0003-4805-8220
Publications
Citations
Views
---
Saved
---
About
Contact & Profiles
Research Areas
  • Semantic Web and Ontologies
  • Biomedical Text Mining and Ontologies
  • Natural Language Processing Techniques
  • Research Data Management Practices
  • Robotics and Automated Systems
  • Context-Aware Activity Recognition Systems
  • Scientific Computing and Data Management
  • Agriculture and Rural Development Research
  • Service-Oriented Architecture and Web Services
  • linguistics and terminology studies
  • French Language Learning Methods
  • Topic Modeling
  • Genomics and Phylogenetic Studies
  • Library Science and Information Systems
  • Advanced Text Analysis Techniques
  • Smart Agriculture and AI
  • Aging, Elder Care, and Social Issues
  • Cooperative Studies and Economics
  • Linguistics and Discourse Analysis
  • Mathematical Control Systems and Analysis
  • Text Readability and Simplification
  • Bioinformatics and Genomic Networks
  • Infant Nutrition and Health
  • Customer churn and segmentation
  • Information Technology and Learning

Institut National de Recherche pour l'Agriculture, l'Alimentation et l'Environnement
2020-2024

Université Paris-Saclay
2023

Centre Hospitalier Universitaire d'Angers
2023

Institut de l'Information Scientifique et Technique
2008-2022

Institut Universitaire de Cardiologie et de Pneumologie de Québec
2022

Mathématiques et Informatique Appliquées du Génome à l'Environnement
2010

Laboratoire d'Informatique de Paris-Nord
2004-2006

Université Sorbonne Paris Nord
2005-2006

Nord University
2005-2006

Centre National de la Recherche Scientifique
2006

Many vocabularies and ontologies are produced to represent annotate agronomic data. However, those spread out, in different formats, of size, with structures from overlapping domains. Therefore, there is need for a common platform receive host them, align enabling their use agro-informatics applications. By reusing the National Center Biomedical Ontologies (NCBO) BioPortal technology, we have designed AgroPortal, an ontology repository agronomy domain. The AgroPortal project re-uses...

10.1016/j.compag.2017.10.012 article EN cc-by-nc-nd Computers and Electronics in Agriculture 2017-12-06

This study aims to examine, through stakeholders consultation, the widely used definitions of four terms related plastics sustainability: ‘bio-based plastics', ‘bioplastics’, ‘biodegradable plastics’ and ‘plastics recycling’ mitigate their potential ambiguity for diverse scientific communities sectors activity. For three ‘biodegradable’ ‘recycling’, consolidated were elaborated based on feedback online survey analysis pro con arguments given by face-to-face interviews with 18 experts...

10.1016/j.envsci.2022.04.011 article EN cc-by Environmental Science & Policy 2022-04-27

We study the adaptation of Link Grammar Parser to biomedical sublanguage with a focus on domain terms not found in general parser lexicon. Using two corpora, we implement and evaluate three approaches addressing unknown words: automatic lexicon expansion, use morphological clues, disambiguation using part-of-speech tagger. each approach separately for its effect parsing performance consider combinations these approaches. In addition 45% increase efficiency, find that best approach,...

10.1186/1471-2105-7-s3-s2 article EN cc-by BMC Bioinformatics 2006-11-01

In this article, we present a joint effort of the wheat research community, along with data and ontology experts, to develop interoperability guidelines. Interoperability is ability two or more systems devices cooperate exchange data, interpret that shared information. growing concern scientific agriculture in general, as need deluge obtained through high-throughput technologies grows. Agreeing on common formats, metadata, vocabulary standards an important step obtain required level order...

10.12688/f1000research.12234.2 preprint EN cc-by F1000Research 2017-12-06

This paper gives an overview of the Caderige project. project involves teams from different areas (biology, machine learning, natural language processing) in order to develop highlevel analysis tools for extracting structured information biological bibliographical databases, especially Medline. The approach and compares it state art.

10.3115/1567594.1567602 article EN 2004-01-01

Making data compliant with the FAIR Data principles (Findable, Accessible, Interoperable, Reusable) is still a challenge for many researchers, who are not sure which criteria should be met first and how. Illustrated experimental tables associated Design of Experiments, we propose an approach that can serve as model research management allows researchers to disseminate their by satisfying main without insurmountable efforts. More importantly, this aims facilitate compliance process providing...

10.1093/gigascience/giaa144 article EN cc-by GigaScience 2020-12-01

<ns3:p>In this article, we present a joint effort of the wheat research community, along with data and ontology experts, to develop interoperability guidelines. Interoperability is ability two or more systems devices cooperate exchange data, interpret that shared information. growing concern scientific agriculture in general, as need deluge obtained through high-throughput technologies grows. Agreeing on common formats, metadata, vocabulary standards an important step obtain required level...

10.12688/f1000research.12234.1 preprint EN cc-by F1000Research 2017-10-16

Web semantic access in specific domains calls for specialized search engines with enhanced querying and indexing capacities, which pertain both to information retrieval (IR) extraction (IE). A rich linguistic analysis is required either identify the relevant units index weight them according statistical distribution, or as basis of an process. Recent developments make Natural Language Processing (NLP) techniques reliable enough process large collections documents enrich annotations. This...

10.48550/arxiv.0706.4375 preprint EN other-oa arXiv (Cornell University) 2007-01-01

The GODAN Action online map of agri-food data standards is a deliverable the project . supports users, producers and intermediaries to effectively engage with open maximise its potential for impact in agriculture nutrition sectors.  In particular, we work strengthen capacity, promote common best practice improve how measure impact. part programme that promotes proactive sharing make information about available, accessible usable.  has been initially funded 3.5 years by UK Department...

10.7490/f1000research.1115260.1 article EN F1000Research 2018-02-12

En l’absence de thésaurus spécialisé dans le domaine l’agroécologie, un groupe métier aux compétences complémentaires (experts scientifiques et spécialistes l’information scientifique technique) a construit « d’agroécologie ». Ce est issu la valorisation l’ensemble des termes capitalisés par dispositif veille territoriale Agroécologie conduit à l’échelle région Midi-Pyrénées sur période 2013–2017. L’ensemble données constitutives ce accessible sous Licence Ouverte format standard. Exposé...

10.1051/cagri/2020004 article FR cc-by-nc Cahiers Agricultures 2020-01-01
Coming Soon ...