Mathieu Roche

ORCID: 0000-0003-3272-8568
Research Areas
  • Natural Language Processing Techniques
  • Advanced Text Analysis Techniques
  • Semantic Web and Ontologies
  • Topic Modeling
  • Biomedical Text Mining and Ontologies
  • Data-Driven Disease Surveillance
  • Web Data Mining and Analysis
  • Geographic Information Systems Studies
  • Sentiment Analysis and Opinion Mining
  • Linguistics and Discourse Analysis
  • Text and Document Classification Technologies
  • Data Management and Algorithms
  • Animal Disease Management and Epidemiology
  • Digital Communication and Language
  • Data Mining Algorithms and Applications
  • Linguistics and Terminology Studies
  • Lexicography and Language Studies
  • Data Quality and Management
  • Zoonotic Diseases and Public Health
  • Rough Sets and Fuzzy Logic
  • Agriculture and Rural Development Research
  • Service-Oriented Architecture and Web Services
  • Linguistics, Language Diversity, and Identity
  • French Language Learning Methods
  • Misinformation and Its Impacts

Territoires
2016-2025

Forests and Societies
2016-2025

Territoires, Environnement, Télédétection et Information Spatiale
2016-2025

Centre de Coopération Internationale en Recherche Agronomique pour le Développement
2016-2025

Université de Montpellier
2016-2025

Institut National de Recherche pour l'Agriculture, l'Alimentation et l'Environnement
2020-2025

Centre National de la Recherche Scientifique
2015-2025

AgroParisTech
2015-2025

Centre Hospitalier Universitaire de Montpellier
2023-2024

Animal, Santé, Territoires, Risques et Ecosystèmes
2021-2023

The growing popularity of Web 2.0 provides an increasing number of documents expressing opinions on different topics. Recently, new research approaches have been defined in order to automatically extract such opinions from the Internet. They usually consider opinions to be expressed through adjectives, and make extensive use of either general dictionaries or experts to provide relevant adjectives. Unfortunately, these approaches suffer from the following drawback: in a specific domain, a given adjective may either not exist or have a different meaning than in another domain....

10.1145/1456223.1456269 article EN 2008-01-01

10.1016/j.compag.2019.104864 article EN Computers and Electronics in Agriculture 2019-07-04

Since 2013, the French Animal Health Epidemic Intelligence System (in French: Veille Sanitaire Internationale, VSI) has been monitoring signals of the emergence of new and exotic animal infectious diseases worldwide. Once signals are detected, the VSI team verifies them and issues early warning reports to health authorities when potential threats to France are detected. To improve detection from online news sources, we designed the Platform for Automated extraction of Disease Information from the web (PADI-web). PADI-web automatically...

10.1371/journal.pone.0199960 article EN cc-by PLoS ONE 2018-08-03

In this article, we first briefly summarise the sud4science project and its data collection (http://sud4science.org), the ensuing processing/analysing stages, and the resulting corpus, 88milSMS (http://88milsms.huma-num.fr), through a synthesis of quotes and references to previous articles (§ 1). Secondly, we provide a state of the art on some research initiatives that use the corpus in various domains and frameworks, which will enable future cross-disciplinary insight (§ 2). Then, we present other usages of the corpus identified through surveys (§ 3)....

10.4000/corpus.4852 article EN cc-by Corpus 2020-01-28

Event-based surveillance (EBS) systems monitor a broad range of information sources to detect early signals of disease emergence, including new and unknown diseases. In December 2019, a newly identified coronavirus emerged in Wuhan (China), causing the global COVID-19 pandemic. A retrospective study was conducted to evaluate the capacity of three event-based surveillance systems (ProMED, HealthMap and PADI-web) to detect COVID-19 emergence signals. We focused on changes in online news vocabulary over the period before/after the identification...

10.1111/tbed.13738 article EN Transboundary and Emerging Diseases 2020-07-19

Tweets exchanged over the Internet are an important source of information, even if their characteristics make them difficult to analyze (e.g., a maximum of 140 characters; noisy data). In this paper, we address the problem of extracting relevant topics through tweets coming from different communities. More precisely, we are interested in the following question: which are the most relevant terms given a community? To answer this question, we define and evaluate new variants of the traditional TF-IDF. Furthermore, we also show that our measures are well suited...

10.1145/2389661.2389669 article EN 2012-11-02

PADI-web (Platform for Automated extraction of animal Disease Information from the web) is a biosurveillance system dedicated to monitoring online news sources for the detection of emerging infectious diseases. PADI-web has collected more than 380,000 articles since 2016. Compared to other existing tools, PADI-web focuses specifically on animal health and has a fully automated pipeline based on machine-learning methods. This paper presents new functionalities based on the integration of: (i) a fine-grained classification system, (ii) automatic methods...

10.1016/j.onehlt.2021.100357 article EN cc-by-nc-nd One Health 2021-12-01