Thierry Artières

ORCID: 0000-0003-3696-0321
Publications
Citations
Views
---
Saved
---
About
Contact & Profiles
Research Areas
  • Handwritten Text Recognition Techniques
  • Natural Language Processing Techniques
  • Speech Recognition and Synthesis
  • Neural Networks and Applications
  • Machine Learning and Algorithms
  • Topic Modeling
  • Image Retrieval and Classification Techniques
  • Music and Audio Processing
  • Human Motion and Animation
  • Advanced Image and Video Retrieval Techniques
  • Sparse and Compressive Sensing Techniques
  • Image Processing and 3D Reconstruction
  • Video Analysis and Summarization
  • Web Data Mining and Analysis
  • Tensor decomposition and applications
  • Text and Document Classification Technologies
  • Speech and Audio Processing
  • Human Pose and Action Recognition
  • Time Series Analysis and Forecasting
  • Semantic Web and Ontologies
  • Machine Learning and Data Classification
  • Recommender Systems and Techniques
  • Algorithms and Data Compression
  • Face and Expression Recognition
  • Functional Brain Connectivity Studies

Centrale Marseille
2016-2024

Aix-Marseille Université
2016-2024

Centre National de la Recherche Scientifique
2013-2024

Laboratoire d’Informatique et Systèmes
2017-2024

Château Gombert
2016-2024

Université de Toulon
2019-2024

Institut de Neurosciences de la Timone
2024

Laboratoire d’Informatique Fondamentale de Marseille
2015-2017

Innate Pharma (France)
2016-2017

Sorbonne Université
2006-2015

This article provides an overview of the first BIOASQ challenge, a competition on large-scale biomedical semantic indexing and question answering (QA), which took place between March September 2013. assesses ability systems to semantically index very large numbers scientific articles, return concise user-understandable answers given natural language questions by combining information from articles ontologies.The 2013 comprised two tasks, Task 1a 1b. In participants were asked automatically...

10.1186/s12859-015-0564-6 article EN cc-by BMC Bioinformatics 2015-04-29

LSHTC is a series of challenges which aims to assess the performance classification systems in large-scale large number classes (up hundreds thousands). This paper describes dataset that have been released along series. The details construction datsets and design tracks as well evaluation measures we implemented quick overview results. All these datasets are available online runs may still be submitted on server challenges.

10.48550/arxiv.1503.08581 preprint EN other-oa arXiv (Cornell University) 2015-01-01

Large margin learning of Continuous Density HMMs with a partially labeled dataset has been extensively studied in the speech and handwriting recognition fields. Yet due to non-convexity optimization problem, previous works usually rely on severe approximations so that it is still an open problem. We propose new algorithm relies non-convex bundle methods allows tackling original problem as is. It proved converge solution accuracy ε rate O (1/ε). provide experimental results gained demonstrate...

10.1145/1553374.1553408 preprint EN 2009-06-14

Object segmentation is a crucial problem that usually solved by using supervised learning approaches over very large datasets composed of both images and corresponding object masks. Since the masks have to be provided at pixel level, building such dataset for any new domain can time-consuming. We present ReDO, model able extract objects from without annotation in an unsupervised way. It relies on idea it should possible change textures or colors changing overall distribution dataset....

10.48550/arxiv.1905.13539 preprint EN other-oa arXiv (Cornell University) 2019-01-01

We investigate a new approach for online handwritten shape recognition. Interesting features of this include learning without manual tuning, from very few training samples, incremental characters, and adaptation to the user-specific needs. The proposed system can deal with two-dimensional graphical shapes such as Latin Asian command gestures, symbols, small drawings, geometric shapes. It be used building block series recognition tasks many applications.

10.1109/tpami.2007.38 article EN IEEE Transactions on Pattern Analysis and Machine Intelligence 2007-01-05

Active learning (AL) consists of asking human annotators to annotate automatically selected data that are assumed bring the most benefit in creation a classifier.AL allows learn accurate systems with much less annotated than what is required by pure supervised algorithms, hence limiting tedious effort annotating large collection data.We experimentally investigate behavior several AL strategies for sequence labeling tasks (in partially-labeled scenario) tailored on Partially-Labeled...

10.3115/v1/d14-1097 article EN cc-by 2014-01-01

This paper provides an overview of the workshop Web-Scale Classification: Web Classification in Big Data Era which was held New York City, on February 28th as a seventh International Conference Search and Mining. The goal to discuss assess recent research focusing classification mining Web-scale category systems. brought together members several communities such web mining, machine learning, text social media mining.

10.1145/2556195.2556208 article EN 2014-02-18

This paper focuses on designing a handwriting recognition system dealing with on-line signal, i.e. temporal signal captured through an electronic pen or digitalized tablet. We present here some new results concerning hybrid based Hidden Markov Models (HMMs) and Neural Networks (NNs), which has already been presented in several contributions. In our approach, letter-model is Left-Right HMM, whose emission probability densities are approximated mixtures of predictive multilayer perceptrons....

10.1109/icdar.2001.953886 preprint EN 2002-11-13

We consider the problem of learning when obtaining training labels is costly, which usually tackled in literature using active-learning techniques. These approaches provide strategies to choose examples label before or during training. are based on heuristics even theoretical measures, but not learned as they directly used design a model aims at \textit{learning strategies} meta-learning setting. More specifically, we pool-based setting, where system observes all dataset and has subset...

10.48550/arxiv.1706.08334 preprint EN other-oa arXiv (Cornell University) 2017-01-01

Abstract The cerebral processing of voice information is known to engage, in human as well non-human primates, “temporal areas” (TVAs) that respond preferentially conspecific vocalizations. However, how represented by neuronal populations these areas, particularly speaker identity information, remains poorly understood. Here, we used a deep neural network (DNN) generate high-level, small-dimension representational space for identity—the ‘voice latent space’ (VLS)—and examined its linear...

10.1101/2024.02.27.582302 preprint EN bioRxiv (Cold Spring Harbor Laboratory) 2024-02-28

Single-cell RNA sequencing (scRNASeq) data plays a major role in advancing our understanding of developmental biology. An important current question is how to classify transcriptomic profiles obtained from scRNASeq experiments into the various cell types and identify lineage relationship for individual cells. Because fast accumulation datasets high dimensionality data, it has become challenging explore annotate single-cell by hand. To overcome this challenge, automated classification methods...

10.1371/journal.pcbi.1012006 article EN cc-by PLoS Computational Biology 2024-04-05

In this paper we study the use of confidence measures for an on-line handwriting recognizer. We investigate various and their integration in isolated word recognition system as well a sentence system. tasks, rejection mechanism is designed order to reject outputs recognizer that are possibly wrong, which case badly written words, out-of-vocabulary words or general drawing. allows rejecting parts decoded sentence.

10.1109/iwfhr.2002.1030879 preprint EN 2003-06-25

The rise in the use of social media networks has increased prevalence cyberbullying, and time is paramount to reduce negative effects that derive from those behaviours on any platform. This paper aims study early detection problem a general perspective by carrying out experiments over two independent datasets (Instagram Vine), exclusively using users' comments. We used textual information comments baseline models (fixed, threshold, dual models) apply three different methods improving...

10.3390/s23104788 article EN cc-by Sensors 2023-05-16

This paper investigates the cooperation of online and off-line handwriting word recognition systems. Our goal is to improve a mature system by, exploiting complementary information present in representation built from signal. After describing HMM based systems, we propose formal framework, which allows different strategies for combining two HMM. These schemes are then evaluated on UNIPEN database, both isolated character tasks.

10.1109/icdar.2001.953823 preprint EN 2002-11-13
Coming Soon ...