Abir Masmoudi

ORCID: 0000-0002-7450-360X
Publications
Citations
Views
---
Saved
---
About
Contact & Profiles
Research Areas
  • Natural Language Processing Techniques
  • Speech and dialogue systems
  • Language, Linguistics, Cultural Analysis
  • Topic Modeling
  • Spam and Phishing Detection
  • Sentiment Analysis and Opinion Mining
  • Speech Recognition and Synthesis
  • Linguistic Studies and Language Acquisition
  • Text and Document Classification Technologies
  • Advanced Text Analysis Techniques
  • Terrorism, Counterterrorism, and Political Violence
  • Phonetics and Phonology Research
  • Translation Studies and Practices
  • Semantic Web and Ontologies
  • Service-Oriented Architecture and Web Services
  • Algorithms and Data Compression
  • Misinformation and Its Impacts
  • Lexicography and Language Studies
  • Text Readability and Simplification
  • Machine Learning and Algorithms
  • Imbalanced Data Classification Techniques

University of Sfax
2015-2025

Le Mans Université
2014-2018

Université Nantes Angers Le Mans
2017

10.5220/0013134000003890 article EN Proceedings of the 14th International Conference on Agents and Artificial Intelligence 2025-01-01

The evolution of information and communication technology has markedly influenced between correspondents. This facilitated the transmission engendered new forms written (email, chat, SMS, comments, etc.). Most these messages comments are in Latin script, also called Arabizi . Moreover, language used social media SMS messaging is characterized by use informal non-standard vocabulary, such as repeated letters for emphasis, typos, abbreviations, nonlinguistic content like emoticons. Since...

10.1145/3364319 article EN ACM Transactions on Asian and Low-Resource Language Information Processing 2019-11-28

Automatic sentiment analysis has become one of the fastest growing research areas in Natural Language Processing (NLP) field. Despite its importance, this is first work towards at both aspect and sentence levels for Tunisian Dialect field supermarkets. Therefore, we experimentally evaluate, paper, three deep learning methods, namely convolution neural networks (CNN), long short-term memory (LSTM), bi-directional long-short-term-memory (Bi-LSTM). Both LSTM Bi-LSTM constitute two major types...

10.13053/cys-25-1-3472 article EN Computación y Sistemas 2021-02-15

Modern Standard Arabic, as well Arabic dialect languages, are usually written without diacritics. The absence of these marks constitute a real problem in the automatic processing data by NLP tools. Indeed, writing diacritics introduces several types ambiguity. First, word diacratics could have many possible meanings depending on their diacritization. Second, undiacritized surface forms an might 200 readings complexity its morphology [12]. In fact, agglutination property produce that can only...

10.1145/3297278 article EN ACM Transactions on Asian and Low-Resource Language Information Processing 2019-07-12

Abstract This paper focuses on the problem of hierarchical multi‐label classification research papers, which is task assigning set relevant labels for a from hierarchy, using reduced amounts labelled training data. Specifically, we study leveraging unlabelled data, are usually plentiful and easy to collect, in addition few available ones semi‐supervised learning framework achieving better performance results. Thus, this paper, propose approach papers based well‐known Co‐training algorithm,...

10.1111/exsy.12613 article EN Expert Systems 2020-08-24

Detecting radicalization on social networks is crucial to the fight against violent extremism and terrorism. In most cases, online has clear warning indicators that can be detected at early stages of process. this paper, we focus mining from messages by exploiting structured domain knowledge. More precisely, propose an approach automatically annotate with concepts a ontology. Annotations are then exploited within inference phase identify exhibiting indicator. We conducted set experiments...

10.1109/aina.2018.00094 preprint EN 2018-05-01

Scientific research teams have immense valuable knowledge that need to be managed. Organizing scientific contributions of team members constitutes a major challenge for the monitoring evolution, member's competences discovery, and facilitating information retrieval processes. However, performing manual annotations is often time consuming labor-intensive task, especially in case complex annotation schemas. Currently, existing management systems focus on ensuring creation, sharing,...

10.1109/wetice.2017.35 article EN 2017-06-01

The absence of diacritical marks in the Arabic texts generally leads to morphological, syntactic and semantic ambiguities.This can be more blatant when one deals with under-resourced languages, such as Tunisian dialect, which suffers from unavailability basic tools linguistic resources, like sufficient amount corpora, multilingual dictionaries, morphological analyzers.Thus, this language processing faces greater challenges due lack these resources.The automatic diacritization MSA text is...

10.26615/978-954-452-056-4_085 article EN 2019-10-22

In this paper, we describe the process of creating a statistical Language Model (LM) for Tunisian Dialect.Indeed, work is part realization Automatic Speech Recognition (ASR) system Railway Transport Network.Since our field has been limited, there are several words with similar behaviors (semantic example) but they do not have same appearance probability; their class groupings will therefore be possible.For these reasons, propose to build an n-class LM that based mainly on integration purely...

10.26615/978-954-452-056-4_084 article EN 2019-10-22
Coming Soon ...