Helena de Medeiros Caseli

ORCID: 0000-0003-3996-8599
Research Areas
  • Natural Language Processing Techniques
  • Topic Modeling
  • Speech and dialogue systems
  • Text Readability and Simplification
  • Sentiment Analysis and Opinion Mining
  • Advanced Text Analysis Techniques
  • Semantic Web and Ontologies
  • Biomedical Text Mining and Ontologies
  • Digital Mental Health Interventions
  • Software Engineering Research
  • Social Media and Politics
  • Multimodal Machine Learning Applications
  • Impact of Technology on Adolescents
  • Spam and Phishing Detection
  • Data Mining Algorithms and Applications
  • Lexicography and Language Studies
  • Data Quality and Management
  • Mental Health Research Topics
  • Anomaly Detection Techniques and Applications
  • Machine Learning and Data Classification
  • Text and Document Classification Technologies
  • Linguistics and terminology studies
  • Mental Health via Writing
  • Media and Communication Studies
  • Multi-Agent Systems and Negotiation

Universidade Federal de São Carlos
2014-2025

Federal Institute of São Paulo
2022

Universidade de São Paulo
2006-2022

Instituto Federal do Piauí
2022

Weatherford College
2022

Brazilian Society of Computational and Applied Mathematics
2020

Consejo Superior de Investigaciones Científicas
2020

Instituto de Linguística Teórica e Computacional
2008

The formation of echo chambers is a phenomenon in which users are frequently exposed to viewpoints that reinforce their beliefs, hindering exposure to divergent perspectives. While the emergence and spread of echo chambers in online communities is known, the automatic identification and understanding of this phenomenon emerges as an important tool to support the investigation of its implications for the political sphere and for democracy. The aim of this article is the investigation, implementation, and evaluation...

10.21728/p2p.2025v11n2e-7398 article PT cc-by-nc-sa P2P E INOVAÇÃO 2025-02-18

Machine learning (ML) is becoming critical to many businesses. Keeping an ML solution online and responding is therefore a necessity, and is part of the MLOps (Machine Learning operationalization) movement. One aspect of this process is monitoring not only prediction quality, but also system resources. This is important to correctly provide the necessary infrastructure, either using a fully-managed cloud platform or a local solution. This is not a difficult task, as there are tools available. However, it requires some planning...

10.1109/icmla51294.2020.00104 article EN 2021 20th IEEE International Conference on Machine Learning and Applications (ICMLA) 2020-12-01

Depression has been one of the leading causes of disability worldwide. In addition to conventional drugs and clinical treatments, other forms of treatment are also available. For example, computational solutions have been developed to prevent, screen for, and assist with depression. More specifically, chatbots are computer systems that have been used to provide therapeutic support for individuals diagnosed with depression. Although these chatbots are commercially available, their design rationale and evaluation are still not fully validated, and further research is...

10.1145/3554364.3559119 article EN 2022-10-17

In this paper we investigate the main linguistic phenomena that can make texts complex and how they could be simplified. We focus on a corpus analysis of simple account texts available on the web for Brazilian Portuguese (BP). This study illustrates the need for text simplification to facilitate accessibility to information by poor readers and by people with cognitive disabilities. It also highlights features of BP, which may differ from other languages. Moreover, we propose simplification strategies and a Simplification Annotation Editor. This study consists...

10.1145/1456536.1456540 article EN 2008-09-22

Multiword Expressions (MWEs) are one of the stumbling blocks for more precise Natural Language Processing (NLP) systems. Particularly, the lack of coverage of MWEs in resources can impact negatively on the performance of tasks and applications, and can lead to loss of information or communication errors. This is especially problematic in technical domains, where a significant portion of the vocabulary is composed of MWEs. This paper investigates the use of a statistically-driven alignment-based approach for the identification of MWEs in corpora. We look at several...

10.3115/1698239.1698241 article EN 2009-01-01

This paper introduces NEMWEL, a system that performs Never-Ending MultiWord Expressions Learning. Instead of using a static corpus and classifier, NEMWEL applies supervised learning on automatically crawled news texts. Moreover, it uses its own results to periodically retrain the classifier, bootstrapping on its own results. In addition to a detailed description of the system's architecture and its modules, we report the results of a manual evaluation. It shows that NEMWEL is capable of learning new expressions over time with improved precision.

10.3115/v1/w15-0908 preprint EN cc-by 2015-01-01

In this paper we describe some experiments carried out to test the impact of automatic casing and punctuation changes when training and testing statistical translation models. The experiments described here concern translation from/to English and Brazilian Portuguese texts, but since the superficial changes investigated are language independent, we believe that the conclusions can be applied to many other pairs of languages. These experiments were designed aiming at setting a baseline scenario for future, more complex models such as the factored ones. From...

10.1109/stil.2009.24 article EN 2009-01-01

The internet, and social media in particular, is fertile ground for publishing opinions about the most diverse subjects, products, and services. Traditionally, automatic opinion analysis is performed based on words that denote some polarity or emotion. However, with the emergence of large language models such as ChatGPT, the way we process texts to carry out subjective analyses has changed considerably. In this context, this article focuses on investigating the potential of ChatGPT, compared...

10.5753/stil.2023.233938 article PT 2023-09-21

Technology plays a relevant role in mental health. Specifically, integrating pervasive technologies with artificial intelligence (AI) holds promising potential to collect users’ data, monitor individuals daily, and support treatment. However, the lack of trust in the collected data is a common limitation of prior work on mental health technology. This paper proposes involving the user in a Human-in-the-loop approach as a solution to deal with data accuracy through [...] In our study, end users judged and evaluated the data at two different times:...

10.55612/s-5002-059-003 article EN cc-by-nc-nd Deleted Journal 2023-12-15