Ivandré Paraboni

ORCID: 0000-0002-7270-1477
Research Areas
  • Natural Language Processing Techniques
  • Topic Modeling
  • Speech and dialogue systems
  • Authorship Attribution and Profiling
  • Sentiment Analysis and Opinion Mining
  • Multi-Agent Systems and Negotiation
  • Mental Health via Writing
  • Personality Traits and Psychology
  • Spam and Phishing Detection
  • Advanced Text Analysis Techniques
  • Hate Speech and Cyberbullying Detection
  • Misinformation and Its Impacts
  • Semantic Web and Ontologies
  • Speech Recognition and Synthesis
  • Text Readability and Simplification
  • Language, Metaphor, and Cognition
  • Digital Mental Health Interventions
  • Complex Network Analysis Techniques
  • Computational and Text Analysis Methods
  • Social Media and Politics
  • Education and Digital Technologies
  • Video Surveillance and Tracking Methods
  • Anomaly Detection Techniques and Applications
  • Mental Health Research Topics
  • Linguistics, Language Diversity, and Identity

Universidade de São Paulo
2014-2023

Universidade Cidade de São Paulo
2023

Hospital Universitário da Universidade de São Paulo
2023

Universidad San Pedro
2017

Intel (United States)
2010

Ibero American University
2010

King's College Hospital
2007

Brazilian Society of Computational and Applied Mathematics
2006

Pontifícia Universidade Católica do Rio Grande do Sul
1998

It is often desirable that referring expressions be chosen in such a way that their referents are easy to identify. This article focuses on referring expressions in hierarchically structured domains, exploring the hypothesis that referring expressions can be improved by including logically redundant information in them if this leads to a significant reduction in the amount of search needed to identify the referent. Generation algorithms are presented that implement this idea by including logically redundant information into the generated expression, in certain well-circumscribed situations. To test our hypotheses, and assess performance...

10.1162/coli.2007.33.2.229 article EN Computational Linguistics 2007-06-01

Transformer-based language models such as Bidirectional Encoder Representations from Transformers (BERT) are now mainstream in the NLP field, but extensions to languages other than English, to new domains and/or to more specific text genres are still in demand. In this paper we introduce BERTabaporu, a BERT language model that has been pre-trained on Twitter data in the Brazilian Portuguese language. The model is shown to outperform the best-known general-purpose model for Brazilian Portuguese in three Twitter-related NLP tasks, making it a potentially useful resource for Portuguese NLP in general.

10.26615/978-954-452-092-2_024 article EN 2023-01-01

Earlier work has suggested that, in hierarchically ordered domains (e.g., a document divided into sections and subsections), referring expressions that are judiciously over-specified to a higher extent than is achieved by existing generation algorithms can make it considerably easier for a hearer to find the referent of the expression. The present paper investigates over-specification in spatial domains, which plays an important role in daily life. We report an experiment whose aim is (1) to find out whether similar as...

10.1080/01690965.2013.805796 article EN Language Cognition and Neuroscience 2013-06-17

This article presents a method for prompt-based mental health screening from a large and noisy dataset of social media text. Our method uses GPT 3.5 prompting to distinguish publications that may be more relevant to the task, and then uses a straightforward bag-of-words text classifier to predict actual user labels. Results are found to be on par with a BERT mixture of experts classifier, while incurring only a fraction of its training costs.

10.5753/brasnam.2024.1879 article EN 2024-07-18

Advances in the Natural Language Processing (NLP) and machine learning fields have led to the development of automated methods for the recognition of personality traits from text available from social media and similar sources. Systems of this kind exploit the close relation between lexical knowledge and personality models (such as the well-known Big Five model) to provide information about the author of an input text in a non-intrusive fashion, and at a low cost. Although now a well-established research topic in the field, the computational recognition of personality from text still leaves a number of questions...

10.1080/13614568.2020.1722761 article EN New Review of Hypermedia and Multimedia 2019-10-02

The language employed by an individual when discussing topics of a moral nature (of the kind typically found in, e.g., social media) is revealing not only of the affective contents of the text itself, but also of the individual who wrote the text in the first place. Based on these observations, this work intends to illustrate how two kinds of morality-related information may be inferred from text by presenting a number of shallow and deep learning models of stance and moral foundations classification. In doing so, we introduce a novel corpus of texts labelled with...

10.1109/taffc.2020.3034050 article EN IEEE Transactions on Affective Computing 2020-10-26

As in many other natural language processing (NLP) fields, the use of statistical methods is now part of mainstream natural language generation (NLG). In the development of systems of this kind, however, there is the issue of data sparseness, a problem that is particularly evident in the case of morphologically-rich languages such as Portuguese. This work presents a shallow surface realisation system that makes use of factored language models (FLMs) of Portuguese to overcome some of these difficulties. The system combines FLMs trained on a large corpus with a number of NLP...

10.1007/s13173-012-0095-1 article EN cc-by Journal of the Brazilian Computer Society 2012-11-23

This paper presents a study on the recognition of personality traits from text in Brazilian Portuguese. Based on the well-known Big Five model of personality, we collected a basic linguistic-computational resource (which can be seen as a parallel corpus of texts and personality inventories) and then use this resource to build supervised models of personality recognition from Facebook status updates.

10.1109/tla.2018.8362165 article EN IEEE Latin America Transactions 2018-04-01

The inference of politically-oriented information from text data is a popular research topic in Natural Language Processing (NLP) at both text- and author-level. In recent years, studies of this kind have been implemented with the aid of text representations ranging from simple count-based models (e.g., bag-of-words) to sequence-based models built from transformers (e.g., BERT). Despite considerable success, however, we may still ask whether results may be improved further by combining these models with additional text representations. To shed...

10.3897/jucs.96652 article EN cc-by-nd JUCS - Journal of Universal Computer Science 2023-06-23

Predicting mental health statuses from social media text is a well-known Natural Language Processing (NLP) task. In this work, we focus on the issue of depression and anxiety disorder prediction from Twitter text by comparing a more conventional approach based on engineered features with a data-oriented alternative based on a mixture of specialists and transformer language models. Results from a large corpus of depression/anxiety self-disclosed diagnoses in Portuguese are reported, and a feature importance analysis is carried out to provide...

10.1109/tla.2023.10172137 article EN IEEE Latin America Transactions 2023-06-01

We introduce a labelled corpus of stances about moral issues for the Brazilian Portuguese language, and present reference results for both the stance recognition and polarity classification tasks. The corpus is built from Twitter and further expanded with data elicited through crowd sourcing and labelled by their own authors. Put together, the corpus and reference results are expected to be taken as a baseline for further studies in the field of stance recognition from text.

10.26615/978-954-452-056-4_123 article EN 2019-10-22

Computational models of hate speech detection and related tasks (e.g., detecting misogyny, racism, xenophobia, homophobia etc.) have emerged as major Natural Language Processing (NLP) research topics in recent years. In the present work, we investigate a range of alternative implementations of three of these tasks (namely, hate speech, aggressive behaviour and target group recognition) by presenting a number of experiments involving different learning methods, including regularised logistic regression, convolutional...

10.13053/cys-24-3-3478 article EN Computación y Sistemas 2020-09-30

10.1016/j.eswa.2021.114866 article EN Expert Systems with Applications 2021-03-14

At both semantic and syntactic levels, the generation of referring expressions (REG) involves far more than simply producing 'correct' output strings and, accordingly, remains central to the study and development of Natural Language Generation (NLG) systems. In particular, REG algorithms have to pay regard to humanlikeness, an issue that lies at the very heart of the classic definition of Artificial Intelligence as, e.g., motivated by the Turing test. In this work we present an end-to-end approach to REG that takes humanlikeness into account,...

10.4114/ia.v14i45.1100 article EN INTELIGENCIA ARTIFICIAL 2010-03-08

This paper discusses the computational problem of generating referring expressions (REG) in 3D virtual worlds. We propose a REG algorithm that attempts to make adequate choices of spatial relations for the purpose of disambiguation (as opposed to, e.g., determining the localisation of a previously identified object). The decisions made by the algorithm are based on existing models of spatial reference, and further refined by the use of domain knowledge obtained from a corpus of instructions in virtual environments. The proposed approach is shown to outperform a number...

10.1080/13875868.2015.1039166 article EN Spatial Cognition and Computation 2015-05-11