NFDI4DS | UHH-SEMS - Publication Details

Costanza Conforti

ORCID: 0000-0002-9484-9632

Publications

Citations

Views

---

Saved

---

About

Contact & Profiles

A5070550428

Research Areas

Topic Modeling
Sentiment Analysis and Opinion Mining
Misinformation and Its Impacts
Natural Language Processing Techniques
Business Strategies and Management Research
Handwritten Text Recognition Techniques
Machine Learning and Data Classification
Advanced Data Storage Technologies
Complex Network Analysis Techniques
Spam and Phishing Detection
Scientific Computing and Data Management
Wikis in Education and Collaboration
ICT in Developing Communities
Psychology of Moral and Emotional Judgment
Innovation and Socioeconomic Development
Cultural Differences and Values
Linguistic Studies and Language Acquisition
Innovative Human-Technology Interaction
Time Series Analysis and Forecasting
E-Government and Public Services
Energy and Environment Impacts
Software Engineering Techniques and Practices
Computational and Text Analysis Methods
Mobile Crowdsensing and Crowdsourcing
Psychological Well-being and Life Satisfaction

Google (United States)
2024

University of Cambridge
2017-2022

Monash University
2021

University of Groningen
2021

Masaryk University
2021

Czech Academy of Sciences, Institute of Sociology
2021

Will-They-Won’t-They: A Very Large Dataset for Stance Detection on Twitter

OPENALEX - Publications

Costanza Conforti Jakob Berndt Mohammad Taher Pilehvar Chryssi Giannitsarou Flavio Toxvaerd and 1 more

Costanza Conforti, Jakob Berndt, Mohammad Taher Pilehvar, Chryssi Giannitsarou, Flavio Toxvaerd, Nigel Collier. Proceedings of the 58th Annual Meeting Association for Computational Linguistics. 2020.

10.18653/v1/2020.acl-main.157 article EN cc-by 2020-01-01

Croissant: A Metadata Format for ML-Ready Datasets

OPENALEX - Publications

Mubashara Akhtar Omar Benjelloun Costanza Conforti Pieter Gijsbers Joan Giner-Miguelez and 16 more

Data is a critical resource for Machine Learning (ML), yet working with data remains key friction point. This paper introduces Croissant, metadata format datasets that simplifies how used by ML tools and frameworks. Croissant makes more discoverable, portable interoperable, thereby addressing significant challenges in management responsible AI. already supported several popular dataset repositories, spanning hundreds of thousands datasets, ready to be loaded into the most

10.1145/3650203.3663326 article EN 2024-05-29

Towards Automatic Fake News Detection: Cross-Level Stance Detection in News Articles

OPENALEX - Publications

Costanza Conforti Mohammad Taher Pilehvar Nigel Collier

In this paper, we propose to adapt the four-staged pipeline proposed by Zubiaga et al. (2018) for Rumor Verification task problem of Fake News Detection. We show that recently released FNC-1 corpus covers two its steps, namely Tracking and Stance Detection task. identify asymmetry in length input be a key characteristic latter step, when adapted framework Detection, handle it as specific type Cross-Level Inspired theories from field Journalism Studies, implement test architectures...

10.18653/v1/w18-5507 article EN cc-by 2018-01-01

Natural Language Processing for Achieving Sustainable Development: the Case of Neural Labelling to Enhance Community Profiling

OPENALEX - Publications

Costanza Conforti Stephanie Hirmer D. G. MORGAN Marco Basaldella Yau Ben Or

In recent years, there has been an increasing interest in the application of Artificial Intelligence – and especially Machine Learning to field Sustainable Development (SD). However, until now, NLP not systematically applied this context. paper, we show high potential enhance project sustainability. particular, focus on case community profiling developing countries, where, contrast developed world, a notable data gap exists. Here, could help address cost time barrier structuring qualitative...

10.18653/v1/2020.emnlp-main.677 article EN cc-by 2020-01-01

Incorporating Stock Market Signals for Twitter Stance Detection

OPENALEX - Publications

Costanza Conforti Jakob Berndt Mohammad Taher Pilehvar Chryssi Giannitsarou Flavio Toxvaerd and 1 more

Costanza Conforti, Jakob Berndt, Mohammad Taher Pilehvar, Chryssi Giannitsarou, Flavio Toxvaerd, Nigel Collier. Proceedings of the 60th Annual Meeting Association for Computational Linguistics (Volume 1: Long Papers). 2022.

10.18653/v1/2022.acl-long.281 article EN cc-by Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers) 2022-01-01

Guiding Principles for Participatory Design-inspired Natural Language Processing

OPENALEX - Publications

Tommaso Caselli Roberto Cibin Costanza Conforti Enrique Encinas Maurizio Teli

We introduce 9 guiding principles to integrate Participatory Design (PD) methods in the development of Natural Language Processing (NLP) systems. The adoption PD by NLP will help alleviate issues concerning more democratic, fairer, less-biased technologies process natural language data. This short paper is outcome an ongoing dialogue between designers and experts adopts a non-standard format following previous work Traum (2000); Bender (2013); Abzianidze Bos (2019). Every section principle....

10.18653/v1/2021.nlp4posimpact-1.4 article EN cc-by 2021-01-01

The power of language: Exploring values, empowerment dynamics and communication strategies for gender-inclusive energy service design in rural Uganda

OPENALEX - Publications

Stephanie Hirmer Antonella Mazzone Alycia Leonard Costanza Conforti

Understanding consumer needs and values is crucial to the sustainable delivery uptake of energy access projects in Low-and Middle-Income Countries (LMICs). Nevertheless, many aim empower women without first assessing gendered roles, needs, values, relations for both men project communities. Neglecting these can be detrimental end-users projects, exacerbating conflict within households rather than empowering vulnerable groups. We propose a value-based approach elicit varying priorities assess...

10.1016/j.erss.2021.102379 article EN cc-by Energy Research & Social Science 2021-12-01

Neural architectures for open-type relation argument extraction

OPENALEX - Publications

Benjamin Roth Costanza Conforti Nina Poerner Sanjeev Kumar Karn Hinrich Schütze

Abstract In this work, we focus on the task of open-type relation argument extraction (ORAE) : given a corpus, query entity Q , and knowledge base (e.g., “ authored notable work with title X ”), model has to extract an non-standard type (entities that cannot be extracted by standard named tagger, for example, book or art) from corpus. We develop compare wide range neural models yielding large improvements over strong baseline obtained question answering system. The impact different sentence...

10.1017/s1351324918000451 article EN Natural Language Engineering 2018-12-07

STANDER: An Expert-Annotated Dataset for News Stance Detection and Evidence Retrieval

OPENALEX - Publications

Costanza Conforti Jakob Berndt Mohammad Taher Pilehvar Chryssi Giannitsarou Flavio Toxvaerd and 1 more

We present a new challenging news dataset that targets both stance detection (SD) and fine-grained evidence retrieval (ER). With its 3,291 expert-annotated articles, the constitutes high-quality benchmark for future research in SD multi-task learning. provide detailed description of corpus collection methodology carry out an extensive analysis on sources disagreement between annotators, observing correlation their diffusion uncertainty around target real world. Our experiments show poses...

10.18653/v1/2020.findings-emnlp.365 article EN cc-by 2020-01-01

Integrating Transformers and Knowledge Graphs for Twitter Stance Detection

OPENALEX - Publications

Thomas A. Clark Costanza Conforti Fangyu Liu Zaiqiao Meng Ehsan Shareghi and 1 more

Stance detection (SD) entails classifying the sentiment of a text towards given target, and is relevant sub-task for opinion mining social media analysis. Recent works have explored knowledge infusion supplementing linguistic competence latent large pre-trained language models with structured graphs (KGs), yet few applied such methods to SD task. In this work, we first perform stance-relevant probing on Transformers-based in zero-shot setting, showing these models' real-world about targets...

10.18653/v1/2021.wnut-1.34 article EN cc-by 2021-01-01

Perceived value interviews and socio-economic survey data for communities in rural Uganda

OPENALEX - Publications

Stephanie Hirmer Alycia Leonard Sofia Conforti Costanza Conforti

This article describes a dataset of perceived values and socioeconomic indicators collected in rural Ugandan communities. The data were interviews which employed: (1) the User-Perceived Value game, solicits verbal using graphical prompts 'why'-probing; (2) socio-economic surveys, demographic data. constitutes 119 conducted between 2014 2015 seven villages. Interviews various settings (e.g. individual/group, women/men/mixed) different local languages (which subsequently translated into...

10.1016/j.dib.2021.107734 article EN cc-by Data in Brief 2021-12-18

Building Representative Corpora from Illiterate Communities: A Reviewof Challenges and Mitigation Strategies for Developing Countries

OPENALEX - Publications

Stephanie Hirmer Alycia Leonard Josephine Tumwesige Costanza Conforti

Stephanie Hirmer, Alycia Leonard, Josephine Tumwesige, Costanza Conforti. Proceedings of the 16th Conference European Chapter Association for Computational Linguistics: Main Volume. 2021.

10.18653/v1/2021.eacl-main.186 article EN cc-by 2021-01-01

Croissant: A Metadata Format for ML-Ready Datasets

OPENALEX - Publications

Mubashara Akhtar Omar Benjelloun Costanza Conforti Pieter Gijsbers Joan Giner-Miguelez and 16 more

10.1145/3650203.3663326 preprint EN arXiv (Cornell University) 2024-03-28

Will-They-Won't-They: A Very Large Dataset for Stance Detection on Twitter

OPENALEX - Publications

Costanza Conforti Jakob Berndt Mohammad Taher Pilehvar Chryssi Giannitsarou Flavio Toxvaerd and 1 more

We present a new challenging stance detection dataset, called Will-They-Won't-They (WT-WT), which contains 51,284 tweets in English, making it by far the largest available dataset of type. All annotations are carried out experts; therefore, constitutes high-quality and reliable benchmark for future research detection. Our experiments with wide range recent state-of-the-art systems show that poses strong challenge to existing models this domain.

10.48550/arxiv.2005.00388 preprint EN other-oa arXiv (Cornell University) 2020-01-01

Natural language processing for achieving sustainable development: the case of neural labelling to enhance community profiling

OPENALEX - Publications

Costanza Conforti Stephanie Hirmer David Morgan Marco Basaldella Yau Ben Or

In recent years, there has been an increasing interest in the application of Artificial Intelligence - and especially Machine Learning to field Sustainable Development (SD). However, until now, NLP not applied this context. research paper, we show high potential applications enhance sustainability projects. particular, focus on case community profiling developing countries, where, contrast developed world, a notable data gap exists. context, could help address cost time barrier structuring...

10.48550/arxiv.2004.12935 preprint EN other-oa arXiv (Cornell University) 2020-01-01

Coming Soon ...