NFDI4DS | UHH-SEMS - Publication Details

Felice Dell’Orletta⋄

ORCID: 0000-0003-3454-9387

About

Contact & Profiles

A5084812833

Research Areas

Natural Language Processing Techniques
Topic Modeling
Text Readability and Simplification
Linguistic Studies and Language Acquisition
Authorship Attribution and Profiling
Semantic Web and Ontologies
Speech and dialogue systems
Software Engineering Research
Sentiment Analysis and Opinion Mining
Advanced Text Analysis Techniques
Multimodal Machine Learning Applications
Interpreting and Communication in Healthcare
Biomedical Text Mining and Ontologies
linguistics and terminology studies
Second Language Acquisition and Learning
Speech Recognition and Synthesis
Software Engineering Techniques and Practices
Language, Metaphor, and Cognition
Hate Speech and Cyberbullying Detection
Digital Communication and Language
Reading and Literacy Development
Algorithms and Data Compression
Lexicography and Language Studies
Language and cultural evolution
Machine Learning in Healthcare

Institute for Computational Linguistics “A. Zampolli”
2016-2025

National Research Council
2009-2024

University of Pisa
2005-2023

University of Bologna
2023

University of Groningen
2014-2021

National Academies of Sciences, Engineering, and Medicine
2010-2020

University of California, Davis
2020

University of Genoa
2018-2019

Istituto Nazionale di Fisica Nucleare, Sezione di Roma I
2018

University of Salerno
2018

Cross-Media Learning for Image Sentiment Analysis in the Wild

OPENALEX - Publications

Lucia Vadicamo Fabio Carrara Andrea Cimino Stefano Cresci Felice Dell’Orletta⋄ and 2 more

Much progress has been made in the field of sentiment analysis in the past years. Researchers relied on textual data for this task, while only recently they have started investigating approaches to predict sentiments from multimedia content. With the increasing amount of data shared on social media, there is also a rapidly growing interest in approaches that work "in the wild", i.e. that are able to deal with uncontrolled conditions. In this work, we faced the challenge of training a visual sentiment classifier starting from a large set of user-generated and unlabeled...

10.1109/iccvw.2017.45 article EN 2017-10-01

The PAISÀ Corpus of Italian Web Texts

OPENALEX - Publications

Verena Lyding Egon Stemle Claudia Borghetti M. Brunello Sara Castagnoli and 4 more

Verena Lyding, Egon Stemle, Claudia Borghetti, Marco Brunello, Sara Castagnoli, Felice Dell’Orletta, Henrik Dittmann, Alessandro Lenci, Vito Pirrelli. Proceedings of the 9th Web as Corpus Workshop (WaC-9). 2014.

10.3115/v1/w14-0406 article EN 2014-01-01

Automatic extraction of function–behaviour–state information from patents

OPENALEX - Publications

Gualtiero Fantoni R. Apreda Felice Dell’Orletta⋄ Marco Antonio Sotelo Monge

10.1016/j.aei.2013.04.004 article EN Advanced Engineering Informatics 2013-06-18

A Linguistically-driven Approach to Cross-Event Damage Assessment of Natural Disasters from Social Media Messages

OPENALEX - Publications

Stefano Cresci Maurizio Tesconi Andrea Cimino Felice Dell’Orletta⋄

This work focuses on the analysis of Italian social media messages for disaster management and aims at the detection of messages carrying critical information for the damage assessment task. A main novelty of this study consists in the focus on out-domain and cross-event damage detection, and on the investigation of the most relevant tweet-derived features for these tasks. We devised different experiments by resorting to a wide set of linguistic features qualifying the lexical and grammatical structure of a text as well as ad-hoc features specifically implemented for this task. We investigated the most effective features that...

10.1145/2740908.2741722 article EN 2015-05-18

Contextual and Non-Contextual Word Embeddings: an in-depth Linguistic Investigation

OPENALEX - Publications

Alessio Miaschi Felice Dell’Orletta⋄

In this paper we present a comparison between the linguistic knowledge encoded in the internal representations of a contextual Language Model (BERT) and a contextual-independent one (Word2vec). We use a wide set of probing tasks, each of which corresponds to a distinct sentence-level feature extracted from different levels of linguistic annotation. We show that, although BERT is capable of understanding the full context of each word in an input sequence, the implicit knowledge encoded in its aggregated sentence representations is still comparable to that of a contextual-independent model. We also find that BERT is able to encode...

10.18653/v1/2020.repl4nlp-1.15 article EN cc-by 2020-01-01

Mining commonalities and variabilities from natural language documents

OPENALEX - Publications

Alessio Ferrari Giorgio Oronzo Spagnolo Felice Dell’Orletta⋄

A company that wishes to enter an established market with a new, competitive product is required to analyse the solutions of the competitors. Identifying and comparing the features provided by the other vendors might greatly help during the market analysis. However, mining common and variant features from the publicly available documents of the competitors is a time-consuming and error-prone task. In this paper, we suggest employing a natural language processing approach based on contrastive analysis to identify commonalities and variabilities from the brochures...

10.1145/2491627.2491634 article EN 2013-08-26

Natural Language Requirements Processing: A 4D Vision

OPENALEX - Publications

Alessio Ferrari Felice Dell’Orletta⋄ Andrea Esuli Vincenzo Gervasi Stefania Gnesi

The future evolution of the application natural language processing technologies in requirements engineering can be viewed from four dimensions: discipline, dynamism, domain knowledge, and datasets.

10.1109/ms.2017.4121207 article EN IEEE Software 2017-11-01

Overexpression of the cohesin-core subunit SMC1A contributes to colorectal cancer development

OPENALEX - Publications

Patrizia Sarogni Orazio Palumbo Adele Servadio Simonetta Astigiano Barbara D’Alessio and 12 more

Cancer cells are characterized by chromosomal instability (CIN), and it is thought that errors in pathways involved in faithful chromosome segregation play a pivotal role in the genesis of CIN. Cohesin forms a large protein ring that binds DNA strands by encircling them. In addition to this central role in chromosome segregation, cohesin is also needed for DNA repair, gene transcription regulation and chromatin architecture. Though mutations in both cohesin and cohesin-regulator genes have been identified in many human cancers, the contribution of cohesin to cancer...

10.1186/s13046-019-1116-0 article EN cc-by Journal of Experimental & Clinical Cancer Research 2019-03-01

T-FREX: A Transformer-based Feature Extraction Method from Mobile App Reviews

OPENALEX - Publications

Quim Motger Alessio Miaschi Felice Dell’Orletta⋄ Xavier Franch Jordi Marco

10.1109/saner60148.2024.00030 article EN 2024 IEEE International Conference on Software Analysis, Evolution and Reengineering (SANER) 2024-03-12

Cross-lingual distillation for domain knowledge transfer with sentence transformers

OPENALEX - Publications

Ruben Piperno Luca Bacco Felice Dell’Orletta⋄ Mario Merone Leandro Pecchia

10.1016/j.knosys.2025.113079 article EN cc-by Knowledge-Based Systems 2025-01-31

When Time Matters: Exploring the Impact of Recall Techniques and Educational Levels on Witness Testimony Quality

OPENALEX - Publications

Sara Solà-Sales Chiara Alzetta Carmen Moret‐Tatay Felice Dell’Orletta⋄

Mental reconstruction (MRC) and Free Recall (FR) have been recognized for enhancing the quality of witness statements. However, the mechanisms underlying this association remain insufficiently understood. This study explores how the time allocated to MRC and FR and variations in educational level influence the quality of eyewitness testimonies. Testimony quality is evaluated based on manually annotated content information provided by experts in testimony assessment, which measures adherence to the events. This is further complemented by fine-grained...

10.3390/info16020122 article EN cc-by Information 2025-02-08

Efficient multi-task learning with instance selection for biomedical NLP

OPENALEX - Publications

Agnese Bonfigli Luca Bacco Leandro Pecchia Mario Merone Felice Dell’Orletta⋄

Biomedical natural language processing (NLP) increasingly relies on large models and extensive datasets, presenting significant computational challenges. We propose Blue5, a multi-task model based on SciFive that incorporates instance selection (IS) to enable efficient multi-task learning (MTL) on biomedical data. We adapt the E2SC-IS framework for the biomedical domain, integrating a calibrated SVM classifier to reduce costs. Our approach achieves an average data reduction of 26.6% across several tasks of the BLUE (Biomedical Language...

10.1016/j.compbiomed.2025.110050 article EN cc-by-nc-nd Computers in Biology and Medicine 2025-04-02

Reverse revision and linear tree combination for dependency parsing

OPENALEX - Publications

Giuseppe Attardi Felice Dell’Orletta⋄

Deterministic transition-based Shift/Reduce dependency parsers often make mistakes in the analysis of long-span dependencies (McDonald & Nivre, 2007).

10.3115/1620853.1620925 article EN 2009-01-01

Is this Sentence Difficult? Do you Agree?

OPENALEX - Publications

Dominique Brunato⋄ Lorenzo De Mattei Felice Dell’Orletta⋄ Benedetta Iavarone Giulia Venturi⋄

In this paper, we present a crowdsourcing-based approach to model the human perception of sentence complexity. We collect a large corpus of sentences rated with judgments of complexity for two typologically different languages, Italian and English. We test our approach in two experimental scenarios aimed at investigating the contribution of a wide set of lexical, morpho-syntactic and syntactic phenomena in predicting i) the degree of agreement among annotators independently from the assigned judgment and ii)

10.18653/v1/d18-1289 article EN cc-by Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing 2018-01-01

Text mining tool for translating terms of contract into technical specifications: Development and application in the railway sector

OPENALEX - Publications

Gualtiero Fantoni E. Coli Filippo Chiarello R. Apreda Felice Dell’Orletta⋄ and 1 more

10.1016/j.compind.2020.103357 article EN Computers in Industry 2020-11-25

Linguistic Profiling of a Neural Language Model

OPENALEX - Publications

Alessio Miaschi Dominique Brunato⋄ Felice Dell’Orletta⋄ Giulia Venturi⋄

In this paper we investigate the linguistic knowledge learned by a Neural Language Model (NLM) before and after a fine-tuning process and how this knowledge affects its predictions during several classification problems. We use a wide set of probing tasks, each of which corresponds to a distinct sentence-level feature extracted from different levels of linguistic annotation. We show that BERT is able to encode a wide range of linguistic characteristics, but it tends to lose this information when trained on specific downstream tasks. We also find that BERT's capacity to encode different kind...

10.18653/v1/2020.coling-main.65 article EN cc-by Proceedings of the 28th International Conference on Computational Linguistics 2020-01-01

Assessing the Readability of Sentences: Which Corpora and Features?

OPENALEX - Publications

Felice Dell’Orletta⋄ Martijn Wieling Giulia Venturi⋄ Andrea Cimino Simonetta Montemagni⋄

The paper investigates the problem of sentence readability assessment, which is modelled as a classification task, with a specific view to text simplification. In particular, it addresses two open issues connected with it, i.e. the corpora to be used for training, and the identification of the most effective features to determine sentence readability. An existing readability assessment tool developed for Italian was specialized at the level of training corpus and learning algorithm. A maximum entropy-based feature selection and ranking algorithm (grafting)...

10.3115/v1/w14-1820 article EN cc-by 2014-01-01

Design and Annotation of the First Italian Corpus for Text Simplification

OPENALEX - Publications

Dominique Brunato⋄ Felice Dell’Orletta⋄ Giulia Venturi⋄ Simonetta Montemagni⋄

In this paper, we present the design and construction of the first Italian corpus for automatic and semi-automatic text simplification. In line with current approaches, we propose a new annotation scheme specifically conceived to identify the typology of changes an original sentence undergoes when it is manually simplified. Such a scheme has been applied to two aligned Italian corpora, containing original texts with corresponding simplified versions, selected as representative of two different manual simplification strategies and addressing different target reader...

10.3115/v1/w15-1604 article EN cc-by 2015-01-01
