Alfio Gliozzo

ORCID: 0000-0002-8044-2911
Publications
Citations
Views
---
Saved
---
About
Contact & Profiles
Research Areas
  • Topic Modeling
  • Natural Language Processing Techniques
  • Semantic Web and Ontologies
  • Data Quality and Management
  • Advanced Text Analysis Techniques
  • Text and Document Classification Technologies
  • Advanced Graph Neural Networks
  • Biomedical Text Mining and Ontologies
  • Speech and dialogue systems
  • Multimodal Machine Learning Applications
  • Text Readability and Simplification
  • Web Data Mining and Analysis
  • Expert finding and Q&A systems
  • Domain Adaptation and Few-Shot Learning
  • Time Series Analysis and Forecasting
  • Geographic Information Systems Studies
  • Rough Sets and Fuzzy Logic
  • Multi-Agent Systems and Negotiation
  • Speech Recognition and Synthesis
  • Information Retrieval and Search Behavior
  • Artificial Intelligence in Healthcare and Education
  • Wikis in Education and Collaboration
  • Personal Information Management and User Behavior
  • Intelligent Tutoring Systems and Adaptive Learning
  • Data Visualization and Analytics

IBM (United States)
2012-2023

IBM Research - Thomas J. Watson Research Center
2020-2022

Columbia University
2016

Skyline College
2012

Istituto di Scienza e Tecnologie dell'Informazione "Alessandro Faedo"
2007-2010

Ferioli & Gianotti (Italy)
2009

Istituto Centrale per la Ricerca Scientifica e Tecnologica Applicata al Mare
2001-2009

National Academies of Sciences, Engineering, and Medicine
2008

Fondazione Bruno Kessler
2007

Universidad Autónoma de Madrid
2005

Pavan Kapanipathi, Ibrahim Abdelaziz, Srinivas Ravishankar, Salim Roukos, Alexander Gray, Ramón Fernandez Astudillo, Maria Chang, Cristina Cornelio, Saswati Dana, Achille Fokoue, Dinesh Garg, Alfio Gliozzo, Sairam Gurajada, Hima Karanam, Naweed Khan, Khandelwal, Young-Suk Lee, Yunyao Li, Francois Luus, Ndivhuwo Makondo, Nandana Mihindukulasooriya, Tahira Naseem, Sumit Neelam, Lucian Popa, Revanth Gangi Reddy, Ryan Riegel, Gaetano Rossiello, Udit Sharma, G P Shrivatsa Bhargav, Mo Yu. Findings...

10.18653/v1/2021.findings-acl.339 article EN cc-by 2021-01-01

Michael Glass, Gaetano Rossiello, Md Faisal Mahbub Chowdhury, Ankita Naik, Pengshan Cai, Alfio Gliozzo. Proceedings of the 2022 Conference North American Chapter Association for Computational Linguistics: Human Language Technologies. 2022.

10.18653/v1/2022.naacl-main.194 article EN cc-by Proceedings of the 2022 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies 2022-01-01

This paper explores the role of domain information in word sense disambiguation. The underlying hypothesis is that labels, such as M EDICINE , A RCHITECTURE and S PORT provide a useful way to establish semantic relations among senses, which can be profitably used during disambiguation process. Results obtained at ENSEVAL -2 initiative confirm for significant subset words disambiguate with very high level precision.

10.1017/s1351324902003029 article EN Natural Language Engineering 2002-12-01

Michael Glass, Alfio Gliozzo, Rishav Chakravarti, Anthony Ferritto, Lin Pan, G P Shrivatsa Bhargav, Dinesh Garg, Avi Sil. Proceedings of the 58th Annual Meeting Association for Computational Linguistics. 2020.

10.18653/v1/2020.acl-main.247 article EN cc-by 2020-01-01

Michael Glass, Mustafa Canim, Alfio Gliozzo, Saneem Chemmengath, Vishwajeet Kumar, Rishav Chakravarti, Avi Sil, Feifei Pan, Samarth Bharadwaj, Nicolas Rodolfo Fauceglia. Proceedings of the 2021 Conference North American Chapter Association for Computational Linguistics: Human Language Technologies. 2021.

10.18653/v1/2021.naacl-main.96 article EN cc-by Proceedings of the 2022 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies 2021-01-01

Text-to-SQL is emerging as a practical interface for real world databases. The dominant paradigm cross-database or schema-independent, supporting application schemas unseen during training. schema of database defines the tables, columns, column types and foreign key connections between tables. Real can be large, containing hundreds but any particular query only small fraction will relevant. Placing entire in prompt an LLM impossible models with smaller token windows expensive even when...

10.48550/arxiv.2501.17174 preprint EN arXiv (Cornell University) 2025-01-23

In this paper we present a supervised Word Sense Disambiguation methodology, that exploits kernel methods to model sense distinctions. particular combination of functions is adopted estimate independently both syntagmatic and domain similarity. We defined function, namely the Domain Kernel, allowed us plug "external knowledge" into learning process. External knowledge acquired from unlabeled data in totally unsupervised way, it represented by means Models. evaluated our methodology on...

10.3115/1219840.1219890 article EN 2005-01-01

Yannis Katsis, Saneem Chemmengath, Vishwajeet Kumar, Samarth Bharadwaj, Mustafa Canim, Michael Glass, Alfio Gliozzo, Feifei Pan, Jaydeep Sen, Karthik Sankaranarayanan, Soumen Chakrabarti. Proceedings of the 2022 Conference North American Chapter Association for Computational Linguistics: Human Language Technologies: Industry Track. 2022.

10.18653/v1/2022.naacl-industry.34 preprint EN cc-by 2022-01-01

We propose KnowGL, a tool that allows converting text into structured relational data represented as set of ABox assertions compliant with the TBox given Knowledge Graph (KG), such Wikidata. address this problem sequence generation task by leveraging pre-trained sequence-to-sequence language models, e.g. BART. Given sentence, we fine-tune models to detect pairs entity mentions and jointly generate facts consisting full semantic annotations for KG, labels, types, their relationships. To...

10.1609/aaai.v37i13.27084 article EN Proceedings of the AAAI Conference on Artificial Intelligence 2023-06-26

Many forms of word relatedness have been developed, providing different perspectives on similarity.We introduce a Bayesian probabilistic tensor factorization model for synthesizing single vector representation and per-perspective linear transformations from any number similarity matrices.The resulting vectors, when combined with the transformation, approximately recreate while also regularizing generalizing, each perspective.Our method can combine manually created semantic resources neural...

10.3115/v1/d14-1161 article EN cc-by 2014-01-01

Cross-language Text Categorization is the task of assigning semantic classes to documents written in a target language (e.g. English) while system trained using labeled source Italian).In this work we present many solutions according availability bilingual resources, and show that it possible deal with problem even when no such resources are accessible. The core technique relies on automatic acquisition Multilingual Domain Models from comparable corpora.Experiments effectiveness our...

10.3115/1220175.1220245 article EN 2006-01-01

This paper introduces Strict Partial Order Networks (SPON), a novel neural network architecture designed to enforce asymmetry and transitive properties as soft constraints. We apply it induce hypernymy relations by training with is-a pairs. also present an augmented variant of SPON that can generalize type information learned for in-vocabulary terms previously unseen ones. An extensive evaluation over eleven benchmarks across different tasks shows consistently either outperforms or attains...

10.1609/aaai.v34i05.6263 article EN Proceedings of the AAAI Conference on Artificial Intelligence 2020-04-03

Extracting lexico-semantic relations as graph-structured taxonomies, also known taxonomy construction, has been beneficial in a variety of NLP applications. Recently Graph Neural Network (GNN) shown to be powerful successfully tackling many tasks. However, there no attempt exploit GNN create taxonomies. In this paper, we propose Graph2Taxo, GNN-based cross-domain transfer framework for the construction task. Our main contribution is learn latent features from existing domains guide structure...

10.18653/v1/2020.acl-main.199 article EN cc-by 2020-01-01

Tahira Naseem, Srinivas Ravishankar, Nandana Mihindukulasooriya, Ibrahim Abdelaziz, Young-Suk Lee, Pavan Kapanipathi, Salim Roukos, Alfio Gliozzo, Alexander Gray. Proceedings of the 59th Annual Meeting Association for Computational Linguistics and 11th International Joint Conference on Natural Language Processing (Volume 2: Short Papers). 2021.

10.18653/v1/2021.acl-short.34 article EN cc-by 2021-01-01

In a multilingual scenario, the classical monolingual text categorization problem can be reformulated as cross language TC task, in which we have to cope with two or more languages (e.g. English and Italian). this setting, system is trained using labeled examples source English), it classifies documents different target

10.3115/1654449.1654452 article EN 2005-01-01

In this paper we propose and evaluate a technique to perform semi-supervised learning for Text Categorization.In particular defined kernel function, namely the Domain Kernel, that allowed us plug "external knowledge" into supervised process.External knowledge is acquired from unlabeled data in totally unsupervised way, it represented by means of Models.We evaluated Kernel two standard benchmarks Categorization with good results, compared its performance function exploits bag-of-words feature...

10.3115/1706543.1706553 article EN 2005-01-01

This paper investigates conceptually and empirically the novel sense matching task, which requires to recognize whether senses of two synonymous words match in context. We suggest direct approaches problem, avoid intermediate step explicit word disambiguation, demonstrate their appealing advantages stimulating potential for future research.

10.3115/1220175.1220232 article EN 2006-01-01

We present an approach to ontology population based on a lexical substitution technique. It consists in estimating the plausibility of sentences where named entity be classified is substituted with ones contained training data, our case, partially populated ontology. Plausibility estimated by using Web while classification algorithm instance-based. evaluated method two different tasks. Experiments show that solution effective, outperforming existing methods, and it can applied practical problems.

10.3115/1599081.1599115 article EN 2008-01-01
Coming Soon ...