NFDI4DS | UHH-SEMS - Publication Details

Matthew Shardlow

ORCID: 0000-0003-1129-2750

Publications

Citations

Views

---

Saved

---

About

Contact & Profiles

A5035746903

Research Areas

Natural Language Processing Techniques
Text Readability and Simplification
Topic Modeling
Health, Environment, Cognitive Aging
Biomedical Text Mining and Ontologies
Interpreting and Communication in Healthcare
Semantic Web and Ontologies
Genetics, Bioinformatics, and Biomedical Research
Software Engineering Research
Sentiment Analysis and Opinion Mining
Misinformation and Its Impacts
Advanced Text Analysis Techniques
Software Engineering Techniques and Practices
Authorship Attribution and Profiling
Explainable Artificial Intelligence (XAI)
Wikis in Education and Collaboration
Ethics and Social Impacts of AI
Multimodal Machine Learning Applications
Artificial Intelligence in Healthcare and Education
Spam and Phishing Detection
Advanced Malware Detection Techniques
Computational and Text Analysis Methods
Emergency and Acute Care Studies
Digital Communication and Language
Pharmacovigilance and Adverse Drug Reactions

Manchester Metropolitan University
2018-2025

IT University of Copenhagen
2023

Tokyo Institute of Technology
2023

Administration for Community Living
2023

American Jewish Committee
2023

University of Manchester
2013-2018

Open Text (Canada)
2016-2018

Buglife
2006

The identification of 100 ecological questions of high policy relevance in the UK

OPENALEX - Publications

William J. Sutherland SUSAN ARMSTRONG‐BROWN Paul R. Armsworth Tom Brereton J. H. BRICKLAND and 34 more

Summary Evidence‐based policy requires researchers to provide the answers ecological questions that are of interest makers. To find out what those in UK, representatives from 28 organizations involved policy, together with scientists 10 academic institutions, were asked generate a list their organizations. During 2‐day workshop initial 1003 generated consulting at least 654 makers and academics was used as basis for generating short 100 significant relevance. Short‐listing decided on...

10.1111/j.1365-2664.2006.01188.x article EN Journal of Applied Ecology 2006-05-30

A Survey of Automated Text Simplification

OPENALEX - Publications

Matthew Shardlow

Text simplification modifies syntax and lexicon to improve the understandability of language for an end user. This survey identifies classifies research within period 1998-2013. Simplification can be used many applications, including: Second learners, preprocessing in pipelines assistive technology. There are approaches task, lexical, syntactic, statistical machine translation hybrid techniques. also explores current challenges which this field faces. is a non-trivial task rapidly growing...

10.14569/specialissue.2014.040109 article EN cc-by International Journal of Advanced Computer Science and Applications 2014-01-01

Analysis of the effect of sentiment analysis on extracting adverse drug reactions from tweets and forum posts

OPENALEX - Publications

Ioannis Korkontzelos Azadeh Nikfarjam Matthew Shardlow Abeed Sarker Sophia Ananiadou and 1 more

The abundance of text available in social media and health related forums along with the rich expression public opinion have recently attracted interest community to use these sources for pharmacovigilance. Based on intuition that patients post about Adverse Drug Reactions (ADRs) expressing negative sentiments, we investigate effect sentiment analysis features locating ADR mentions. We enrich feature space a state-of-the-art identification method features. Using corpus posts from...

10.1016/j.jbi.2016.06.007 article EN cc-by Journal of Biomedical Informatics 2016-06-28

SemEval-2021 Task 1: Lexical Complexity Prediction

OPENALEX - Publications

Matthew Shardlow Richard Evans Gustavo Henrique Paetzold Marcos Zampieri

This paper presents the results and main findings of SemEval-2021 Task 1 - Lexical Complexity Prediction. We provided participants with an augmented version CompLex Corpus (Shardlow et al. 2020). is English multi-domain corpus in which words multi-word expressions (MWEs) were annotated respect to their complexity using a five point Likert scale. featured two Sub-tasks: Sub-task focused on single 2 MWEs. The competition attracted 198 teams total, 54 submitted official runs test data 37 2.

10.18653/v1/2021.semeval-1.1 article EN cc-by Proceedings of the 16th International Workshop on Semantic Evaluation (SemEval-2022) 2021-01-01

Overview of the BioLaySumm 2024 Shared Task on the Lay Summarization of Biomedical Research Articles

OPENALEX - Publications

Tomas Goldsack Carolina Scarton Matthew Shardlow Chenghua Lin

10.18653/v1/2024.bionlp-1.10 article EN 2024-01-01

Identification of research hypotheses and new knowledge from scientific literature

OPENALEX - Publications

Matthew Shardlow Riza Batista-Navarro Paul M. Thompson Raheel Nawaz John McNaught and 1 more

Text mining (TM) methods have been used extensively to extract relations and events from the literature. In addition, TM techniques various types or dimensions of interpretative information, known as Meta-Knowledge (MK), context events, e.g. negation, speculation, certainty knowledge type. However, most existing focussed on extraction individual MK, without investigating how they can be combined obtain even richer contextual information. this paper, we describe a novel, supervised method new...

10.1186/s12911-018-0639-1 article EN cc-by BMC Medical Informatics and Decision Making 2018-06-25

HTSS: A novel hybrid text summarisation and simplification architecture

OPENALEX - Publications

Farooq Zaman Matthew Shardlow Saeed‐Ul Hassan Naif Radi Aljohani Raheel Nawaz

10.1016/j.ipm.2020.102351 article EN Information Processing & Management 2020-07-13

Leveraging AI and Machine Learning for National Student Survey: Actionable Insights from Textual Feedback to Enhance Quality of Teaching and Learning in UK’s Higher Education

OPENALEX - Publications

Raheel Nawaz Quanbin Sun Matthew Shardlow Georgios Kontonatsios Naif Radi Aljohani and 2 more

Students’ evaluation of teaching, for instance, through feedback surveys, constitutes an integral mechanism quality assurance and enhancement teaching learning in higher education. These surveys usually comprise both the Likert scale free-text responses. Since discrete responses are easy to analyze, they feature more prominently survey analyses. However, often contain richer, detailed, nuanced information with actionable insights. Mining these insights is challenging, as it requires a degree...

10.3390/app12010514 article EN cc-by Applied Sciences 2022-01-05

Deep learning approaches to lexical simplification: A survey

OPENALEX - Publications

Kai North Tharindu Ranasinghe Matthew Shardlow Marcos Zampieri

Abstract Lexical Simplification (LS) is the task of substituting complex words within a sentence for simpler alternatives while maintaining sentence’s original meaning. LS lexical component Text (TS) systems with aim improving accessibility to various target populations such as individuals low literacy or reading disabilities. Prior surveys have been published several years before introduction transformers, transformer-based large language models (LLMs), and prompt learning that drastically...

10.1007/s10844-024-00882-9 article EN cc-by Journal of Intelligent Information Systems 2024-09-02

Bot detection using a single post on social media

OPENALEX - Publications

Shad Mohammad Muhammad Usman Shahid Khan Mazhar Ali Chengxi Liu Matthew Shardlow and 1 more

Recent studies of social media have made a unanimous conclusion that public opinions can be altered through systematic exploitation using bot accounts. The existing detection methodologies utilize features the accounts to label them as either or human. However, in this work, we propose convolutional neural network (CNN) identify single post on media. We compared our results with an artificial (ANN) trained extracted from accounts' profiles. Results shown detected 98.71% accuracy CNN 97.6%...

10.1109/worlds4.2019.8903989 article EN 2019-07-01

CompLex: A New Corpus for Lexical Complexity Prediction from Likert Scale Data

OPENALEX - Publications

Matthew Shardlow M. Cooper Marcos Zampieri

Predicting which words are considered hard to understand for a given target population is vital step in many NLP applications such as text simplification. This task commonly referred Complex Word Identification (CWI). With few exceptions, previous studies have approached the binary classification systems predict complexity value (complex vs. non-complex) set of text. choice motivated by fact that all CWI datasets compiled so far been annotated using annotation scheme. Our paper addresses...

10.48550/arxiv.2003.07008 preprint EN other-oa arXiv (Cornell University) 2020-01-01

Lexical simplification benchmarks for English, Portuguese, and Spanish

OPENALEX - Publications

Sanja Štajner Daniel Ferrés Matthew Shardlow Kai North Marcos Zampieri and 1 more

Even in highly-developed countries, as many 15-30% of the population can only understand texts written using a basic vocabulary. Their understanding everyday is limited, which prevents them from taking an active role society and making informed decisions regarding healthcare, legal representation, or democratic choice. Lexical simplification natural language processing task that aims to make text understandable everyone by replacing complex vocabulary expressions with simpler ones, while...

10.3389/frai.2022.991242 article EN cc-by Frontiers in Artificial Intelligence 2022-09-22

Findings of the TSAR-2022 Shared Task on Multilingual Lexical Simplification

OPENALEX - Publications

Horacio Saggion Sanja Štajner Daniel Ferrés Kim Cheng Sheang Matthew Shardlow and 2 more

Horacio Saggion, Sanja Štajner, Daniel Ferrés, Kim Cheng Sheang, Matthew Shardlow, Kai North, Marcos Zampieri. Proceedings of the Workshop on Text Simplification, Accessibility, and Readability (TSAR-2022). 2022.

10.18653/v1/2022.tsar-1.31 article EN cc-by 2022-01-01

Using Natural Language Processing to Explore Differences in Healthcare Professionals’ Language On Functional Neurological Disorders: A Comparative Topic and Sentiment Analysis Study

OPENALEX - Publications

Md. Shadab Mashuk Yang Lü Lana Yin Hui Lai Matthew Shardlow Shumit Saha and 5 more

<title>Abstract</title> Background Effective communication is essential for delivering quality healthcare, particularly individuals with Functional Neurological Disorders (FND), who are often subject to misdiagnosis and stigmatising language that implies symptom fabrication. Variability in styles among healthcare professionals may contribute these challenges, affecting patient understanding care outcomes. Methods This study employed natural processing (NLP) analyse clinician-to-clinician...

10.21203/rs.3.rs-6018381/v1 preprint EN cc-by Research Square (Research Square) 2025-02-18

Neural Text Simplification of Clinical Letters with a Domain Specific Phrase Table

OPENALEX - Publications

Matthew Shardlow Raheel Nawaz

Clinical letters are infamously impenetrable for the lay patient. This work uses neural text simplification methods to automatically improve understandability of clinical patients. We take existing software and augment it with a new phrase table that links complex medical terminology simpler vocabulary by mining SNOMED-CT. In an evaluation task using crowdsourcing, we show results our system ranked easier understand (average rank 1.93) than original (2.34) without table. also improvement...

10.18653/v1/p19-1037 article EN cc-by 2019-01-01

Collaborative augmentation and simplification of text (CoAST): pedagogical applications of natural language processing in digital learning environments

OPENALEX - Publications

Matthew Shardlow Sam Sellar David Rousell

Abstract The digitisation of higher education is raising significant questions about the impact artificial intelligence and automation on teaching learning environments, highlighting need to investigate how teachers students can work with new educational technologies in complementary ways. This paper reports results from a pilot study collaborative augmentation simplification text (CoAST) system, which online software designed facilitate engagement university theoretically-sophisticated...

10.1007/s10984-021-09368-9 article EN cc-by Learning Environments Research 2021-05-22

One emoji, many meanings: A corpus for the prediction and disambiguation of emoji sense

OPENALEX - Publications

Matthew Shardlow Luciano Gerber Raheel Nawaz

In this work, we uncover a hidden linguistic property of emoji, namely that they are polysemous and can be used to form semantic network emoji meanings. Our key contributions direction study as follows: (1) We have developed new corpus help in the task sense prediction. This contains tweets with single emojis, where each has been labelled an appropriate identifier from WordNet. (2) Experiments, which demonstrate it is possible predict using our reasonable level accuracy. able report average...

10.1016/j.eswa.2022.116862 article EN cc-by Expert Systems with Applications 2022-03-18

Generative image captioning in Urdu using deep learning

OPENALEX - Publications

Muhammad Kashif Afzal Matthew Shardlow Suppawong Tuarob Farooq Zaman Raheem Sarwar and 5 more

Abstract Urdu is morphologically rich language and lacks the resources available in English. While several studies on image captioning task English have been published, this among pioneer generative captioning. The study makes key contributions: (i) it presents a new dataset for captioning, (ii) different attention-based architectures language. These attention mechanisms are to language, as those never used (iii) Finally, performs quantitative qualitative analysis of results by studying...

10.1007/s12652-023-04584-y article EN cc-by Journal of Ambient Intelligence and Humanized Computing 2023-04-10

BioLaySumm 2023 Shared Task: Lay Summarisation of Biomedical Research Articles

OPENALEX - Publications

Tomas Goldsack Zheheng Luo Qianqian Xie Carolina Scarton Matthew Shardlow and 2 more

Tomas Goldsack, Zheheng Luo, Qianqian Xie, Carolina Scarton, Matthew Shardlow, Sophia Ananiadou, Chenghua Lin. The 22nd Workshop on Biomedical Natural Language Processing and BioNLP Shared Tasks. 2023.

10.18653/v1/2023.bionlp-1.44 preprint EN cc-by 2023-01-01

Coming Soon ...