Paul Rayson

ORCID: 0000-0002-1257-2191
Research Areas
  • Natural Language Processing Techniques
  • Topic Modeling
  • Lexicography and Language Studies
  • Semantic Web and Ontologies
  • Linguistic Variation and Morphology
  • Software Engineering Research
  • Linguistics, Language Diversity, and Identity
  • Advanced Text Analysis Techniques
  • Second Language Acquisition and Learning
  • Authorship Attribution and Profiling
  • Geographic Information Systems Studies
  • Sentiment Analysis and Opinion Mining
  • linguistics and terminology studies
  • Misinformation and Its Impacts
  • Stock Market Forecasting Methods
  • Digital Communication and Language
  • Translation Studies and Practices
  • Text Readability and Simplification
  • Language, Metaphor, and Cognition
  • Auditing, Earnings Management, Governance
  • Speech and dialogue systems
  • Software Engineering Techniques and Practices
  • Digital Mental Health Interventions
  • Biomedical Text Mining and Ontologies
  • Advanced Software Engineering Methodologies

Lancaster University
2016-2025

Centre for Mental Health
2021

Institute for Health Metrics and Evaluation
2021

Johns Hopkins University
2021

King Saud University
2021

Mental Health Research UK
2021

University of Birmingham
2018

University of California, Irvine
2016

The Open University
2015

University of Salento
2015

This paper reports the extension of the key words method for the comparison of corpora. Using automatic tagging software that assigns part-of-speech and semantic field (domain) tags, a method is described which permits the extraction of key domains by applying the keyness calculation to tag frequency lists. The combination of the key words and key domains methods is shown to allow macroscopic analysis (the study of the characteristics of whole texts or varieties of language) to inform the microscopic level (focussing on the use of a particular linguistic feature), thereby suggesting those...

10.1075/ijcl.13.4.06ray article EN International Journal of Corpus Linguistics 2008-12-08

This paper describes a method of comparing corpora which uses frequency profiling. The method can be used to discover key words in the corpora which differentiate one corpus from another. Using annotated corpora, it can be applied to discover key grammatical or word-sense categories. It can be used as a quick way to find the differences between the corpora and is shown to have applications in the study of social differentiation in the use of English vocabulary, the profiling of learner English, and document analysis in the software engineering process.

10.3115/1117729.1117730 article EN 2000-01-01
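The frequency-profiling idea in the two abstracts above can be sketched in a few lines. This is a minimal illustration, not the authors' implementation: the abstracts do not name the keyness statistic, so the log-likelihood (G2) score used here is an assumption, and the function names `log_likelihood` and `key_items` and the toy counts are hypothetical. The same code applies to word counts or to tag counts (part-of-speech or semantic field).

```python
import math


def log_likelihood(freq_a, total_a, freq_b, total_b):
    """Log-likelihood (G2) score for one item counted in two corpora.

    Assumption: G2 is used as the keyness statistic; the abstracts above
    do not say which statistic the method relies on.
    """
    combined = freq_a + freq_b
    expected_a = total_a * combined / (total_a + total_b)
    expected_b = total_b * combined / (total_a + total_b)
    score = 0.0
    for observed, expected in ((freq_a, expected_a), (freq_b, expected_b)):
        if observed > 0:  # a zero count contributes nothing to the sum
            score += observed * math.log(observed / expected)
    return 2 * score


def key_items(counts_a, counts_b):
    """Rank every item (word or tag) by keyness, highest score first."""
    total_a = sum(counts_a.values())
    total_b = sum(counts_b.values())
    scored = [
        (item, log_likelihood(counts_a.get(item, 0), total_a,
                              counts_b.get(item, 0), total_b))
        for item in set(counts_a) | set(counts_b)
    ]
    return sorted(scored, key=lambda pair: pair[1], reverse=True)


# Hypothetical toy frequency lists, for illustration only.
corpus_a = {"war": 20, "the": 80}
corpus_b = {"journey": 20, "the": 80}
ranking = key_items(corpus_a, corpus_b)
```

Items with equal relative frequency in both corpora score zero and sink to the bottom of the ranking; items concentrated in one corpus rise to the top.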

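To compare the frequencies with which patients with cancer and health professionals use Violence and Journey metaphors when writing online; to investigate the use of these metaphors by patients with cancer, in view of critiques of war-related metaphors for cancer and the adoption of the notion of the 'cancer journey' in UK policy documents. Computer-assisted quantitative and qualitative study of two data sets totalling 753 302 words. A UK-based online forum for patients with cancer (500 134 words) and a UK-based website for health professionals (253 168 words). 56 patients with cancer writing online between 2007 and 2012; 307 health professionals writing online between 2008 and 2013. Patients and health professionals both use Violence and Journey metaphors approximately 1.5 times per 1000 words...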

10.1136/bmjspcare-2014-000785 article EN cc-by BMJ Supportive & Palliative Care 2015-03-05
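The "per 1000 words" figure in the abstract above is a normalised frequency, which allows data sets of different sizes to be compared. A minimal sketch follows; the two data-set sizes come from the abstract, while the function name `per_thousand` and the raw count of 750 are hypothetical values chosen only for illustration.

```python
def per_thousand(count, corpus_size):
    """Normalise a raw count to occurrences per 1000 words."""
    if corpus_size <= 0:
        raise ValueError("corpus_size must be positive")
    return 1000 * count / corpus_size


# Data-set sizes as stated in the abstract.
patient_words = 500_134
professional_words = 253_168
total_words = patient_words + professional_words

# Hypothetical raw count, NOT a figure from the study.
example_rate = per_thousand(750, patient_words)
```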

We take a step towards addressing the under-representation of the African continent in NLP research by bringing together different stakeholders to create the first large, publicly available, high-quality dataset for named entity recognition (NER) in ten African languages. We detail the characteristics of these languages to help researchers and practitioners better understand the challenges they pose for NER tasks. We analyze our datasets and conduct an extensive empirical evaluation of state-of-the-art methods across both...

10.1162/tacl_a_00416 article EN cc-by Transactions of the Association for Computational Linguistics 2021-01-01

In this article, we undertake selective quantitative analyses of the demographically-sampled spoken English component of the British National Corpus (for brevity, referred to here as the "Conversational Corpus"). This is a subcorpus of c. 4.5 million words, in which speakers and respondents (see I below) are identified by such factors as gender, age, social group, and geographical region. Using a corpus analysis tool developed at Lancaster, we undertake a comparison of the vocabulary of speakers, highlighting those differences which are marked...

10.1075/ijcl.2.1.07ray article EN International Journal of Corpus Linguistics 1997-01-01

The dynamic changes in the composition, dry weight, and mineral nutrient status of the heath following fire have been investigated. The overall growth (dry weight/time) curve for the aerial organs is essentially exponential. Soil moisture is conserved by burning and, provided climatic conditions are favourable, the regeneration of all species is rapid. Annual species are rare and are found only in the first year after a fire. Many fire-resistant species regenerate rapidly from buried perennating buds; others reproduce from great numbers of seeds. The number...

10.1071/bt9580059 article EN Australian Journal of Botany 1958-01-01

We critically assess mainstream accounting and finance research applying methods from computational linguistics (CL) to study financial discourse. We also review common themes and innovations in the literature and assess the incremental contributions of studies applying CL methods over manual content analysis. Key conclusions emerging from our analysis are: (a) accounting and finance research is behind the curve in terms of CL methods generally and word sense disambiguation in particular; (b) implementation issues mean the proposed benefits of CL are often less pronounced than proponents...

10.1111/jbfa.12378 article EN cc-by Journal of Business Finance & Accounting 2019-03-01

This study combines quantitative semi-automated corpus methods with manual qualitative analysis to investigate the use of Violence metaphors for cancer and end of life in a 1,500,000-word corpus of data from three stakeholder groups in healthcare: patients, family carers and healthcare professionals. Violence metaphors in general, and especially military metaphors, are conventionally used to talk about illness, particularly cancer. However, they have also been criticized for their potentially negative implications. The innovative methodology...

10.1075/ijcl.20.2.03dem article EN International Journal of Corpus Linguistics 2015-08-17

10.1080/00014788.2019.1609346 article EN cc-by Accounting and Business Research 2019-07-25

This paper presents the ParlaMint corpora containing transcriptions of the sessions of 17 European national parliaments with half a billion words. The corpora are uniformly encoded, contain rich meta-data about 11 thousand speakers, and are linguistically annotated following the Universal Dependencies formalism and with named entities. Samples of the corpora and conversion scripts are available from the project's GitHub repository, and the complete corpora are openly available via the CLARIN.SI repository for download, as well as through the NoSketch Engine and KonText concordancers and the Parlameter...

10.1007/s10579-021-09574-0 article EN cc-by Language Resources and Evaluation 2022-02-02

In this paper, we discuss the limitations of the current syntactic composition mechanisms in aspect-oriented requirements engineering (AORE). We highlight that such mechanisms not only increase coupling between aspects and base concerns but are also insufficient to capture the intentionality of the aspect composition. Furthermore, they force the requirements engineer to reason about semantic influences and trade-offs among aspects from a syntactic perspective. We present a requirements description language (RDL) that enriches the existing natural language requirements specification with semantic information derived...

10.1145/1218563.1218569 article EN 2007-03-14

Domain analysis involves not only looking at standard requirements documents (e.g., use case specifications) but also at customer information packs, market analyses, etc. Looking across all these documents and deriving, in a practical and scalable way, a feature model that is comprised of coherent abstractions is a fundamental and non-trivial challenge. We conduct an exploratory study to investigate the suitability of Information Retrieval (IR) techniques for the identification of commonalities and variabilities in requirement...

10.1109/splc.2008.18 article EN 2008-09-01

A survey of the literature on the distribution of the sclerophyllous understorey in Australia leads to the conclusion that, although the climatic conditions under which it grows may vary considerably, the soils on which it flourishes are always acid and very low in available phosphorus and nitrogen, and sometimes in potassium, copper, zinc, and molybdenum. The problem is how the species are able to flourish on such deficient soils. To provide a background for the investigation of this problem, a detailed ecological study was initiated on an extensive stand of heath...

10.1071/bt9570052 article EN Australian Journal of Botany 1957-01-01