Scott Piao

ORCID: 0000-0003-3890-6521
Publications
Citations
Views
---
Saved
---
About
Contact & Profiles
Research Areas
  • Natural Language Processing Techniques
  • Topic Modeling
  • Semantic Web and Ontologies
  • Advanced Text Analysis Techniques
  • Lexicography and Language Studies
  • Sentiment Analysis and Opinion Mining
  • Biomedical Text Mining and Ontologies
  • linguistics and terminology studies
  • Linguistic Variation and Morphology
  • Digital Communication and Language
  • Second Language Acquisition and Learning
  • Language, Metaphor, and Cognition
  • Translation Studies and Practices
  • Text Readability and Simplification
  • Authorship Attribution and Profiling
  • Linguistics, Language Diversity, and Identity
  • Educational Assessment and Pedagogy
  • Categorization, perception, and language
  • Digital Humanities and Scholarship
  • Web Data Mining and Analysis
  • Linguistics and language evolution
  • Emotion and Mood Recognition
  • Cognitive Science and Mapping
  • Computational and Text Analysis Methods
  • Bioinformatics and Genomic Networks

Lancaster University
2011-2023

China Electronic Information Industry Development
2016

University of Manchester
2007-2010

Tampere University
2005

University of Sheffield
2001-2002

In this paper we present results from the METER (MEasuring TExt Reuse) project whose aim is to explore issues pertaining text reuse and derivation, especially in context of newspapers using newswire sources. Although by journalists has been studied linguistics, are not aware any investigation existing computational methods for particular task. We investigate classification newspaper articles according their degree dependence upon, or derivation from, a source simple 3-level scheme designed...

10.3115/1073083.1073110 article EN 2001-01-01

Scott Piao, Francesca Bianchi, Carmen Dayrell, Angela D’Egidio, Paul Rayson. Proceedings of the 2015 Conference North American Chapter Association for Computational Linguistics: Human Language Technologies. 2015.

10.3115/v1/n15-1137 article EN cc-by Proceedings of the 2022 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies 2015-01-01

Plastic pollution is one of the most significant environmental issues in world. The rapid increase cumulative amount plastic waste has caused alarm, and public have called for actions to mitigate its impacts on environment. Numerous governments social activists from various non-profit organisations set up policies actively promoted awareness engaged discussions this issue. Nevertheless, responsibility key a sustainable environment, individuals are accountable performing their civic duty...

10.3390/su14031709 article EN Sustainability 2022-02-01

This paper presents our research on the feasibility of extracting Twitter users' interests for suggesting serendipitous connections using natural language processing (NLP) technology. Defined by Andel [1] as art making an unsought finding, serendipity has a positive role in scientific and people's daily lives. Applications that facilitate would bring various benefits to us. In this work, we focus mining from messages (tweets hereafter) support detection connections. To address challenge,...

10.1109/passat/socialcom.2011.164 article EN 2011-10-01

Automatic extraction of multiword expressions (MWE) presents a tough challenge for the NLP community and corpus linguistics. Although various statistically driven or knowledge-based approaches have been proposed tested, efficient MWE still remains an unsolved issue. In this paper, we present our research work in which tested approaching issue using semantic field annotator. We use English tagger (USAS) developed at Lancaster University to identify units depict single concepts. The Meter...

10.3115/1119282.1119289 article EN 2003-01-01

Automatic extraction and analysis of meaning-related information from natural language data has been an important issue in a number research areas, such as processing (NLP), text mining, corpus linguistics, science. An aspect is the semantic annotation using tagger. In practice, various tools have designed to carry out different levels annotation, topics documents, role labeling, named entities or events. Currently, majority existing identify tag partial core data, but they tend be...

10.1016/j.csl.2017.04.010 article EN cc-by Computer Speech & Language 2017-05-17

Much has been documented in the literature on sentiment analysis and document summarisation. of this applies to long structured text form documents blog posts. With a shift social media towards short commentary (see Facebook status updates twitter tweets), difference comment structure may affect accuracy techniques. From our VoiceYourView trial, we collected over 2000 individual comments topic library refurbishment, many which are transcribed spoken comments. We have shown success...

10.1109/socialcom.2010.87 article EN 2010-08-01

Journal Article Word Alignment in English–Chinese Parallel Corpora Get access Scott Songlin Piao Search for other works by this author on: Oxford Academic Google Scholar Literary and Linguistic Computing, Volume 17, Issue 2, June 2002, Pages 207–230, https://doi.org/10.1093/llc/17.2.207 Published: 01 2002

10.1093/llc/17.2.207 article EN Literary and Linguistic Computing 2002-06-01

We introduce an annotation type system for a data-driven NLP core system.The specifications cover formal document structure and meta information, as well the linguistic levels of morphology, syntax semantics.The is embedded in framework Unstructured Information Management Architecture (UIMA).

10.3115/1642059.1642064 article EN 2007-01-01

There are various factors that affect the sentiment level expressed in textual comments. Capitalization of letters tends to mark something for attention and repeating strengthen emotion. Emoticons used help visualize facial expressions which can understanding text. In this paper, we show effect number exclamation marks used, via testing with twelve online tools. We present opinions gathered from 500 respondents towards "like" "dislike" values, a varying marks. Results only 20% tools tested...

10.1145/2837185.2837216 article EN 2015-12-11

We hypothesise that it is possible to determine a fine-grained set of sentiment values over and above the simple three-way positive/neutral/negative or binary Like/Dislike distinctions by examining textual formatting features. show this for online comments about ten different categories products. In context shopping reviews, one ways analyse consumers' feedback analysing comments. The rating "like" button on product comment not sufficient understand level expression. expression opinion only...

10.1109/icos.2015.7377288 article EN 2015-08-01

The use of metaphor in popular science is widespread to aid readers' conceptions the scientific concepts under discussion. Almost all research this area has been done by careful close reading text(s) question, but article describes—for first time—a digital 'distant reading' analysis science, using a system created team from Glasgow and Lancaster. This team, as part SAMUELS project, developed semantic tagging software which based upon UCREL Semantic Analysis System Lancaster University's...

10.1093/llc/fqv045 article EN cc-by-nc Digital Scholarship in the Humanities 2015-10-01
Coming Soon ...