Craig Thomson

ORCID: 0000-0002-1602-4694
Publications
Citations
Views
---
Saved
---
About
Contact & Profiles
Research Areas
  • Natural Language Processing Techniques
  • Topic Modeling
  • Software Engineering Research
  • Speech and dialogue systems
  • Economic and Technological Systems Analysis
  • Digital Transformation in Industry
  • Advanced Text Analysis Techniques
  • Software-Defined Networks and 5G
  • Adversarial Robustness in Machine Learning
  • Explainable Artificial Intelligence (XAI)
  • Semantic Web and Ontologies
  • Agricultural economics and policies
  • Agriculture, Plant Science, Crop Management
  • Impact of AI and Big Data on Business and Society
  • Advanced Malware Detection Techniques
  • Network Security and Intrusion Detection
  • Data Mining Algorithms and Applications
  • Video Analysis and Summarization
  • Multi-Agent Systems and Negotiation
  • Smart Cities and Technologies
  • Multimodal Machine Learning Applications

University of Aberdeen
2018-2024

Dublin City University
2024

Edinburgh Napier University
2021-2024

Most Natural Language Generation systems need to produce accurate texts. We propose a methodology for high-quality human evaluation of the accuracy generated texts, which is intended serve as gold-standard evaluations data-to-text systems. use our evaluate computer basketball summaries. then show how gold standard can be used validate automated metrics.

10.18653/v1/2020.inlg-1.22 article EN cc-by 2020-01-01

Emiel van Miltenburg, Miruna Clinciu, Ondřej Dušek, Dimitra Gkatzia, Stephanie Inglis, Leo Leppänen, Saad Mahamood, Emma Manning, Schoch, Craig Thomson, Luou Wen. Proceedings of the 14th International Conference on Natural Language Generation. 2021.

10.18653/v1/2021.inlg-1.14 preprint EN cc-by 2021-01-01

Abstract While conducting a coordinated set of repeat runs human evaluation experiments in NLP, we discovered flaws every single experiment selected for inclusion via systematic process. In this squib, describe the types discovered, which include coding errors (e.g., loading wrong system outputs to evaluate), failure follow standard scientific practice ad hoc exclusion participants and responses), mistakes reported numerical results numbers not matching experimental data). If these problems...

10.1162/coli_a_00508 article EN cc-by-nc-nd Computational Linguistics 2024-01-01

Digital Twins (DT) have become crucial to achieve sustainable and effective smart urban solutions. However, current DT modelling techniques cannot support the dynamicity of these city environments. This is caused by lack right-time data capturing in traditional approaches, resulting inaccurate high resource energy consumption challenges. To fill this gap, we explore spatiotemporal graphs propose Reinforcement Learning-based Adaptive Twining (RL-AT) mechanism with Deep Q Networks (DQN). By...

10.1109/icc51166.2024.10622773 preprint EN arXiv (Cornell University) 2024-01-28

Human evaluation is widely regarded as the litmus test of quality in NLP. A basic requirementof all evaluations, but particular where they are used for meta-evaluation, that should support same conclusions if repeated. However, reproducibility human evaluations virtually never queried, let alone formally tested, NLP which means their repeatability and results currently an open question. This focused contribution reports our review experiments reported papers over past five years we assessed...

10.18653/v1/2023.findings-acl.226 article EN cc-by Findings of the Association for Computational Linguistics: ACL 2022 2023-01-01

The Shared Task on Evaluating Accuracy focused techniques (both manual and automatic) for evaluating the factual accuracy of texts produced by neural NLG systems, in a sports-reporting domain. Four teams submitted evaluation this task, using very different approaches techniques. best-performing submissions did encouragingly well at difficult task. However, all automatic struggled to detect errors which are semantically or pragmatically complex (for example, based incorrect computation inference).

10.18653/v1/2021.inlg-1.23 article EN cc-by 2021-01-01

We propose a shared task on methodologies and algorithms for evaluating the accuracy of generated texts, specifically summaries basketball games produced from box score other game data. welcome submissions based protocols human evaluation, automatic metrics, as well combinations evaluations metrics.

10.18653/v1/2020.inlg-1.28 article EN cc-by 2020-01-01

It is unfair to expect neural data-to-text produce high quality output when there are gaps between system input data and information contained in the training text. Thomson et al. (2020) identify narrow Rotowire, a popular dataset. In this paper, we describe study which finds that state-of-the-art produces higher output, according extraction (IE) based metrics, additional carefully selected from newly available source. remains be shown, however, whether IE metrics used correlate well with...

10.18653/v1/2020.inlg-1.6 article EN cc-by 2020-01-01

This paper proposes an approach to NLG system design which focuses on generating output text can be more easily processed by the reader. Ways in cognitive theory might combined with existing techniques are discussed and two simple experiments content ordering presented.

10.18653/v1/w18-6544 article EN cc-by 2018-01-01

Neural data-to-text systems lack the control and factual accuracy required to generate useful insightful summaries of multidimensional data. We propose a solution in form data views, where each view describes an entity its attributes along specific dimensions. A sequence views can then be used as high-level schema for document planning, with neural model handling complexities micro-planning surface realization. show that our view-based system retains while offering output tailored based on...

10.18653/v1/2023.inlg-main.16 article EN cc-by 2023-01-01

In today's networked world, Digital Twin Networks (DTNs) are revolutionizing how we understand and optimize physical networks. These networks, also known as 'Digital (DTNs)' or 'Networks Twins (NDTs),' encompass many from cellular wireless to optical satellite. They leverage computational power AI capabilities provide virtual representations, leading highly refined recommendations for real-world network challenges. Within DTNs, tasks include performance enhancement, latency optimization,...

10.48550/arxiv.2411.00681 preprint EN arXiv (Cornell University) 2024-11-01

Intrusion detection systems are integral to the security of networked for detecting malicious or anomalous network traffic. As traditional approaches becoming less effective, machine learning and deep learning-based intrusion vital research areas improved systems. Past into computer vision using revealed that classifiers themselves vulnerable adversarial attacks, these attacks have been investigated extensively. However, restricted not only domain image recognition. indicated by previous...

10.1109/sin54109.2021.9699157 article EN 2021-12-15

Most Natural Language Generation systems need to produce accurate texts. We propose a methodology for high-quality human evaluation of the accuracy generated texts, which is intended serve as gold-standard evaluations data-to-text systems. use our evaluate computer basketball summaries. then show how gold standard can be used validate automated metrics

10.48550/arxiv.2011.03992 preprint EN other-oa arXiv (Cornell University) 2020-01-01

We propose a shared task on methodologies and algorithms for evaluating the accuracy of generated texts. Participants will measure basketball game summaries produced by NLG systems from box score data.

10.48550/arxiv.2006.12234 preprint EN other-oa arXiv (Cornell University) 2020-01-01

The Shared Task on Evaluating Accuracy focused techniques (both manual and automatic) for evaluating the factual accuracy of texts produced by neural NLG systems, in a sports-reporting domain. Four teams submitted evaluation this task, using very different approaches techniques. best-performing submissions did encouragingly well at difficult task. However, all automatic struggled to detect errors which are semantically or pragmatically complex (for example, based incorrect computation inference).

10.48550/arxiv.2108.05644 preprint EN other-oa arXiv (Cornell University) 2021-01-01
Coming Soon ...