NFDI4DS | UHH-SEMS - Publication Details

Gillian Dobbie

ORCID: 0000-0001-7245-0367

Publications

Citations

Views

---

Saved

---

About

Contact & Profiles

A5016115576

Research Areas

Advanced Database Systems and Queries
Semantic Web and Ontologies
Data Management and Algorithms
Data Stream Mining Techniques
Data Mining Algorithms and Applications
Anomaly Detection Techniques and Applications
Rough Sets and Fuzzy Logic
Adversarial Robustness in Machine Learning
Software Engineering Research
Imbalanced Data Classification Techniques
Machine Learning and Data Classification
Software Testing and Debugging Techniques
Time Series Analysis and Forecasting
Privacy-Preserving Technologies in Data
Advanced Clustering Algorithms Research
Network Security and Intrusion Detection
Complex Network Analysis Techniques
Recommender Systems and Techniques
Service-Oriented Architecture and Web Services
Topic Modeling
Spam and Phishing Detection
Data Quality and Management
Algorithms and Data Compression
Natural Language Processing Techniques
Logic, Reasoning, and Knowledge

University of Auckland
2016-2025

Macquarie University
2008

Victoria University of Wellington
1995-2002

Mississippi State University
2002

National University of Singapore
2000

The University of Melbourne
1993

Source Inference Attacks in Federated Learning

OPENALEX - Publications

Hongsheng Hu Zoran Salčić Lichao Sun Gillian Dobbie Xuyun Zhang

Federated learning (FL) has emerged as a promising privacy-aware paradigm that allows multiple clients to jointly train model without sharing their private data. Recently, many studies have shown FL is vulnerable membership inference attacks (MIAs) can distinguish the training members of given from non-members. However, existing MIAs ignore source member, i.e., information client owning while it essential explore privacy in beyond examples all clients. The leakage lead severe issues. For...

10.1109/icdm51629.2021.00129 article EN 2021 IEEE International Conference on Data Mining (ICDM) 2021-12-01

Large Language Models Are Not Strong Abstract Reasoners

OPENALEX - Publications

G. Gendron Qiming Bao Michael Witbrock Gillian Dobbie

Large Language Models have shown tremendous performance on a large variety of natural language processing tasks, ranging from text comprehension to common sense reasoning. However, the mechanisms responsible for this success remain opaque, and it is unclear whether LLMs can achieve human-like cognitive capabilities or these models are still fundamentally circumscribed. Abstract reasoning fundamental task cognition, consisting finding applying general pattern few data. Evaluating deep neural...

10.24963/ijcai.2024/693 article EN 2024-07-26

Detecting Volatility Shift in Data Streams

OPENALEX - Publications

David Tse Jung Huang Yun Sing Koh Gillian Dobbie Russel Pears

Current drift detection techniques detect a change in distribution within stream. However, there are no current that analyze the rate of these detected changes. We coin term stream volatility, to describe changes A has high volatility if frequently and low infrequently. particularly interested shift which is (e.g. From volatility). introduce define concept propose novel technique on data streams presence drifts. In experiments we show our algorithm be both fast efficient. also new for called...

10.1109/icdm.2014.50 article EN 2014-12-01

Network Embedding and Change Modeling in Dynamic Heterogeneous Networks

OPENALEX - Publications

Ranran Bian Yun Sing Koh Gillian Dobbie Anna Divoli

Network embedding learns the vector representations of nodes. Most real world networks are heterogeneous and evolve over time. There are, however, no network approaches designed for dynamic so far. Addressing this research gap is beneficial analyzing mining networks. We develop a novel representation learning method, change2vec, which considers as snapshots with different time stamps. Instead processing whole at each stamp, change2vec models changes between two consecutive static by...

10.1145/3331184.3331273 article EN 2019-07-18

Predictive modeling of biodegradation pathways using transformer architectures

OPENALEX - Publications

Liam Brydon Kunyang Zhang Gillian Dobbie Katerina Taškova Jörg Wicker

In recent years, the integration of machine learning techniques into chemical reaction product prediction has opened new avenues for understanding and predicting behaviour substances. The necessity such predictive methods stems from growing regulatory social awareness environmental consequences associated with persistence accumulation residues. Traditional biodegradation rely on expert knowledge to perform predictions. However, creating this is becoming increasingly prohibitive due...

10.1186/s13321-025-00969-7 article EN cc-by Journal of Cheminformatics 2025-02-17

How does a GPT perform in Forecasting Severe Respiratory Disease Hospitalizations?

OPENALEX - Publications

Steffen Albrecht Alex C. Kim João Afonso Madelino Katharina Dost Johnny Zhu and 16 more

Forecasting surges in hospital admissions caused by severe respiratory infections is of crucial importance during the winter season to enable proactive management and timely decision-making prevent healthcare system overload. As time series derived from surveillance systems for these cases are sparse encode weak seasonality patterns, machine learning key computing accurate forecasts. The most recent algorithmic advance forecasting adaptation generative pre-trained transformers (GPTs). Those...

10.24135/iconip1 article EN cc-by-nc-sa 2025-03-16

Assessing the Risk of Discriminatory Bias in Classification Datasets

OPENALEX - Publications

Kunpeng Dai Jonathan Kim Sašo Džeroski Jörg Wicker Gillian Dobbie and 1 more

<title>Abstract</title> Bias in machine learning models remains a critical challenge, particularly datasets with numeric features where discrimination may be subtle and hard to detect. Existing fairness frameworks rely on expert knowledge of marginalized groups, such as specific racial categorical defining them. Furthermore, most evaluate bias rather than datasets, despite the fact that model can often traced back dataset shortcomings. Our research aims remedy this gap by capturing flaws set...

10.21203/rs.3.rs-6370375/v1 preprint EN 2025-04-04

The Automation of Design Model Repair

OPENALEX - Publications

Cheng‐Hao Cai Jing Sun Gillian Dobbie

10.1016/j.scico.2025.103313 article EN cc-by Science of Computer Programming 2025-04-01

Weighted association rule mining via a graph based connectivity model

OPENALEX - Publications

Russel Pears Yun Sing Koh Gillian Dobbie Wai K. Yeap

10.1016/j.ins.2012.07.001 article EN Information Sciences 2012-07-20

Detecting online auction shilling frauds using supervised learning

OPENALEX - Publications

Sidney Tsang Yun Sing Koh Gillian Dobbie Shafiq Alam

10.1016/j.eswa.2013.10.033 article EN Expert Systems with Applications 2013-10-24

Detection of abnormal profiles on group attacks in recommender systems

OPENALEX - Publications

Wei Zhou Yun Sing Koh Junhao Wen Shafiq Alam Gillian Dobbie

Recommender systems using Collaborative Filtering techniques are capable of make personalized predictions. However, these highly vulnerable to profile injection attacks. Group attacks that target a group items instead one, and there common attributes among items. Such profiles will have good probability being similar large number user profiles, making them hard detect. We propose novel technique for identifying attack which uses an improved metric based on Degree Similarity with Top...

10.1145/2600428.2609483 article EN 2014-07-03

Shilling Attacks Detection in Recommender Systems Based on Target Item Analysis

OPENALEX - Publications

Wei Zhou Junhao Wen Yun Sing Koh Qingyu Xiong Min Gao and 2 more

Recommender systems are highly vulnerable to shilling attacks, both by individuals and groups. Attackers who introduce biased ratings in order affect recommendations, have been shown negatively collaborative filtering (CF) algorithms. Previous research focuses only on the differences between genuine profiles attack profiles, ignoring group characteristics profiles. In this paper, we study use of statistical metrics detect rating patterns attackers Another question is that most existing...

10.1371/journal.pone.0130968 article EN cc-by PLoS ONE 2015-07-29

Real-time Smartphone Activity Classification Using Inertial Sensors—Recognition of Scrolling, Typing, and Watching Videos While Sitting or Walking

OPENALEX - Publications

Sijie Zhuo Lucas Sherlock Gillian Dobbie Yun Sing Koh Giovanni Russello and 1 more

By developing awareness of smartphone activities that the user is performing on their smartphone, such as scrolling feeds, typing and watching videos, we can develop application features are beneficial to users, personalization. It currently not possible access real-time directly, due standard privileges if internal movement sensors detect them, there may be implications for policies. Our research seeks understand whether sensor data from existing inertial measurement unit (IMU) (triaxial...

10.3390/s20030655 article EN cc-by Sensors 2020-01-24

Membership Inference Attacks on Machine Learning: A Survey

OPENALEX - Publications

Hongsheng Hu Zoran Salčić Lichao Sun Gillian Dobbie Philip S. Yu and 1 more

Machine learning (ML) models have been widely applied to various applications, including image classification, text generation, audio recognition, and graph data analysis. However, recent studies shown that ML are vulnerable membership inference attacks (MIAs), which aim infer whether a record was used train target model or not. MIAs on can directly lead privacy breach. For example, via identifying the fact clinical has associated with certain disease, an attacker owner of disease high...

10.48550/arxiv.2103.07853 preprint EN other-oa arXiv (Cornell University) 2021-01-01

Membership Inference via Backdooring

OPENALEX - Publications

Hongsheng Hu Zoran Salčić Gillian Dobbie Jinjun Chen Lichao Sun and 1 more

Recently issued data privacy regulations like GDPR (General Data Protection Regulation) grant individuals the right to be forgotten. In context of machine learning, this requires a model forget about training sample if requested by owner (i.e., unlearning). As an essential step prior unlearning, it is still challenge for tell whether or not her have been used unauthorized party train learning model. Membership inference recently emerging technique identify was target model, and seems...

10.24963/ijcai.2022/532 article EN Proceedings of the Thirty-First International Joint Conference on Artificial Intelligence 2022-07-01

A Dual Augmentation Framework for Domain Generalization with Covariate and Conditional Distribution Shifts

OPENALEX - Publications

Di Zhao Gillian Dobbie Jingfeng Zhang Hongsheng Hu Philippe Fournier‐Viger and 1 more

10.2139/ssrn.5106463 preprint EN 2025-01-01

An Evolutionary Particle Swarm Optimization algorithm for data clustering

OPENALEX - Publications

Shafiq Alam Gillian Dobbie Patricia Riddle

Clustering is an important data mining task and has been explored extensively by a number of researchers for different application areas such as finding similarities in images, text bio-informatics data. Various optimization techniques have proposed to improve the performance clustering algorithms. In this paper we propose novel algorithm that call evolutionary particle swarm (EPSO)-clustering which based on PSO. The evolution generations where particles are initially uniformly distributed...

10.1109/sis.2008.4668294 article EN IEEE Swarm Intelligence Symposium 2008-09-01

Anomaly detection and identification scheme for VM live migration in cloud infrastructure

OPENALEX - Publications

Tian Huang Yongxin Zhu Yafei Wu Stéphane Bressan Gillian Dobbie

10.1016/j.future.2015.06.005 article EN Future Generation Computer Systems 2015-07-03

Coming Soon ...