NFDI4DS | UHH-SEMS - Publication Details

Donna Xu

ORCID: 0000-0003-4453-7056

Publications

Citations

Views

---

Saved

---

About

Contact & Profiles

A5068418080

Research Areas

Text and Document Classification Technologies
Software System Performance and Reliability
Machine Learning and Algorithms
Domain Adaptation and Few-Shot Learning
Water Systems and Optimization
Machine Learning and Data Classification
Cloud Computing and Resource Management
Generative Adversarial Networks and Image Synthesis
Anomaly Detection Techniques and Applications
Bayesian Methods and Mixture Models
Multimodal Machine Learning Applications
Security and Verification in Computing
Infrastructure Resilience and Vulnerability Analysis
Cancer-related molecular mechanisms research
Access Control and Trust
Distributed systems and fault tolerance
Advanced Image and Video Retrieval Techniques
IoT and Edge/Fog Computing
Supply Chain Resilience and Risk Management
Advanced Database Systems and Queries
Face and Expression Recognition
Data Analysis with R
Functional Brain Connectivity Studies
Advanced MRI Techniques and Applications
Blind Source Separation Techniques

University of Technology Sydney
2018-2020

Commonwealth Scientific and Industrial Research Organisation
2015

Data61
2015

The University of Sydney
2015

Survey on Multi-Output Learning

OPENALEX - Publications

Donna Xu Yaxin Shi Ivor W. Tsang Yew-Soon Ong Chen Gong and 1 more

The aim of multi-output learning is to simultaneously predict multiple outputs given an input. It important problem for decision-making since making decisions in the real world often involves complex factors and criteria. In recent times, increasing number research studies have focused on ways at once. Such efforts transpired different forms according particular under study. Classic cases include multi-label learning, multi-dimensional multi-target regression, others. From our survey topic,...

10.1109/tnnls.2019.2945133 article EN IEEE Transactions on Neural Networks and Learning Systems 2019-01-01

Metric Learning for Multi-Output Tasks

OPENALEX - Publications

Weiwei Liu Donna Xu Ivor W. Tsang Wenjie Zhang

Multi-output learning with the task of simultaneously predicting multiple outputs for an input has increasingly attracted interest from researchers due to its wide application. The <inline-formula><tex-math notation="LaTeX">$k$</tex-math></inline-formula> nearest neighbor ( notation="LaTeX">$k \text{NN}$</tex-math></inline-formula> ) algorithm is one most popular frameworks handling multi-output problems. performance depends crucially on metric used compute distance between different...

10.1109/tpami.2018.2794976 article EN IEEE Transactions on Pattern Analysis and Machine Intelligence 2018-01-18

A Survey on Multi-output Learning

OPENALEX - Publications

Donna Xu Yaxin Shi Ivor W. Tsang Yew-Soon Ong Gong Chen and 1 more

Multi-output learning aims to simultaneously predict multiple outputs given an input. It is important problem due the pressing need for sophisticated decision making in real-world applications. Inspired by big data, 4Vs characteristics of multi-output imposes a set challenges learning, terms volume, velocity, variety and veracity outputs. Increasing number works literature have been devoted study development novel approaches addressing encountered. However, it lacks comprehensive overview on...

10.48550/arxiv.1901.00248 preprint EN other-oa arXiv (Cornell University) 2019-01-01

Online Product Quantization

OPENALEX - Publications

Donna Xu Ivor W. Tsang Ying Zhang

Approximate nearest neighbor (ANN) search has achieved great success in many tasks. However, existing popular methods for ANN search, such as hashing and quantization methods, are designed static databases only. They cannot handle well the database with data distribution evolving dynamically, due to high computational effort retraining model based on new database. In this paper, we address problem by developing an online product (online PQ) incrementally updating codebook that accommodates...

10.1109/tkde.2018.2817526 article EN IEEE Transactions on Knowledge and Data Engineering 2018-01-01

Achieving Reliable High-Frequency Releases in Cloud Environments

OPENALEX - Publications

Liming Zhu Donna Xu An Binh Tran Xiwei Xu Len Bass and 2 more

Continuous delivery and deployment are dramatically shortening release cycles from months to hours. Cloud applications with high-frequency releases often rely heavily on automated tools cloud infrastructure APIs deploy new software versions. The authors report reliability issues how these contribute them. They also analyze the trade-offs between using baked lightly virtual-image approaches, basis of experiments Amazon Web Service OpsWorks Chef configuration management tool. Finally, they...

10.1109/ms.2015.23 article EN IEEE Software 2015-01-12

A Data-Analytics Approach for Enterprise Resilience

OPENALEX - Publications

Donna Xu Ivor W. Tsang Eng K. Chew Cosimo Siclari Varun Kaul

Enterprise resilience plays an important role to prevent business services from disruptions caused by human-induced disasters such as failed change implementations and software bugs. Traditional expert-centric approach has difficulty maintain continued critical functions because the can often only be handled after their occurrence. This paper introduces a data-analytics approach, which leverages system monitoring data for enterprise resilience. With power of mining machine learning...

10.1109/mis.2019.2918092 article EN IEEE Intelligent Systems 2019-05-01

Making Real Time Data Analytics Available as a Service

OPENALEX - Publications

Donna Xu Dongyao Wu Xiwei Xu Liming Zhu Len Bass

Conducting (big) data analytics in an organization is not just about using a processing framework (e.g. Hadoop/Spark) to learn model from currently single file system HDFS). We frequently need pipeline real time other systems into the framework, and continually update learned model. The frameworks be easily invokable for different purposes produce models. subsequent updates integrated with product that may require prediction latest trained All these shared among teams purposes. In this...

10.1145/2737182.2737186 article EN 2015-04-22

Self Protecting Data Sharing Using Generic Policies

OPENALEX - Publications

Shiping Chen Danan Thilakanathan Donna Xu ‪Surya Nepal‬ Rafael A. Calvo

Although content sharing provides many benefits, owners lose full control of their once they are given away. Existing solutions provide limited capabilities access as vendor-specific, non-structured and non-flexible. In this paper, we present an open flexible software solution called SelfProtect Object (SPO). SPO bundles policy files in object that can protect its contents by itself anywhere anytime. Our is based on XACML, a generic language allowing fine-grain with rules conditions. We also...

10.1109/ccgrid.2015.84 article EN 2015-05-01

Label Embedding with Partial Heterogeneous Contexts

OPENALEX - Publications

Yaxin Shi Donna Xu Yuangang Pan Ivor W. Tsang Shirui Pan

Label embedding plays an important role in many real-world applications. To enhance the label relatedness captured by embeddings, multiple contexts can be adopted. However, these are heterogeneous and often partially observed practical tasks, imposing significant challenges to capture overall among labels. In this paper, we propose a general Partial Heterogeneous Context Embedding (PHCLE) framework address challenges. Categorizing into two groups, relational context descriptive context,...

10.1609/aaai.v33i01.33014926 article EN Proceedings of the AAAI Conference on Artificial Intelligence 2019-07-17

On inferring structural connectivity from brain functional-MRI data

OPENALEX - Publications

Somwrita Sarkar Sanjay Chawla Donna Xu

The anatomical structure of the brain can be observed via non-invasive techniques such as diffusion imaging. However, these are imperfect because they miss connections that actually known to exist, especially long range inter-hemispheric ones. In this paper we formulate inverse problem inferring structural connectivity networks from experimentally functional Magnetic Resonance Imaging (fMRI), by formulating it a convex optimization problem. We show modeled an optimal sparse representation...

10.48550/arxiv.1502.06659 preprint EN other-oa arXiv (Cornell University) 2015-01-01

Multiview Alignment and Generation in CCA via Consistent Latent Encoding

OPENALEX - Publications

Yaxin Shi Yuangang Pan Donna Xu Ivor W. Tsang

Multiview alignment, achieving one-to-one correspondence of multiview inputs, is critical in many real-world applications, especially for cross-view data analysis problems. An increasing amount work has studied this alignment problem with canonical correlation (CCA). However, existing CCA models are prone to misalign the multiple views due either neglect uncertainty or inconsistent encoding views. To tackle these two issues, letter studies from a Bayesian perspective. Delving into...

10.1162/neco_a_01309 article EN Neural Computation 2020-08-15

Label Embedding with Partial Heterogeneous Contexts

OPENALEX - Publications

Yaxin Shi Donna Xu Yuangang Pan Ivor W. Tsang Shirui Pan

10.48550/arxiv.1805.01199 preprint EN other-oa arXiv (Cornell University) 2018-01-01

Probabilistic CCA with Implicit Distributions

OPENALEX - Publications

Yaxin Shi Yuangang Pan Donna Xu Ivor W. Tsang

Canonical Correlation Analysis (CCA) is a classic technique for multi-view data analysis. To overcome the deficiency of linear correlation in practical learning tasks, various CCA variants were proposed to capture nonlinear dependency. However, it non-trivial have an in-principle understanding these due their inherent restrictive assumption on and latent code distributions. Although some works studied probabilistic interpretation CCA, models still require explicit form distributions achieve...

10.48550/arxiv.1907.02345 preprint EN other-oa arXiv (Cornell University) 2019-01-01

Multi-view Alignment and Generation in CCA via Consistent Latent Encoding

OPENALEX - Publications

Yaxin Shi Yuangang Pan Donna Xu Ivor W. Tsang

Multi-view alignment, achieving one-to-one correspondence of multi-view inputs, is critical in many real-world applications, especially for cross-view data analysis problems. Recently, an increasing number works study this alignment problem with Canonical Correlation Analysis (CCA). However, existing CCA models are prone to misalign the multiple views due either neglect uncertainty or inconsistent encoding views. To tackle these two issues, paper studies from Bayesian perspective. Delving...

10.48550/arxiv.2005.11716 preprint EN other-oa arXiv (Cornell University) 2020-01-01

Coming Soon ...