NFDI4DS | UHH-SEMS - Publication Details

Bo Qiao

ORCID: 0000-0002-8997-8317

About

Contact & Profiles

A5049886136

Research Areas

Cloud Computing and Resource Management
Software System Performance and Reliability
Network Security and Intrusion Detection
Anomaly Detection Techniques and Applications
Data Stream Mining Techniques
IoT and Edge/Fog Computing
Machine Learning and Data Classification
Software Engineering Research
Advanced Neural Network Applications
Blockchain Technology Applications and Security
Domain Adaptation and Few-Shot Learning
Machine Learning and Algorithms
Topic Modeling
Software Reliability and Analysis Research
Advanced Graph Neural Networks
Neural Networks and Applications
Time Series Analysis and Forecasting
Traffic Prediction and Management Techniques
Data Visualization and Analytics
Advanced Image and Video Retrieval Techniques
Data Management and Algorithms
Data Mining Algorithms and Applications
Adversarial Robustness in Machine Learning
Distributed and Parallel Computing Systems
Imbalanced Data Classification Techniques

Microsoft Research Asia (China)
2018-2024

Southern University of Science and Technology
2023

Microsoft Research (United Kingdom)
2019-2022

Hunan Agricultural University
2009-2021

Friedrich-Alexander-Universität Erlangen-Nürnberg
2018-2020

Northwestern Polytechnical University
2015

University of Reading
2008

Lanzhou Institute of Chemical Physics
2008

Chinese Academy of Sciences
2008

Northeastern University
2007

Robust log-based anomaly detection on unstable log data

OPENALEX - Publications

Xu Zhang, Yong Xu, Qingwei Lin, Bo Qiao, Hongyu Zhang, and 12 more

Logs are widely used by large and complex software-intensive systems for troubleshooting. There have been a lot of studies on log-based anomaly detection. To detect the anomalies, the existing methods mainly construct a detection model using log event data extracted from historical logs. However, we find that the existing methods do not work well in practice. These methods have the close-world assumption, which assumes that the log data is stable over time and the set of distinct log events is known. However, our empirical study shows that in practice, log data often contains previously unseen...

10.1145/3338906.3338931 article EN 2019-08-09

A joint model for entity and relation extraction based on BERT

Bo Qiao, Zhuoyang Zou, Yu Huang, Kui Fang, Xinghui Zhu, and 1 more

10.1007/s00521-021-05815-z article EN Neural Computing and Applications 2021-03-08

Preparation of highly effective ferric hydroxide supported noble metal catalysts for CO oxidations: From gold to palladium

Bo Qiao, Liang Liu, Junying Zhang, Yu Deng

10.1016/j.jcat.2008.11.012 article EN Journal of Catalysis 2008-12-07

Towards intelligent incident management: why we need it and how we make it

Zhuangbin Chen, Yu Kang, Liqun Li, Xu Zhang, Hongyu Zhang, and 12 more

The management of cloud service incidents (unplanned interruptions or outages of a service/product) greatly affects customer satisfaction and business revenue. After years of efforts, enterprises are able to solve most incidents automatically and timely. However, in practice, we still observe critical incidents that occurred in an unexpected manner and the orchestrated diagnosis workflow failed to mitigate them. In order to accelerate the understanding of unprecedented incidents and provide actionable recommendations, a modern incident management system employs...

10.1145/3368089.3417055 article EN 2020-11-08

Neural Feature Search: A Neural Architecture for Automated Feature Engineering

Xiangning Chen, Bo Qiao, Weiyi Zhang, Wei Wu, Murali Chintalapati, and 9 more

Feature engineering is a crucial step for developing effective machine learning models. Traditionally, feature engineering is performed manually, which requires much domain knowledge and is time-consuming. In recent years, many automated feature engineering methods have been proposed. These methods improve the accuracy of a machine learning model by automatically transforming the original features into a set of new features. However, existing methods either lack the ability to perform high-order transformations or suffer from the feature space explosion problem. In this paper, we present...

10.1109/icdm.2019.00017 article EN 2019 IEEE International Conference on Data Mining (ICDM) 2019-11-01

Onion: identifying incident-indicating logs for cloud systems

Xu Zhang, Yong Xu, Si Qin, Shilin He, Bo Qiao, and 8 more

In cloud systems, incidents affect the availability of services and require quick mitigation actions. Once an incident occurs, operators and developers often examine logs to perform fault diagnosis. However, the large volume of diverse logs and the overwhelming details in log data make the manual diagnosis process time-consuming and error-prone. In this paper, we propose Onion, an automatic solution for precisely and efficiently locating incident-indicating logs, which can provide useful clues for diagnosing the incidents. We first point...

10.1145/3468264.3473919 article EN 2021-08-18

TraceArk: Towards Actionable Performance Anomaly Alerting for Online Service Systems

Zhengran Zeng, Yuqun Zhang, Yong Xu, Minghua Ma, Bo Qiao, and 10 more

Performance anomaly alerting based on trace data plays an important role in assuring the quality of online service systems. However, engineers find that many anomalies reported by existing techniques are not of interest for them to take further actions. For a large-scale service with hundreds of different microservices, current methods either fire lots of false alarms by applying simple thresholds to temporal metrics (i.e., latency), or run a complex end-to-end deep learning model with limited interpretability. Engineers...

10.1109/icse-seip58684.2023.00029 article EN 2023-05-01

CloudDet: Interactive Visual Analysis of Anomalous Performances in Cloud Computing Systems

Ke Xu, Yun Wang, Leni Yang, Yifang Wang, Bo Qiao, and 4 more

Detecting and analyzing potential anomalous performances in cloud computing systems is essential for avoiding losses to customers and ensuring the efficient operation of the systems. To this end, a variety of automated techniques have been developed to identify anomalies in cloud computing. These techniques are usually adopted to track the performance metrics of the system (e.g., CPU, memory, and disk I/O), represented by a multivariate time series. However, given the complex characteristics of cloud computing data, the effectiveness of these methods is affected. Thus,...

10.1109/tvcg.2019.2934613 article EN IEEE Transactions on Visualization and Computer Graphics 2019-01-01

T-SMOTE: Temporal-oriented Synthetic Minority Oversampling Technique for Imbalanced Time Series Classification

Pu Zhao, Chuan Luo, Bo Qiao, Lu Wang, Saravan Rajmohan, and 2 more

Time series classification is a popular and important topic in machine learning, and it suffers from the class imbalance problem in many real-world applications. In this paper, to address the class imbalance problem, we propose a novel and practical oversampling method named T-SMOTE, which can make full use of the temporal information of time-series data. In particular, for each sample of the minority class, T-SMOTE generates multiple samples that are close to the class border. Then, based on those samples near the class border, T-SMOTE synthesizes more samples. Finally,...

10.24963/ijcai.2022/334 article EN Proceedings of the Thirty-First International Joint Conference on Artificial Intelligence 2022-07-01

Counter-Empirical Attacking based on Adversarial Reinforcement Learning for Time-Relevant Scoring System

Xiangguo Sun, Hong Cheng, Hang Dong, Bo Qiao, Si Qin, and 1 more

Scoring systems are commonly seen for platforms in the era of Big Data. From credit scoring systems in financial services to membership scores in E-commerce shopping platforms, platform managers use such systems to guide users towards the encouraged activity pattern, and manage resources more effectively and more efficiently. To establish such scoring systems, several "empirical criteria" are first determined, followed by a dedicated top-down design for each score factor, which usually requires enormous effort to adjust and tune the scoring function in the new application...

10.1109/tkde.2023.3341430 article EN IEEE Transactions on Knowledge and Data Engineering 2023-12-12

Intelligent Virtual Machine Provisioning in Cloud Computing

Chuan Luo, Bo Qiao, Xin Chen, Pu Zhao, Randolph Yao, and 4 more

Virtual machine (VM) provisioning is a common and critical problem in cloud computing. In industrial cloud platforms, there are a huge number of VMs provisioned per day. Due to the complexity and resource constraints, it needs to be carefully optimized to make cloud platforms effectively utilize the resources. Moreover, in practice, provisioning a VM from scratch requires a fairly long time, which would degrade the customer experience. Hence, it is advisable to provision VMs ahead for upcoming demands. In this work, we formulate the practical scenario as...

10.24963/ijcai.2020/208 article EN 2020-07-01

Efficient incident identification from multi-dimensional issue reports via meta-heuristic search

Jiazhen Gu, Chuan Luo, Si Qin, Bo Qiao, Qingwei Lin, and 8 more

In large-scale cloud systems, unplanned service interruptions and outages may cause severe degradation of service availability. Such incidents can occur in a bursty manner, which will deteriorate user satisfaction. Identifying incidents rapidly and accurately is critical to the operation and maintenance of a cloud system. In industrial practice, incidents are typically detected through analyzing the issue reports, which are generated over time by monitoring cloud services. Identifying incidents in a large number of issue reports is quite challenging. An issue report is multi-dimensional: it has many...

10.1145/3368089.3409741 article EN 2020-11-08

NTAM: Neighborhood-Temporal Attention Model for Disk Failure Prediction in Cloud Platforms

Chuan Luo, Pu Zhao, Bo Qiao, Youjiang Wu, Hongyu Zhang, and 6 more

With the rapid deployment of cloud platforms, high service reliability is of critical importance. An industrial cloud platform contains a huge number of disks, and disk failure is a common cause of service unreliability. In recent years, many machine learning based disk failure prediction approaches have been proposed, and they can predict disk failures based on disk status data before the failures actually happen. In this way, proactive actions can be taken in advance to improve service reliability. However, existing approaches treat each disk individually and do not explore the influence of the neighboring...

10.1145/3442381.3449867 article EN 2021-04-19

PULNS: Positive-Unlabeled Learning with Effective Negative Sample Selector

Chuan Luo, Pu Zhao, Chen Chen, Bo Qiao, Chao Du, and 6 more

Positive-unlabeled learning (PU learning) is an important case of binary classification where the training data only contains positive and unlabeled samples. The current state-of-the-art approach for PU learning is the cost-sensitive approach, which casts PU learning as a cost-sensitive classification problem and relies on an unbiased risk estimator for correcting the bias introduced by the unlabeled samples. However, this approach requires the knowledge of the class prior and is subject to potential label noise. In this paper, we propose a novel PU learning approach dubbed PULNS, equipped with an effective negative sample selector, which is optimized...

10.1609/aaai.v35i10.17064 article EN Proceedings of the AAAI Conference on Artificial Intelligence 2021-05-18

UFO: A UI-Focused Agent for Windows OS Interaction

Chaoyun Zhang, Liqun Li, Shilin He, Xu Zhang, Bo Qiao, and 7 more

We introduce UFO, an innovative UI-Focused agent to fulfill user requests tailored to applications on Windows OS, harnessing the capabilities of GPT-Vision. UFO employs a dual-agent framework to meticulously observe and analyze the graphical user interface (GUI) and control information of Windows applications. This enables the agent to seamlessly navigate and operate within individual applications and across them to fulfill user requests, even when spanning multiple applications. The framework incorporates a control interaction module, facilitating action grounding without human intervention and enabling...

10.48550/arxiv.2402.07939 preprint EN arXiv (Cornell University) 2024-02-08

Assess and Summarize: Improve Outage Understanding with Large Language Models

Pengxiang Jin, Shenglin Zhang, Minghua Ma, Haozhe Li, Yu Kang, and 11 more

Cloud systems have become increasingly popular in recent years due to their flexibility and scalability. Each time cloud computing applications and services hosted on the cloud are affected by a cloud outage, users can experience slow response times, connection issues or total service disruption, resulting in a significant negative business impact. Outages are usually comprised of several concurring events/source causes, and therefore understanding the context of outages is a very challenging yet crucial first step toward...

10.1145/3611643.3613891 article EN 2023-11-30

Automatic Kernel Fusion for Image Processing DSLs

Bo Qiao, Oliver Reiche, Frank Hannig, Jürgen Teich

Programming image processing algorithms on hardware accelerators such as graphics processing units (GPUs) often exhibits a trade-off between software portability and performance portability. Domain-specific languages (DSLs) have proven to be a promising remedy, which enable optimizations and generation of efficient code from a concise, high-level algorithm representation.

10.1145/3207719.3207723 article EN 2018-05-28

Fast Outage Analysis of Large-Scale Production Clouds with Service Correlation Mining

Yaohui Wang, Guozheng Li, Zijian Wang, Yu Kang, Yangfan Zhou, and 11 more

Cloud-based services are surging into popularity in recent years. However, outages, i.e., severe incidents that always impact multiple services, can dramatically affect user experience and incur economic losses. Locating the root-cause service, i.e., the service that contains the root cause of the outage, is a crucial step to mitigate the outage. In current industrial practice, this is generally performed in a bootstrap manner and largely depends on human efforts: the service that directly causes the outage is identified first, and the suspected root cause is traced back...

10.1109/icse43902.2021.00085 article EN 2021-05-01

Correlation-Aware Heuristic Search for Intelligent Virtual Machine Provisioning in Cloud Systems

Chuan Luo, Bo Qiao, Wenqian Xing, Xin Chen, Pu Zhao, and 8 more

The optimization of resources is crucial for the operation of public cloud systems such as Microsoft Azure, as well as servers dedicated to the workloads of large customers such as Microsoft 365. Those optimization tasks often need to take unknown parameters into consideration and can be formulated as Prediction+Optimization problems. This paper proposes a new method named Correlation-Aware Heuristic Search (CAHS) that is capable of accounting for the uncertainty in unknown parameters and delivering effective solutions to difficult optimization problems. We apply this method to solving the predictive virtual machine...

10.1609/aaai.v35i14.17467 article EN Proceedings of the AAAI Conference on Artificial Intelligence 2021-05-18

Snape: Reliable and Low-Cost Computing with Mixture of Spot and On-Demand VMs

Fangkai Yang, Lu Wang, Zhenyu Xu, Jue Zhang, Liqun Li, and 13 more

Cloud providers often have resources that are not being fully utilized, and they may offer them at a lower cost to make up for the reduced availability of these resources. However, customers may be hesitant to use such offerings (such as spot VMs) as making trade-offs between cost and resource availability is not always straightforward. In this work, we propose Snape (Spot On-demand Perfect Mixture), an intelligent framework to optimize the cost and resource availability trade-offs by dynamically mixing on-demand VMs with spot VMs. Through a detailed characterization based on real...

10.1145/3582016.3582028 article EN 2023-03-20

Differentiable gated autoencoders for unsupervised feature selection

Zebin Chen, Jintang Bian, Bo Qiao, Xiaohua Xie

10.1016/j.neucom.2024.128202 article EN Neurocomputing 2024-07-18

From loop fusion to kernel fusion: a domain-specific approach to locality optimization

Bo Qiao, Oliver Reiche, Frank Hannig, Jürgen Teich

Optimizing data-intensive applications such as image processing for GPU targets with complex memory hierarchies requires exploring the tradeoffs among locality, parallelism, and computation. Loop fusion, one of the classical optimization techniques, has been proven effective to improve locality at the function level. Algorithms in image processing are increasing their complexities and generally consist of many kernels in a pipeline. The inter-kernel communications are intensive and exhibit another opportunity for locality improvement at the system level. The scope...

10.5555/3314872.3314901 article EN Symposium on Code Generation and Optimization 2019-02-16

AutoCCAG: An Automated Approach to Constrained Covering Array Generation

Chuan Luo, Jinkun Lin, Shaowei Cai, Xin Chen, Bing He, and 7 more

Combinatorial interaction testing (CIT) is an important technique for testing highly configurable software systems with demonstrated effectiveness in practice. The goal of CIT is to generate test cases covering the interactions of configuration options, under certain hard constraints. In this context, constrained covering arrays (CCAs) are frequently used as test cases in CIT. Constrained Covering Array Generation (CCAG) is an NP-hard combinatorial optimization problem, solving which requires an effective method for generating small CCAs....

10.1109/icse43902.2021.00030 article EN 2021-05-01

LS-sampling: an effective local search based sampling approach for achieving high t-wise coverage

Chuan Luo, Binqi Sun, Bo Qiao, Junjie Chen, Hongyu Zhang, and 3 more

There has been a rapidly increasing demand for developing highly configurable software systems, which urgently calls for effective testing methods. In practice, t-wise coverage has been widely recognized as a useful metric to evaluate the quality of a test suite, and achieving high t-wise coverage is important for ensuring test adequacy. However, state-of-the-art methods usually cost a fairly long time to generate large test suites for high pairwise coverage (i.e., 2-wise coverage), which would lead to ineffective and inefficient testing of highly configurable systems. In this paper, we propose a novel local...

10.1145/3468264.3468622 article EN 2021-08-18
