NFDI4DS | UHH-SEMS - Publication Details

Qinbao Song

ORCID: 0000-0001-5374-9523

Publications

Citations

Views

---

Saved

---

About

Contact & Profiles

A5102831118

Research Areas

Data Mining Algorithms and Applications
Face and Expression Recognition
Machine Learning and Data Classification
Text and Document Classification Technologies
Complex Network Analysis Techniques
Rough Sets and Fuzzy Logic
Advanced Clustering Algorithms Research
Imbalanced Data Classification Techniques
Data Management and Algorithms
Network Security and Intrusion Detection
Web Data Mining and Analysis
Data Stream Mining Techniques
Advanced Graph Neural Networks
Software Reliability and Analysis Research
Software System Performance and Reliability
Opinion Dynamics and Social Influence
Business Process Modeling and Analysis
Algorithms and Data Compression
Service-Oriented Architecture and Web Services
Video Analysis and Summarization
Gene expression and cancer classification
Advanced Image and Video Retrieval Techniques
Grey System Theory Applications
Evaluation and Optimization Models
Fuzzy Logic and Control Systems

Xi'an Jiaotong University
2009-2018

Wuhan University
2016

State Key Laboratory of Software Engineering
2016

The University of Texas at Dallas
2013

Northwest University
2009

Xi'an University of Science and Technology
2007

École Centrale de Lyon
2003

A Fast Clustering-Based Feature Subset Selection Algorithm for High-Dimensional Data

OPENALEX - Publications

Qinbao Song Jingjie Ni Guangtao Wang

Feature selection involves identifying a subset of the most useful features that produces compatible results as original entire set features. A feature algorithm may be evaluated from both efficiency and effectiveness points view. While concerns time required to find features, is related quality Based on these criteria, fast clustering-based (FAST) proposed experimentally in this paper. The FAST works two steps. In first step, are divided into clusters by using graph-theoretic clustering...

10.1109/tkde.2011.181 article EN IEEE Transactions on Knowledge and Data Engineering 2011-08-25

A novel ensemble method for classifying imbalanced data

OPENALEX - Publications

Zhongbin Sun Qinbao Song Xiaoyan Zhu Heli Sun Baowen Xu and 1 more

10.1016/j.patcog.2014.11.014 article EN Pattern Recognition 2014-11-30

Towards Online Multiresolution Community Detection in Large-Scale Networks

OPENALEX - Publications

Jianbin Huang Heli Sun Yaguang Liu Qinbao Song Tim Weninger

The investigation of community structure in networks has aroused great interest multiple disciplines. One the challenges is to find local communities from a starting vertex network without global information about entire network. Many existing methods tend be accurate depending on priori assumptions properties and predefined parameters. In this paper, we introduce new quality function present fast expansion algorithm for uncovering large-scale networks. proposed can detect multiresolution...

10.1371/journal.pone.0023829 article EN cc-by PLoS ONE 2011-08-24

Automatic Clustering via Outward Statistical Testing on Density Metrics

OPENALEX - Publications

Guangtao Wang Qinbao Song

Clustering is one of the research hotspots in field data mining and has extensive applications practice. Recently, Rodriguez Laio [1] published a clustering algorithm on Science that identifies centers an intuitive way clusters objects efficiently effectively. However, sensitive to preassigned parameter suffers from identification "ideal" number clusters. To overcome these shortages, this paper proposes new can detect automatically via statistical testing. Specifically, proposed first...

10.1109/tkde.2016.2535209 article EN IEEE Transactions on Knowledge and Data Engineering 2016-02-26

Revealing Density-Based Clustering Structure from the Core-Connected Tree of a Network

OPENALEX - Publications

Jianbin Huang Heli Sun Qinbao Song Hongbo Deng Jiawei Han

Clustering is an important technique for mining the intrinsic community structures in networks. The density-based network clustering method able to not only detect communities of arbitrary size and shape, but also identify hubs outliers. However, it requires manual parameter specification define clusters, sensitive density threshold which difficult determine. Furthermore, many real-world networks exhibit a hierarchical structure with embedded within other communities. Therefore, result...

10.1109/tkde.2012.100 article EN IEEE Transactions on Knowledge and Data Engineering 2013-06-28

Automatic recommendation of classification algorithms based on data set characteristics

OPENALEX - Publications

Qinbao Song Guangtao Wang Chao Wang

10.1016/j.patcog.2011.12.025 article EN Pattern Recognition 2012-01-11

A Feature Subset Selection Algorithm Automatic Recommendation Method

OPENALEX - Publications

Gang Wang Qinbao Song Heli Sun X. Zhang Baowen Xu and 1 more

Many feature subset selection (FSS) algorithms have been proposed, but not all of them are appropriate for a given problem. At the same time, so far there is rarely good way to choose FSS problem at hand. Thus, algorithm automatic recommendation very important and practically useful. In this paper, meta learning based method presented. The proposed first identifies data sets that most similar one hand by k-nearest neighbor classification algorithm, distances among these calculated on...

10.1613/jair.3831 article EN cc-by Journal of Artificial Intelligence Research 2013-05-15

A new unsupervised feature selection algorithm using similarity‐based feature clustering

OPENALEX - Publications

Xiaoyan Zhu Yu Wang Yingbin Li Yonghui Tan Guangtao Wang and 1 more

Abstract Unsupervised feature selection is an important problem, especially for high‐dimensional data. However, until now, it has been scarcely studied and the existing algorithms cannot provide satisfying performance. Thus, in this paper, we propose a new unsupervised algorithm using similarity‐based clustering, Feature Selection‐based Clustering (FSFC). FSFC removes redundant features according to results of clustering based on similarity. First, clusters their A proposed, which overcomes...

10.1111/coin.12192 article EN Computational Intelligence 2018-10-03

Detecting concept drift: An information entropy based method using an adaptive sliding window

OPENALEX - Publications

Lei Du Qinbao Song Xiaolin Jia

Concept drift in data stream poses many challenges and difficulties mining this tradition-distinct database. In paper, we focus on detecting concept evolving stream. We propose a novel method to detect using entrop

10.3233/ida-140645 article EN Intelligent Data Analysis 2014-04-30

Mining web browsing patterns for E-commerce

OPENALEX - Publications

Qinbao Song Martin Shepperd

10.1016/j.compind.2005.11.006 article EN Computers in Industry 2006-07-04

A Selective Detector Ensemble for Concept Drift Detection

OPENALEX - Publications

Lei Du Qinbao Song Lei Zhu Xiaoyan Zhu

Concept drifts usually originate from many causes instead of only one, which result in two types concept drifts: abrupt and gradual drifts. From the point view speed, pose strong challenges for data stream mining. In this paper, we propose a selective detector ensemble to detect both We first present our construction method, then introduce how use with proposed early-find-early-report rule. To evaluate performance compare it four drift detection methods on eight publicly available sets...

10.1093/comjnl/bxu050 article EN The Computer Journal 2014-06-20

Missing Data Imputation Techniques

OPENALEX - Publications

Qinbao Song Martin Shepperd

Intelligent data analysis techniques are useful for better exploring real-world sets. However, the sets always accompanied by missing that is one major factor affecting quality. At same time, good intelligent exploration requires quality data. Fortunately, Missing Data Imputation Techniques (MDITs) can be used to improve no method MDIT in all conditions, each has its own context. In this paper, we introduce MDITs KDD and machine learning communities presenting basic idea highlighting...

10.1504/ijbidm.2007.015485 article EN International Journal of Business Intelligence and Data Mining 2007-01-01

Selecting feature subset for high dimensional data via the propositional FOIL rules

OPENALEX - Publications

Guangtao Wang Qinbao Song Baowen Xu Yuming Zhou

10.1016/j.patcog.2012.07.028 article EN Pattern Recognition 2012-08-16

A dissimilarity-based imbalance data classification algorithm

OPENALEX - Publications

Xueying Zhang Qinbao Song Guangtao Wang Kaiyuan Zhang Liang He and 1 more

10.1007/s10489-014-0610-5 article EN Applied Intelligence 2014-11-21

Backward Path Growth for Efficient Mobile Sequential Recommendation

OPENALEX - Publications

Jianbin Huang Xuejun Huangfu Heli Sun Hui Li Peixiang Zhao and 2 more

The problem of mobile sequential recommendation is to suggest a route connecting set pick-up points for taxi driver so that he/she more likely get passengers with less travel cost. Essentially, key challenge this its high computational complexity. In paper, we propose novel dynamic programming based method solve the consisting two separate stages: an offline pre-processing stage and online search stage. pre-computes potential candidate sequences from points. A backward incremental sequence...

10.1109/tkde.2014.2298012 article EN IEEE Transactions on Knowledge and Data Engineering 2014-01-31

IncOrder: Incremental density-based community detection in dynamic networks

OPENALEX - Publications

Heli Sun Jianbin Huang Xin Zhang Jiao Liu Dong Wang and 3 more

10.1016/j.knosys.2014.07.015 article EN Knowledge-Based Systems 2014-10-05

LinkLPA: A Link‐Based Label Propagation Algorithm for Overlapping Community Detection in Networks

OPENALEX - Publications

Heli Sun Jiao Liu Jianbin Huang Guangtao Wang Xiaolin Jia and 1 more

Community detection is an important methodology for understanding the intrinsic structure and function of complex networks. Because overlapping community one characteristics real‐world networks should be considered detection, in this article, we propose algorithm, called link‐based label propagation algorithm (LinkLPA), to detect communities. link partition conceptually natural problem LinkLPA first transforms node into employs a new with preference on links instead nodes communities due...

10.1111/coin.12087 article EN Computational Intelligence 2016-03-17

CenLP: A centrality-based label propagation algorithm for community detection in networks

OPENALEX - Publications

Heli Sun Jiao Liu Jianbin Huang Guangtao Wang Zhou Yang and 2 more

10.1016/j.physa.2015.05.080 article EN Physica A Statistical Mechanics and its Applications 2015-05-21

Detecting overlapping communities in networks via dominant label propagation

OPENALEX - Publications

Heli Sun Jianbin Huang Yongqiang Tian Qinbao Song Huai-Liang Liu

Community detection is an important methodology for understanding the intrinsic structure and function of a real-world network. In this paper, we propose effective efficient algorithm, called Dominant Label Propagation Algorithm (Abbreviated as DLPA), to detect communities in complex networks. The algorithm simulates special voting process overlapping non-overlapping community networks simultaneously. Our very efficient, since its computational complexity almost linear number edges...

10.1088/1674-1056/24/1/018703 article EN Chinese Physics B 2015-01-01

An improved data characterization method and its application in classification algorithm recommendation

OPENALEX - Publications

Guangtao Wang Qinbao Song Xiaoyan Zhu

10.1007/s10489-015-0689-3 article EN Applied Intelligence 2015-07-01

A Multi-Label Learning Based Kernel Automatic Recommendation Method for Support Vector Machine

OPENALEX - Publications

Xueying Zhang Qinbao Song

Choosing an appropriate kernel is very important and critical when classifying a new problem with Support Vector Machine. So far, more attention has been paid on constructing kernels choosing suitable parameter values for specific function, but less selection. Furthermore, most of current selection methods focus seeking best the highest classification accuracy via cross-validation, they are time consuming ignore differences among number support vectors CPU SVM different kernels. Considering...

10.1371/journal.pone.0120455 article EN cc-by PLoS ONE 2015-04-20

A Generic Multilabel Learning-Based Classification Algorithm Recommendation Method

OPENALEX - Publications

Guangtao Wang Qinbao Song Xueying Zhang Kaiyuan Zhang

As more and classification algorithms continue to be developed, recommending appropriate a given problem is increasingly important. This article first distinguishes the algorithm recommendation methods by two dimensions: (1) meta-features, which are set of measures used characterize learning problems, (2) meta-target, represents relative performance on problem. In contrast existing whose meta-target usually in form either ranking candidate or single algorithm, this proposes new natural...

10.1145/2629474 article EN ACM Transactions on Knowledge Discovery from Data 2014-10-09

An empirical analysis of package-modularization metrics: Implications for software fault-proneness

OPENALEX - Publications

Yangyang Zhao Yibiao Yang Hongmin Lu Yuming Zhou Qinbao Song and 1 more

10.1016/j.infsof.2014.09.006 article EN Information and Software Technology 2014-10-11

Predicting the number of nearest neighbors for the k-NN classification algorithm

OPENALEX - Publications

Xueying Zhang Qinbao Song

k-Nearest Neighbor (k-NN) is one of the most widely used classification algorithms. When classifying a new instance, k-NN first finds out its k nearest neighbors, and then classifies it by voting for categories neighbors. Therefo

10.3233/ida-140650 article EN Intelligent Data Analysis 2014-04-30

A Simultaneous Confidence Band for Dense Longitudinal Regression

OPENALEX - Publications

Qinbao Song Ruoxue Liu Qin Shao Lijian Yang

AbstractWe present a method of using local linear smoothing to construct simultaneous confidence bands for the mean function densely spaced functional data. Our approach works well under mild conditions. In addition, estimator and its accompanying band enjoy semiparametric efficiency in sense that they are asymptotically equivalent counterparts obtained from random trajectories entirely observed without errors. We illustrate performance proposed through simulation study. Furthermore, an...

10.1080/03610926.2012.729643 article EN Communication in Statistics- Theory and Methods 2013-04-14

Coming Soon ...