Tao Wang

ORCID: 0000-0003-2369-2129
Publications
Citations
Views
---
Saved
---
About
Contact & Profiles
Research Areas
  • Video Surveillance and Tracking Methods
  • Advanced Image and Video Retrieval Techniques
  • Graph Theory and Algorithms
  • Advanced Neural Network Applications
  • Face recognition and analysis
  • Advanced Graph Neural Networks
  • Data Management and Algorithms
  • Text and Document Classification Technologies
  • Biometric Identification and Security
  • Topic Modeling
  • Generative Adversarial Networks and Image Synthesis
  • Face and Expression Recognition
  • Image Retrieval and Classification Techniques
  • Advanced Vision and Imaging
  • Image Enhancement Techniques
  • Human Pose and Action Recognition
  • Network Packet Processing and Optimization
  • Advanced Computational Techniques and Applications
  • Natural Language Processing Techniques
  • Chaos-based Image/Signal Encryption
  • Advanced Image Processing Techniques
  • Multimodal Machine Learning Applications
  • Infrared Target Detection Methodologies
  • Advanced Text Analysis Techniques
  • Speech and Audio Processing

Beijing Jiaotong University
2016-2025

Alibaba Group (United States)
2025

Beijing Municipal Education Commission
2023-2024

Xi'an High Tech University
2024

Institute of Information Engineering
2024

Chinese Academy of Sciences
2024

Applied Mathematics (United States)
2024

University of Pennsylvania
2024

Changchun University of Technology
2023

Chengdu Neusoft University
2023

We present a novel boosting cascade based face detection framework using SURF features. The is derived from the well-known Viola-Jones (VJ) but distinguished by two key contributions. First, proposed deals with only several hundreds of multidimensional local patches instead thousands single dimensional haar features in VJ framework. Second, it takes AUC as criterion for convergence test each stage rather than conflicting criteria (false-positive-rate and detection-rate) These modifications...

10.1109/iccvw.2011.6130518 article EN 2011-11-01

In the past few decades, to reduce risk of X-ray in computed tomography (CT), low-dose CT image denoising has attracted extensive attention from researchers, which become an important research issue field medical images. recent years, with rapid development deep learning technology, many algorithms have emerged apply convolutional neural networks this task, achieving promising results. However, there are still some problems such as low efficiency, over-smoothed result, etc. paper, we propose...

10.1109/icsp48669.2020.9320928 preprint EN 2022 16th IEEE International Conference on Signal Processing (ICSP) 2020-12-06

Multi-Label Learning (MLL) aims to learn from the training data where each example is represented by a single instance while associated with set of candidate labels. Most existing MLL methods are typically designed handle problem missing However, in many real-world scenarios, labeling information for multi-label always redundant , which can not be solved classical methods, thus novel Partial Multi-label (PML) framework proposed cope such problem, i.e. removing noisy labels sets. In this...

10.1609/aaai.v33i01.33015016 article EN Proceedings of the AAAI Conference on Artificial Intelligence 2019-07-17

Partial Label Learning (PLL) aims to learn from the data where each training example is associated with a set of candidate labels, among which only one correct. The key deal such problem disambiguate label sets and obtain correct assignments between instances their labels. In this paper, we interpret as instance-to-label matchings, reformulate task PLL matching selection problem. To model problem, propose novel Graph Matching based (GM-PLL) framework, (GM) scheme incorporated owing its...

10.1109/tkde.2019.2933837 article EN IEEE Transactions on Knowledge and Data Engineering 2019-08-08

Matching-based algorithms have been commonly used in planar object tracking. They often model a as set of keypoints, and then find correspondences between keypoint sets via descriptor matching. In previous work, unary constraints on appearances or locations are usually to guide the However, these approaches rarely utilize structure information object, thus suffering from various perturbation factors. this paper, we proposed graph-based tracker, named Gracker, which is able fully explore...

10.1109/tpami.2017.2716350 article EN IEEE Transactions on Pattern Analysis and Machine Intelligence 2017-06-16

Downsampling is a crucial task for processing large scale and/or dense point clouds with limited resources. Owing to the development of deep learning, approaches task-oriented cloud downsampling have significant performance gains in preserving geometric information. However, most downsamling methods are by disordered and unstructured data, making it difficult continually improve performance. To address this issue, we propose light-weight Transformer network (LighTN) as an end-to-end...

10.1109/tmm.2023.3318073 article EN IEEE Transactions on Multimedia 2023-09-21

10.1109/icassp49660.2025.10889048 article EN ICASSP 2022 - 2022 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) 2025-03-12

Graph matching aims at establishing correspondences between graph elements, and is widely used in many computer vision tasks. Among recently proposed algorithms, those utilizing the path following strategy have attracted special research attentions due to their exhibition of state-of-the-art performances. However, paths computed these algorithms often contain singular points, which could hurt performance if not dealt properly. To deal with this issue, we propose a novel strategy, named...

10.1109/tpami.2017.2767591 article EN publisher-specific-oa IEEE Transactions on Pattern Analysis and Machine Intelligence 2017-10-30

Learning-based approaches to graph matching have been developed and explored for more than a decade, grown rapidly in scope popularity recent years. However, previous learning-based algorithms, with or without deep learning strategy, mainly focus on the of node and/or edge affinities generation, pay less attention combinatorial solver. In this paper we propose fully trainable framework matching, which solving optimization are not explicitly separated as many arts. We firstly convert problem...

10.1109/cvpr42600.2020.00759 article EN 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) 2020-06-01

Human facial age estimation has attracted much attention due to its potential applications in forensics, security, and biometrics. In contrast existing approaches that cast as either a multiclass classification or regression problem, this work, we propose novel approach combines the strength of cost-sensitive label ranking methods with power low-rank matrix recovery theories. Instead having make binary decision for each label, our ranks labels descending order terms their predicted relevance...

10.1109/tmm.2016.2608786 article EN IEEE Transactions on Multimedia 2016-09-13

With the increasing prevalence of intelligent traffic control and monitoring, research on vehicle re-identification (Re-ID) draws substantial attention in recent years. Different from other cross-view searching tasks such as person Re-ID, Re-ID problem is more challenging unpredictable viewpoint variations can greatly affect appearance vehicles. Existing studies mainly focus extracting global features based visual to represent identity target vehicle, while impact variation rarely...

10.1109/tits.2020.3025387 article EN IEEE Transactions on Intelligent Transportation Systems 2020-10-01

This paper proposes a general framework called Markov stationary features (MSF) to extend histogram based features. The MSF characterizes the spatial co-occurrence of patterns by chain models, and finally yields compact feature representation through analysis. Therefore, goes one step beyond histograms since it now involves structure information both within bins between bins. Moreover, still keeps simplicity, compactness, efficiency, robustness. We demonstrate how is used like color...

10.1109/cvpr.2008.4587839 article EN 2009 IEEE Conference on Computer Vision and Pattern Recognition 2008-06-01

Abstract In knowledge graph embedding, multidimensional representations of entities and relations are learned in vector space. Although distance-based embedding methods have shown promise link prediction, they neglect context information among the triplet components, i.e., head_entity, relation, tail_entity, limiting their ability to describe multivariate relation patterns mapping properties. Such denotes entity structural association inside same implies correlation between that not directly...

10.1007/s11063-024-11481-8 article EN cc-by Neural Processing Letters 2024-02-07

Visible-Infrared person Re-IDentification (VI-ReID) is a challenging cross-modality image retrieval task that aims to match pedestrians' images across visible and infrared cameras. To solve the modality gap, existing mainstream methods adopt learning paradigm converting into an classification with cross-entropy loss auxiliary metric losses. These losses follow strategy of adjusting distribution extracted embeddings reduce intra-class distance increase inter-class distance. However, such...

10.1109/tcsvt.2024.3377252 article EN IEEE Transactions on Circuits and Systems for Video Technology 2024-03-18

The advent of the information and intelligence era has led to explosive growth data. author proposes a hybrid data model based on differential privacy. main content this is study privacy, processing through noise mechanism, using calculation tuple attribute differences addition, finally constructing mixed privacy experiments. experimental results indicate that: as value k increases, clustering tend be optimal, verifying that original can reduce addition. However, ICMD-DP anonymizes dataset,...

10.12694/scpe.v26i2.3958 article EN Scalable Computing Practice and Experience 2025-02-10
Coming Soon ...