NFDI4DS | UHH-SEMS - Publication Details

Pengfei Xu

ORCID: 0000-0002-7304-734X

Publications

Citations

Views

---

Saved

---

About

Contact & Profiles

A5100600605

Research Areas

Advanced Image and Video Retrieval Techniques
Advanced Neural Network Applications
Video Surveillance and Tracking Methods
Image Retrieval and Classification Techniques
Human Pose and Action Recognition
Advanced Vision and Imaging
Adversarial Robustness in Machine Learning
Anomaly Detection Techniques and Applications
Robotics and Sensor-Based Localization
Domain Adaptation and Few-Shot Learning
Video Analysis and Summarization
Text and Document Classification Technologies
Music and Audio Processing
Face and Expression Recognition
Multimodal Machine Learning Applications
Stochastic Gradient Optimization Techniques
Statistical Methods and Inference
Distributed Sensor Networks and Detection Algorithms
Caching and Content Delivery
Fire Detection and Safety Systems
Medical Image Segmentation Techniques
Remote-Sensing Image Classification
Image and Signal Denoising Methods
Machine Learning and Data Classification
Mobile Agent-Based Network Management

Nanjing Audit University
2023

Pingdingshan University
2011-2022

Didi Chuxing (China)
2019-2021

Nanjing University of Posts and Telecommunications
2021

Rice University
2019-2020

University of Helsinki
2020

University of Science and Technology Liaoning
2020

Wuhan Ship Development & Design Institute
2014

Harbin Institute of Technology
2008-2013

Shanghai University
2011-2012

An End-to-End Visual-Audio Attention Network for Emotion Recognition in User-Generated Videos

OPENALEX - Publications

Sicheng Zhao Yunsheng Ma Yang Gu Jufeng Yang Tengfei Xing and 4 more

Emotion recognition in user-generated videos plays an important role human-centered computing. Existing methods mainly employ traditional two-stage shallow pipeline, i.e. extracting visual and/or audio features and training classifiers. In this paper, we propose to recognize video emotions end-to-end manner based on convolutional neural networks (CNNs). Specifically, develop a deep Visual-Audio Attention Network (VAANet), novel architecture that integrates spatial, channel-wise, temporal...

10.1609/aaai.v34i01.5364 article EN Proceedings of the AAAI Conference on Artificial Intelligence 2020-04-03

Dual Dynamic Inference: Enabling More Efficient, Adaptive, and Controllable Deep Inference

OPENALEX - Publications

Yue Wang Jianghao Shen Ting-Kuei Hu Pengfei Xu Tan M. Nguyen and 3 more

State-of-the-art convolutional neural networks (CNNs) yield record-breaking predictive performance, yet at the cost of high-energy-consumption inference, that prohibits their widely deployments in resource-constrained Internet Things (IoT) applications. We propose a dual dynamic inference (DDI) framework highlights following aspects: 1) we integrate both input-dependent and resource-dependent mechanisms under unified order to fit varying IoT resource requirements practice. DDI is able...

10.1109/jstsp.2020.2979669 article EN publisher-specific-oa IEEE Journal of Selected Topics in Signal Processing 2020-03-09

E2-Train: Training State-of-the-art CNNs with Over 80% Energy Savings

OPENALEX - Publications

Yue Wang Ziyu Jiang Xiaohan Chen Pengfei Xu Yang Zhao and 2 more

Convolutional neural networks (CNNs) have been increasingly deployed to edge devices. Hence, many efforts made towards efficient CNN inference in resource-constrained platforms. This paper attempts explore an orthogonal direction: how conduct more energy-efficient training of CNNs, so as enable on-device training. We strive reduce the energy cost during training, by dropping unnecessary computations from three complementary levels: stochastic mini-batch on data level; selective layer update...

10.48550/arxiv.1910.13349 preprint EN other-oa arXiv (Cornell University) 2019-01-01

AutoDNNchip

OPENALEX - Publications

Pengfei Xu Xiaofan Zhang Cong Hao Yang Zhao Yongan Zhang and 5 more

Recent breakthroughs in Deep Neural Networks (DNNs) have fueled a growing demand for DNN chips. However, designing chips is non-trivial because: (1) mainstream DNNs millions of parameters and operations; (2) the large design space due to numerous choices dataflows, processing elements, memory hierarchy, etc.; (3) an algorithm/hardware co-design needed allow same functionality different decomposition, which would require hardware IPs meet application specifications. Therefore, take long time...

10.1145/3373087.3375306 preprint EN 2020-02-23

Fractional Skipping: Towards Finer-Grained Dynamic CNN Inference

OPENALEX - Publications

Jianghao Shen Yue Wang Pengfei Xu Yonggan Fu Zhangyang Wang and 1 more

While increasingly deep networks are still in general desired for achieving state-of-the-art performance, many specific inputs a simpler network might already suffice. Existing works exploited this observation by learning to skip convolutional layers an input-dependent manner. However, we argue their binary decision scheme, i.e., either fully executing or completely bypassing one layer input, can be enhanced introducing finer-grained, “softer” decisions. We therefore propose Dynamic...

10.1609/aaai.v34i04.6025 article EN Proceedings of the AAAI Conference on Artificial Intelligence 2020-04-03

Rethinking Distributional Matching Based Domain Adaptation

OPENALEX - Publications

Bo Li Yezhen Wang Tong Che Shanghang Zhang Sicheng Zhao and 4 more

Domain adaptation (DA) is a technique that transfers predictive models trained on labeled source domain to an unlabeled target domain, with the core difficulty of resolving distributional shift between domains. Currently, most popular DA algorithms are based matching (DM). However in practice, realistic shifts (RDS) may violate their basic assumptions and as result these methods will fail. In this paper, order devise robust algorithms, we first systematically analyze limitations DM methods,...

10.48550/arxiv.2006.13352 preprint EN cc-by arXiv (Cornell University) 2020-01-01

A Densely Connected Network Based on U-Net for Medical Image Segmentation

OPENALEX - Publications

Zhenzhen Yang Pengfei Xu Yongpeng Yang Bing‐Kun Bao

The U-Net has become the most popular structure in medical image segmentation recent years. Although its performance for is outstanding, a large number of experiments demonstrate that classical network architecture seems to be insufficient when size targets changes and imbalance happens between target background different forms segmentation. To improve architecture, we develop new named densely connected (DenseUNet) this article. proposed DenseUNet adopts dense block feature extraction...

10.1145/3446618 article EN ACM Transactions on Multimedia Computing Communications and Applications 2021-07-22

Video indexing and recommendation based on affective analysis of viewers

OPENALEX - Publications

Sicheng Zhao Hongxun Yao Xiaoshuai Sun Pengfei Xu Xianming Liu and 1 more

Most previous works on video indexing and recommendation were only based the content of itself, without considering affective analysis viewers, which is an efficient important way to reflect viewers' attitudes, feelings evaluations videos. In this paper, we propose a novel method index recommend videos analysis, mainly facial expression recognition viewers. We first build classifier by embedding process building compositional Haar-like features into hidden conditional random fields (HCRFs)....

10.1145/2072298.2072043 article EN Proceedings of the 30th ACM International Conference on Multimedia 2011-11-28

Communication-efficient and Byzantine-robust distributed learning with statistical guarantee

OPENALEX - Publications

Xingcai Zhou Le Chang Pengfei Xu Shaogao Lv

10.1016/j.patcog.2023.109312 article EN Pattern Recognition 2023-01-11

Curriculum CycleGAN for Textual Sentiment Domain Adaptation with Multiple Sources

OPENALEX - Publications

Sicheng Zhao Yang Xiao Jiang Guo Xiangyu Yue Jufeng Yang and 3 more

Sentiment analysis of user-generated reviews or comments on products and services in social networks can help enterprises to analyze the feedback from customers take corresponding actions for improvement. To mitigate large-scale annotations target domain, domain adaptation (DA) provides an alternate solution by learning a transferable model other labeled source domains. Existing multi-source (MDA) methods either fail extract some discriminative features that are related sentiment, neglect...

10.1145/3442381.3449981 article EN 2021-04-19

Safety monitoring method of moving target in underground coal mine based on computer vision processing

OPENALEX - Publications

Pengfei Xu Zhiqing Zhou Zexun Geng

Coal is one of the main energy sources in China. The country attaches great importance to development coal mining industry, and production on rise. At same time, mine safety accidents are becoming more frequent, paying attention accidents. underground environment complex, noisy uneven, there will be problems such as occlusion high false detection rate during video monitoring. In order ensure personnel, moving target tracking based monitoring information significance for production. purpose...

10.1038/s41598-022-22564-8 article EN cc-by Scientific Reports 2022-10-25

Robust and Fast Vehicle Turn-counts at Intersections via an Integrated Solution from Detection, Tracking and Trajectory Modeling

OPENALEX - Publications

Zhihui Wang Bing Bai Yujun Xie Tengfei Xing Bineng Zhong and 7 more

In this paper, we address the problem of vehicle turn-counts by class at multiple intersections, which is greatly challenged inaccurate detection and tracking results caused heavy weather, occlusion, illumination variations, background clutter, etc. Therefore, complexity calls for an integrated solution that robustly extracts as much visual information possible efficiently combines it through sequential feedback cycles. We propose such algorithm, effectively detection, modeling, tracking,...

10.1109/cvprw50498.2020.00313 article EN 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops (CVPRW) 2020-06-01

Where should I stand? Learning based human position recommendation for mobile photographing

OPENALEX - Publications

Pengfei Xu Hongxun Yao Rongrong Ji Xianming Liu Xiaoshuai Sun

10.1007/s11042-012-1343-2 article EN Multimedia Tools and Applications 2013-01-17

Cross-media manifold learning for image retrieval & annotation

OPENALEX - Publications

Xianming Liu Rongrong Ji Hongxun Yao Pengfei Xu Xiaoshuai Sun and 1 more

Fusion of visual content with textual information is an effective way for both content-based and keyword-based image retrieval. However, the performance & fusion affected greatly by data noise redundancy in text (such as surrounding HTML pages) intra-class diversity) aspects. This paper presents a manifold-based cross-media optimization scheme to achieve within unified framework. Cross-Media manifold co-training mechanism between Keyword-based Metric Space Vision-Based proposed creatively...

10.1145/1460096.1460121 article EN 2008-10-30

An Image Enhancement Algorithm Based on Fractional-Order Phase Stretch Transform and Relative Total Variation

OPENALEX - Publications

Wei Wang Ying Jia Qiming Wang Pengfei Xu

The main purpose of image enhancement technology is to improve the quality better assist those activities daily life that are widely dependent on it like healthcare, industries, education, and surveillance. Due influence complex environments, there risks insufficient detail low contrast in some images. Existing algorithms prone overexposure improper processing. This paper attempts treatment effect Phase Stretch Transform (PST) information medium frequencies. For this purpose, an algorithm...

10.1155/2021/8818331 article EN Computational Intelligence and Neuroscience 2021-01-01

Anisotropic Phase Stretch Transform-Based Algorithm for Segmentation of Activated Sludge Phase-Contrast Microscopic Image

OPENALEX - Publications

Pengfei Xu Zhiqing Zhou Hesheng Shi Zexun Geng

The activated sludge (AS) process is a biological treatment of wastewater used in sewage plants, which settling AS vitally important for treatment. In AS, however, bulking caused by filamentous bacteria will significantly reduce the capacity sludge. Traditionally, physicochemical method has been to monitor status or performance sludge, while it very compromising means modern digital quality control when image processing and analysis technology determine order avoid disadvantages deficient...

10.1109/access.2022.3166603 article EN cc-by IEEE Access 2022-01-01

Technical Research on Moving Target Monitoring and Intelligent Tracking Algorithm Based on Machine Vision

OPENALEX - Publications

Pengfei Xu Zhiqing Zhou Zexun Geng

Machine vision is an important branch of the rapid development modern artificial intelligence, and it a key technology to convert image information monitoring targets into digital signals. However, due wide range machine applications, this research focuses on its application in video surveillance. In era detection tracking moving objects have always been issue The simulation human realized by combining relevant functions computer acquisition device, which enables ability recognize...

10.1155/2022/7277926 article EN Wireless Communications and Mobile Computing 2022-05-27

A robust texture descriptor using multifractal analysis with Gabor filter

OPENALEX - Publications

Pengfei Xu Hongxun Yao Rongrong Ji Xiaoshuai Sun Xianming Liu

Texture classification analysis and play an important role in the domain of content-based image retrieval, segmentation, scene recognition image/video analysis. This paper proposes a novel robust texture descriptor on variance rotation, scale illumination, which combines dominant orientation multifractal base Gabor filter. The orientations are extracted corresponding Gaussian scales to handle rotation variance, then illumination invariant spectrum (MFS) is produced based multi-scale filters...

10.1145/1937728.1937763 article EN 2010-12-30

Wide baseline image mosaicing by integrating MSER and Hessian-Affine

OPENALEX - Publications

Yufang Ning Chen Ren Pengfei Xu

In this paper we propose a novel approach for wide-baseline image mosaicing which integrates MSER and Hessian-Affine detectors. are both robust detectors stereo matching they can be integrated owing to their availability in the structured scenes rich-textured separately. However, output shape of them is different, so cannot directly integrated. We use an affine covariant construction method unify shape. At same time, introduce standard elliptic equation ellipse parameters. The axial length...

10.1109/cisp.2011.6100566 article EN 2011-10-01

Dual Dynamic Inference: Enabling More Efficient, Adaptive and Controllable Deep Inference

OPENALEX - Publications

Yue Wang Jianghao Shen Ting-Kuei Hu Pengfei Xu Tan M. Nguyen and 3 more

10.48550/arxiv.1907.04523 preprint EN other-oa arXiv (Cornell University) 2019-01-01

Multiple Set Matching with Bloom Matrix and Bloom Vector

OPENALEX - Publications

Francesco Concas Pengfei Xu Mohammad A. Hoque Jiaheng Lu Sasu Tarkoma

Bloom Filter is a space-efficient probabilistic data structure for checking the membership of elements in set. Given multiple sets, standard not sufficient when looking items to which an element or set input belong. An example case searching documents with keywords large text corpus, essentially matching problem where single keywords, and result possible candidate documents. This article solves by proposing two efficient Multifilters called Matrix Vector, generalize Filter. Both structures...

10.1145/3372409 article EN ACM Transactions on Knowledge Discovery from Data 2020-02-09

What is a complete set of keywords for image description & annotation on the web

OPENALEX - Publications

Xianming Liu Hongxun Yao Rongrong Ji Pengfei Xu Xiaoshuai Sun

Does there exist a compact set of keywords that can completely and effectively cover the image annotation problem by expanding from it? In this paper, we answer question presenting complete framework for annotation, which is motivated existence semantic ontology. To generate set, propose cross model optimization strategy both textual visual information topic decomposition, based on so-called Bipartite LSA model, minimize multimodal error energy functions in probabilistic Latent Semantic...

10.1145/1631272.1631369 article EN Proceedings of the 30th ACM International Conference on Multimedia 2009-10-19

Coming Soon ...