NFDI4DS | UHH-SEMS - Publication Details

Fan Feng

ORCID: 0000-0001-7959-2472

About

Contact & Profiles

OpenAlex author ID: A5101581669

Research Areas

Human Pose and Action Recognition
Multimodal Machine Learning Applications
Advanced Image and Video Retrieval Techniques
Advanced Image Processing Techniques
Music and Audio Processing
Advanced Image Fusion Techniques
Image Enhancement Techniques
Anomaly Detection Techniques and Applications
Video Analysis and Summarization
Video Surveillance and Tracking Methods
Digital Media Forensic Detection
Domain Adaptation and Few-Shot Learning
Speech and Audio Processing
Advanced Steganography and Watermarking Techniques
Image and Signal Denoising Methods
Image Processing and 3D Reconstruction
Advanced Vision and Imaging
Advanced Measurement and Detection Methods
Infrared Target Detection Methodologies
Face and Expression Recognition
Recommender Systems and Techniques
Topic Modeling
Remote-Sensing Image Classification
Advanced Graph Neural Networks
Image and Object Detection Techniques

Beijing University of Posts and Telecommunications
2021-2024

Nanyang Institute of Technology
2024

Army Medical University
2021

Troy University
2020

Donghua University
2018

Huazhong University of Science and Technology
2018

Embedded Systems (United States)
2017

Wuhan University
2015

Ordnance Engineering College
2009

Weatherford College
2008

MSR-net: Low-light Image Enhancement Using Deep Convolutional Network

OPENALEX - Publications

Liang Shen Zihan Yue Fan Feng Quan Chen Shihao Liu and 1 more

Images captured in low-light conditions usually suffer from very low contrast, which greatly increases the difficulty of subsequent computer vision tasks. In this paper, a low-light image enhancement model based on a convolutional neural network and Retinex theory is proposed. Firstly, we show that multi-scale Retinex is equivalent to a feedforward convolutional neural network with different Gaussian convolution kernels. Motivated by this fact, we consider a Convolutional Neural Network (MSR-net) that directly learns an end-to-end mapping between dark...

10.48550/arxiv.1711.02488 preprint EN other-oa arXiv (Cornell University) 2017-01-01
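
The MSR-net abstract above relates multi-scale Retinex to a stack of Gaussian convolutions. A minimal NumPy sketch of classical multi-scale Retinex may make that relation concrete. It is not the authors' network: the function names, the small `sigmas` values, and the `eps` offset are illustrative assumptions.

```python
import numpy as np

def gaussian_kernel(sigma):
    """Return a normalized 1-D Gaussian kernel truncated at about 3 sigma."""
    radius = max(1, int(3 * sigma))
    x = np.arange(-radius, radius + 1, dtype=np.float64)
    k = np.exp(-(x ** 2) / (2.0 * sigma ** 2))
    return k / k.sum()

def gaussian_blur(img, sigma):
    """Separable Gaussian blur of a 2-D array using reflect padding."""
    k = gaussian_kernel(sigma)
    r = len(k) // 2
    padded = np.pad(img, r, mode="reflect")
    # Convolve every row, then every column; "valid" removes the padding.
    rows = np.apply_along_axis(lambda v: np.convolve(v, k, mode="valid"), 1, padded)
    return np.apply_along_axis(lambda v: np.convolve(v, k, mode="valid"), 0, rows)

def multi_scale_retinex(img, sigmas=(1.0, 2.0, 4.0), eps=1.0):
    """Average of single-scale Retinex outputs: log(I) - log(G_sigma * I).

    The sigmas are small illustrative values (an assumption), chosen so the
    sketch runs on tiny test images.
    """
    img = np.asarray(img, dtype=np.float64)
    log_img = np.log(img + eps)
    out = np.zeros_like(log_img)
    for s in sigmas:
        out += log_img - np.log(gaussian_blur(img, s) + eps)
    return out / len(sigmas)
```

Each scale is one Gaussian convolution followed by a logarithm and a subtraction, and the scales are averaged. That fixed pipeline is what the abstract describes as equivalent to a feedforward convolutional network with different Gaussian kernels.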

MAENet: A novel multi-head association attention enhancement network for completing intra-modal interaction in image captioning

Nannan Hu Chunxiao Fan Yue Ming Fan Feng

10.1016/j.neucom.2022.11.045 article EN Neurocomputing 2022-11-18

3D-TDC: A 3D temporal dilation convolution framework for video action recognition

Yue Ming Fan Feng Chao Li Jing‐Hao Xue

10.1016/j.neucom.2021.03.120 article EN Neurocomputing 2021-04-06

Deep joint rain and haze removal from a single image

Liang Shen Zihan Yue Quan Chen Fan Feng Jie Ma

Rain removal from a single image is a challenge which has been studied for a long time. In this paper, a novel convolutional neural network based on wavelet and dark channel is proposed. On one hand, we think that rain streaks correspond to the high frequency component of the image. Therefore, the Haar wavelet transform is a good choice to separate the rain streaks and background to some extent. More specifically, the LL subband is more inclined to express the background information, while the HL and LH subbands tend to represent the rain streaks and edges respectively. On the other hand, the accumulation of rain streaks from long distance makes the rain image look...

10.1109/icpr.2018.8545729 article EN 2018 24th International Conference on Pattern Recognition (ICPR) 2018-08-01

See, move and hear: a local-to-global multi-modal interaction network for video action recognition

Fan Feng Yue Ming Nannan Hu Jiangwan Zhou

10.1007/s10489-023-04497-5 article EN Applied Intelligence 2023-03-15

CSS-Net: A Consistent Segment Selection Network for Audio-Visual Event Localization

Fan Feng Yue Ming Nannan Hu Hui Yu Yuanan Liu

Audio-visual event (AVE) localization aims to localize the temporal boundaries of events that contain visual and audio contents, and to identify event categories in unconstrained videos. Existing work usually utilizes successive video segments for temporal modeling. However, ambient sounds or irrelevant visual targets in some segments often cause the problem of audio-visual semantics inconsistency, resulting in inaccurate global event modeling. To tackle this issue, we present a consistent segment selection network (CSS-Net) in this paper. First, we propose a novel...

10.1109/tmm.2023.3270624 article EN IEEE Transactions on Multimedia 2023-04-26

People counting and pedestrian flow statistics based on convolutional neural network and recurrent neural network

Jie Zhu Fan Feng Bo Shen

People counting and pedestrian flow statistics are challenging tasks because of perspective distortions, appearance changes and occlusion. In this paper, we address two tasks: people counting in images of highly dense crowds and pedestrian flow statistics in a place over a period of time. Our first contribution is to propose a new convolution neural network (CNN) model which is composed of a deep network and a shallow network to fully fulfill the task of people counting. We extract different layer features from the last layers of the network, and concatenate them together. After that, we add deconvolution layers...

10.1109/yac.2018.8406516 article EN 2018-05-01

Visual object tracking: in the simultaneous presence of scale variation and occlusion

Fan Feng Bo Shen Hongjian Liu

Visual object tracking is a challenging task when the object appearance changes are caused by scale variation and occlusion. In this paper, an algorithm is proposed which is capable of dealing with the case that scale variation and occlusion occur simultaneously. A kernelized correlation filter (KCF) is first learned to obtain the response, whose maximum value denotes the optimal location. In order to represent the sample better, convolutional features are extracted from pre-trained convolutional neural networks (CNNs). Then, a strategy of scale adaption is used to estimate the scale during...

10.1080/21642583.2018.1536899 article EN cc-by-nc Systems Science & Control Engineering 2018-01-01

SSLNet: A network for cross-modal sound source localization in visual scenes

Fan Feng Yue Ming Nannan Hu

10.1016/j.neucom.2022.05.098 article EN Neurocomputing 2022-05-31

TSFNet: Triple-Steam Image Captioning

Nannan Hu Yue Ming Chunxiao Fan Fan Feng Boyang Lyu

Image captioning is a challenging task that generates a natural language description based on the visual understanding of a given image. Significant region representation is a milestone in image captioning. Despite the great success of existing region-based works, they only focus on salient objects and encode these objects independently, and are still plagued by the lack of global contextual information and relationships. In fact, global contextual information and structured relationships are exactly the merits of traditional grid features and emerging scene graph features. In this...

10.1109/tmm.2022.3215861 article EN IEEE Transactions on Multimedia 2022-11-03

A reversible authentication scheme for JPEG2000 images

Wen Jia-fu Jiazhen Wang Fan Feng Bin Zhang

In this paper, a scalable, robust and recovery-driven authentication scheme aimed at verifying the authenticity of JPEG2000 images is proposed. It is achieved by truncating the bit planes of the wavelet coefficients into two portions in the JPEG2000 codec, based on the lowest compression rate (CBR). The invariant features, which are generated from the upper portion, are signed by the sender's private key to generate a crypto signature. By embedding the signature, the scheme has the ability of authentication for content as long as the final transmitted image rate is not less than the CBR....

10.1109/icemi.2009.5274015 article EN 2009-08-01

Deep joint rain and haze removal from single images

Liang Shen Zihan Yue Quan Chen Fan Feng Jie Ma

Rain removal from a single image is a challenge which has been studied for a long time. In this paper, a novel convolutional neural network based on wavelet and dark channel is proposed. On one hand, we think that rain streaks correspond to the high frequency component of the image. Therefore, the Haar wavelet transform is a good choice to separate the rain streaks and background to some extent. More specifically, the LL subband is more inclined to express the background information, while the LH, HL, HH subbands tend to represent the rain streaks and the edges. On the other hand, the accumulation of rain streaks from long distance makes the rain image look like haze...

10.48550/arxiv.1801.06769 preprint EN other-oa arXiv (Cornell University) 2018-01-01
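
The abstract above relies on the split of an image into LL, LH, HL and HH subbands. A minimal NumPy sketch of a one-level 2-D Haar transform shows what that split does to a thin vertical streak. The function name `haar_dwt2` is hypothetical, and the LH/HL naming used here is one of several conventions in use, so treat the labels as an assumption rather than the paper's definition.

```python
import numpy as np

def haar_dwt2(img):
    """One-level orthonormal 2-D Haar transform.

    Expects a 2-D array with even height and width and returns the four
    subbands (ll, lh, hl, hh), each half the size of the input.
    """
    img = np.asarray(img, dtype=np.float64)
    s = np.sqrt(2.0)
    # Low-pass (sum) and high-pass (difference) over neighboring columns.
    lo = (img[:, 0::2] + img[:, 1::2]) / s
    hi = (img[:, 0::2] - img[:, 1::2]) / s
    # Repeat the same split over neighboring rows.
    ll = (lo[0::2, :] + lo[1::2, :]) / s  # smooth approximation
    lh = (lo[0::2, :] - lo[1::2, :]) / s  # changes between rows
    hl = (hi[0::2, :] + hi[1::2, :]) / s  # changes between columns
    hh = (hi[0::2, :] - hi[1::2, :]) / s  # diagonal detail
    return ll, lh, hl, hh

# A flat background with one thin vertical streak in column 1.
streak = np.zeros((4, 4))
streak[:, 1] = 1.0
ll, lh, hl, hh = haar_dwt2(streak)
```

With this convention the streak shows up in `hl` and leaves `lh` and `hh` at zero, while `ll` keeps a blurred copy of the image. This is the intuition behind treating the high-frequency subbands as carriers of rain streaks and edges.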

F2D-SIFPNet: a frequency 2D Slow-I-Fast-P network for faster compressed video action recognition

Yue Ming Jiangwan Zhou Xia Jia Qingfang Zheng Xiong Lu and 2 more

10.1007/s10489-024-05408-y article EN Applied Intelligence 2024-04-01

Efficient and expressive high-resolution image synthesis via variational autoencoder-enriched transformers with sparse attention mechanisms

Bingyin Tang Fan Feng

10.1117/1.jei.33.3.033002 article EN Journal of Electronic Imaging 2024-05-02

Cloud-cover assessment: From spectral properties to spatial domain natural scene statistic

Shuigen Wang Chenwei Deng Xun Liu Zhenzhen Li Fan Feng and 1 more

Cloud contamination is the most common defect leading to quality degradation in remote sensing images. Numerous cloud-cover assessment (CCA) methods have been developed in the literature. The traditional Landsat 7 CCA algorithm attempted to detect clouds by taking advantage of different spectral properties from five bands. However, it suffers from the weakness of omitting thin cirrus clouds and the requirement of thermal bands. In this paper, we derived an automated cloud-cover assessment (ACCA) model that measures the statistical deviations in the spatial domain...

10.1109/igarss.2017.8128134 article EN 2017-07-01

An algorithm to recognize the target object contour based on 2D point clouds by laser-CCD-scanning

Hongyong Mao Duanwei Shi Ji Zhou Pan Xu Shiyu Chen and 2 more

10.1007/s11859-015-1105-x article EN Wuhan University Journal of Natural Sciences 2015-07-08

Improving Conversational Recommendation System by Pretraining on Billions Scale of Knowledge Graph

Chi-Man Wong Fan Feng Wen Zhang Chi‐Man Vong Hui Chen and 5 more

Conversational Recommender Systems (CRSs) in E-commerce platforms aim to recommend items to users via multiple conversational interactions. Click-through rate (CTR) prediction models are commonly used for ranking candidate items. However, most CRSs suffer from the problem of data scarcity and sparseness. To address this issue, we propose a novel knowledge-enhanced deep cross network (K-DCN), a two-step (pretrain and fine-tune) CTR prediction model. We first construct a billion-scale conversation knowledge graph...

10.48550/arxiv.2104.14899 preprint EN cc-by arXiv (Cornell University) 2021-01-01

WITHDRAWN: SAR Image Target Recognition Method Based On Sparse Representation Of Local Dictionary

Han Hong-liang Yonglei Bai Wei Lü Fan Feng Jianhua Wang

10.1016/j.micpro.2021.104070 article EN Microprocessors and Microsystems 2021-02-01

A Refined Neural Network Recognition Architecture for Blurred Image Semantic Generalization

Fan Feng Long Ma

The technique of generating a caption for a blurred image by AI still faces the hurdle of recognition. In a blurred image, figuring out the semantic subject is a severe challenge. In this paper, we implement a classifier as an auxiliary oriented filter that combines with a standard dense based architecture. This refined architecture is used to categorize the main subject from the blurred media and transform it into a specific predicting range field. The proposed framework can describe the outcomes of semantical relations that are hidden in the blurred image; they...

10.1109/icsc.2020.00034 article EN 2020-02-01
