Fan Feng

ORCID: 0000-0001-7959-2472
Publications
Citations
Views
---
Saved
---
About
Contact & Profiles
Research Areas
  • Human Pose and Action Recognition
  • Multimodal Machine Learning Applications
  • Advanced Image and Video Retrieval Techniques
  • Advanced Image Processing Techniques
  • Music and Audio Processing
  • Advanced Image Fusion Techniques
  • Image Enhancement Techniques
  • Anomaly Detection Techniques and Applications
  • Video Analysis and Summarization
  • Video Surveillance and Tracking Methods
  • Digital Media Forensic Detection
  • Domain Adaptation and Few-Shot Learning
  • Speech and Audio Processing
  • Advanced Steganography and Watermarking Techniques
  • Image and Signal Denoising Methods
  • Image Processing and 3D Reconstruction
  • Advanced Vision and Imaging
  • Advanced Measurement and Detection Methods
  • Infrared Target Detection Methodologies
  • Face and Expression Recognition
  • Recommender Systems and Techniques
  • Topic Modeling
  • Remote-Sensing Image Classification
  • Advanced Graph Neural Networks
  • Image and Object Detection Techniques

Beijing University of Posts and Telecommunications
2021-2024

Nanyang Institute of Technology
2024

Army Medical University
2021

Troy University
2020

Donghua University
2018

Huazhong University of Science and Technology
2018

Embedded Systems (United States)
2017

Wuhan University
2015

Ordnance Engineering College
2009

Weatherford College
2008

Images captured in low-light conditions usually suffer from very low contrast, which increases the difficulty of subsequent computer vision tasks a great extent. In this paper, image enhancement model based on convolutional neural network and Retinex theory is proposed. Firstly, we show that multi-scale equivalent to feedforward with different Gaussian convolution kernels. Motivated by fact, consider Convolutional Neural Network(MSR-net) directly learns an end-to-end mapping between dark...

10.48550/arxiv.1711.02488 preprint EN other-oa arXiv (Cornell University) 2017-01-01

Rain removal from a single image is challenge which has been studied for long time. In this paper, novel convolutional neural network based on wavelet and dark channel proposed. On one hand, we think that rain streaks correspond to high frequency component of the image. Therefore, haar transform good choice separate background some extent. More specifically, LL subband more inclined express information, while HL, LH tend represent edges respectively. other accumulation distance makes look...

10.1109/icpr.2018.8545729 article EN 2022 26th International Conference on Pattern Recognition (ICPR) 2018-08-01

Audio-visual event (AVE) localization aims to localize the temporal boundaries of events that contains visual and audio contents, identify categories in unconstrained videos. Existing work usually utilizes successive video segments for modeling. However, ambient sounds or irrelevant targets some often cause problem audio-visual semantics inconsistency, resulting inaccurate global To tackle this issue, we present a consistent segment selection network (CSS-Net) paper. First, propose novel...

10.1109/tmm.2023.3270624 article EN IEEE Transactions on Multimedia 2023-04-26

People counting and pedestrian flow statistics are challenging tasks because of the perspective distortions, appearance changes occlusion. In this paper, we address two tasks: people in images highly dense crowds a place over period time. Our first contribution is to propose new convolution neural network (CNN) model which composed deep shallow fully fulfill task counting. We extract different layer features from last network, concatenate them together. After that add deconvolution layers...

10.1109/yac.2018.8406516 article EN 2018-05-01

Visual object tracking is a challenging task when the appearance changes caused by scale variation and occlusion. In this paper, an algorithm proposed which capable of dealing with case that occlusion occur simultaneously. A kernelized correlation filter (KCF) first learned to obtain response, whose maximum value denotes optimal location. order represent sample better, convolutional features are extracted from pre-trained neural networks (CNNs). Then, strategy adaption used estimate during...

10.1080/21642583.2018.1536899 article EN cc-by-nc Systems Science & Control Engineering 2018-01-01

Image captioning is a challenging task that generates natural language description based on the visual understanding of given image. Significant region representation milestone in image captioning. Despite great success existing region-based works, they only focus salient objects and encode these independently, still plagued by lack global contextual information relationships. In fact, structured relationships are exactly merits traditional grid features emerging scene graph features. this...

10.1109/tmm.2022.3215861 article EN IEEE Transactions on Multimedia 2022-11-03

In this paper, a scalable, robust and recovery-driven authentication scheme targeting at verifying the authenticity of JPEG2000 images is proposed. It achieved by truncating bit planes wavelet coefficients into two portions in codec based on lowest compression rate (CBR). The invariant features, which are generated from upper portion, signed sender's private key to generate crypto signature. By embedding signature has ability for content as long final transmitted image not less than CBR....

10.1109/icemi.2009.5274015 article EN 2009-08-01

Rain removal from a single image is challenge which has been studied for long time. In this paper, novel convolutional neural network based on wavelet and dark channel proposed. On one hand, we think that rain streaks correspond to high frequency component of the image. Therefore, haar transform good choice separate background some extent. More specifically, LL subband more inclined express information, while LH, HL, HH tend represent edges. other accumulation distance makes look like haze...

10.48550/arxiv.1801.06769 preprint EN other-oa arXiv (Cornell University) 2018-01-01

Cloud contamination is the most common defect leading to quality degradation in remote sensing images. Numerous cloud-cover assessment (CCA) methods have been developed literature. The traditional Landsat 7 CCA algorithm attempted detect clouds by taking advantages of different spectral properties from five bands. However, it suffers weakness omitting thin cirrus and requirement thermal In this paper, we derived an automated (ACCA) model that measures statistical deviations spatial domain...

10.1109/igarss.2017.8128134 article EN 2017-07-01

Conversational Recommender Systems (CRSs) in E-commerce platforms aim to recommend items users via multiple conversational interactions. Click-through rate (CTR) prediction models are commonly used for ranking candidate items. However, most CRSs suffer from the problem of data scarcity and sparseness. To address this issue, we propose a novel knowledge-enhanced deep cross network (K-DCN), two-step (pretrain fine-tune) CTR model We first construct billion-scale conversation knowledge graph...

10.48550/arxiv.2104.14899 preprint EN cc-by arXiv (Cornell University) 2021-01-01

The technique of generating a caption for blurred image described by AI still exists the hurdle recognition. In image, figuring out semantic subject is severe challenge. this paper, we implement classifier as an auxiliary oriented filter that combines with standard dense based architecture. This refined architecture used to categorize main from media and transform it into specific predicting range field. proposed framework can describe outcomes semantical relations are hidden in image; they...

10.1109/icsc.2020.00034 article EN 2020-02-01
Coming Soon ...