Fengmao Lv

ORCID: 0000-0003-1640-0992
Research Areas
  • Domain Adaptation and Few-Shot Learning
  • Multimodal Machine Learning Applications
  • Text and Document Classification Technologies
  • Advanced Neural Network Applications
  • Face and Expression Recognition
  • Topic Modeling
  • COVID-19 diagnosis using AI
  • Network Security and Intrusion Detection
  • Advanced Image and Video Retrieval Techniques
  • Machine Learning and Data Classification
  • Music and Audio Processing
  • Sentiment Analysis and Opinion Mining
  • Remote-Sensing Image Classification
  • Anomaly Detection Techniques and Applications
  • Advanced Malware Detection Techniques
  • Air Quality and Health Impacts
  • Air Quality Monitoring and Forecasting
  • Natural Language Processing Techniques
  • Image Retrieval and Classification Techniques
  • Metaheuristic Optimization Algorithms Research
  • Internet Traffic Analysis and Secure E-voting
  • Atmospheric chemistry and aerosols
  • Radiomics and Machine Learning in Medical Imaging
  • Advanced Text Analysis Techniques
  • Neural Networks and Reservoir Computing

Southwest Jiaotong University
2020-2025

Southwestern University of Finance and Economics
2019-2021

University of Electronic Science and Technology of China
2014-2020

We propose a new approach, called self-motivated pyramid curriculum domain adaptation (PyCDA), to facilitate the adaptation of semantic segmentation neural networks from synthetic source domains to real target domains. Our approach draws on an insight connecting two existing works: curriculum domain adaptation and self-training. Inspired by the former, PyCDA constructs a pyramid curriculum which contains various properties about the target domain. Those properties are mainly the desired label distributions over target images, image regions, and pixels. By enforcing the network to observe those...

10.1109/iccv.2019.00686 article EN 2019 IEEE/CVF International Conference on Computer Vision (ICCV) 2019-10-01
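
As a rough illustration of the curriculum idea sketched above, the snippet below (PyTorch, assuming a standard segmentation network) pairs an image-level label-distribution constraint with pixel-level self-training on confident predictions. Names such as curriculum_losses and desired_dist are illustrative, not the official PyCDA code.

    import torch
    import torch.nn.functional as F

    def curriculum_losses(logits, desired_dist, conf_thresh=0.9):
        # logits: (B, C, H, W) segmentation scores; desired_dist: (B, C)
        # desired class frequencies per image (one level of the curriculum).
        probs = F.softmax(logits, dim=1)

        # Image-level term: predicted class frequencies vs. desired distribution.
        pred_dist = probs.mean(dim=(2, 3))
        dist_loss = F.kl_div(pred_dist.log(), desired_dist, reduction="batchmean")

        # Pixel-level term: self-training on confident pseudo labels only.
        conf, pseudo = probs.max(dim=1)
        mask = conf > conf_thresh
        ce = F.cross_entropy(logits, pseudo, reduction="none")
        pix_loss = ce[mask].mean() if mask.any() else logits.new_zeros(())

        return dist_loss, pix_loss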

Human multimodal emotion recognition involves time-series data of different modalities, such as natural language, visual motions, and acoustic behaviors. Due to the variable sampling rates of the different modalities, the sequences collected from the streams are usually unaligned. The asynchrony across modalities increases the difficulty of conducting efficient multimodal fusion. Hence, this work mainly focuses on the fusion of unaligned multimodal sequences. To this end, we propose the Progressive Modality Reinforcement (PMR) approach based on recent advances in crossmodal...

10.1109/cvpr46437.2021.00258 article EN 2021 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) 2021-06-01
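
The crossmodal attention that PMR builds on can be illustrated with a minimal PyTorch block in which one modality queries another; the progressive reinforcement scheme itself is omitted, and the dimensions below are assumptions.

    import torch
    import torch.nn as nn

    class CrossModalBlock(nn.Module):
        # One modality (query) attends to another (key/value); a basic
        # building block of crossmodal-attention fusion.
        def __init__(self, dim=64, heads=4):
            super().__init__()
            self.attn = nn.MultiheadAttention(dim, heads, batch_first=True)
            self.norm = nn.LayerNorm(dim)

        def forward(self, query_seq, context_seq):
            # query_seq: (B, Lq, dim), context_seq: (B, Lk, dim); Lq and Lk
            # may differ, so unaligned sequences need no word-level alignment.
            fused, _ = self.attn(query_seq, context_seq, context_seq)
            return self.norm(query_seq + fused)

    text = torch.randn(2, 8, 64)     # e.g. 8 word embeddings
    audio = torch.randn(2, 400, 64)  # e.g. 400 acoustic frames
    out = CrossModalBlock()(text, audio)  # -> (2, 8, 64)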

Support vector machine is a classification model which has been widely used in many nonlinear and high dimensional pattern recognition problems. However, it is inefficient or even impracticable to train a support vector machine on a large scale training set due to its computational difficulty as well as its storage complexity. In this paper, we study this problem mainly in the context of reduction methods that reconstruct the training set for the support vector machine. We focus on the fact that instances are unevenly distributed in the feature space and propose an efficient self-adaption instance...

10.1016/j.knosys.2016.10.031 article EN cc-by-nc-nd Knowledge-Based Systems 2016-11-02
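
A minimal sketch of the instance-reduction setting described above, using scikit-learn: keep only the instances of each class that lie closest to the opposite class (the likely support vectors) and train the SVM on that subset. This is a generic boundary-based heuristic for illustration, not the self-adaptive selection algorithm proposed in the paper.

    import numpy as np
    from sklearn.datasets import make_classification
    from sklearn.neighbors import NearestNeighbors
    from sklearn.svm import SVC

    def reduce_training_set(X, y, keep_ratio=0.2):
        # Keep, per class, the instances closest to the opposite class.
        keep = []
        for cls in np.unique(y):
            idx = np.where(y == cls)[0]
            other = X[y != cls]
            d, _ = NearestNeighbors(n_neighbors=1).fit(other).kneighbors(X[idx])
            n_keep = max(1, int(keep_ratio * len(idx)))
            keep.append(idx[np.argsort(d.ravel())[:n_keep]])
        return np.concatenate(keep)

    X, y = make_classification(n_samples=5000, n_features=20, random_state=0)
    sel = reduce_training_set(X, y)
    clf = SVC(kernel="rbf").fit(X[sel], y[sel])  # trains on ~20% of the data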

Trained with the standard cross entropy loss, deep neural networks can achieve great performance on correctly labeled data. However, if the training data is corrupted by label noise, deep models tend to overfit the noisy labels, thereby achieving poor generalization performance. To remedy this issue, several loss functions have been proposed and demonstrated to be robust to label noise. Although most of them stem from the Categorical Cross Entropy (CCE) loss, they fail to embody the intrinsic relationships between CCE and other loss functions. In this paper,...

10.24963/ijcai.2020/305 article EN 2020-07-01
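
One concrete way to relate CCE to noise-robust losses is to truncate the Taylor expansion of -log(p); the sketch below shows that family, with the truncation order trading robustness against fitting ability. Whether this matches the paper's exact formulation is an assumption.

    import torch

    def taylor_cross_entropy(logits, targets, order=2):
        # Truncated Taylor expansion of -log(p_y): sum_{i=1..order} (1 - p_y)^i / i.
        # order=1 behaves like MAE (noise-robust); order -> infinity recovers CCE.
        p_y = torch.softmax(logits, dim=1).gather(1, targets.unsqueeze(1)).squeeze(1)
        loss = sum((1.0 - p_y) ** i / i for i in range(1, order + 1))
        return loss.mean()

    logits, targets = torch.randn(16, 10), torch.randint(0, 10, (16,))
    print(taylor_cross_entropy(logits, targets, order=2))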

Exploiting photo-realistic synthetic data to train semantic segmentation models has received increasing attention over the past years. However, the domain mismatch between synthetic and real images will cause a significant performance drop when a model trained with synthetic images is directly applied to real-world scenarios. In this paper, we propose a new adaptation approach, called Pivot Interaction Transfer (PIT). Our method mainly focuses on constructing pivot information, that is, common knowledge shared across domains, as...

10.1109/cvpr42600.2020.00439 article EN 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) 2020-06-01

Videos flow as a mixture of language, acoustic, and vision modalities. A thorough video understanding needs to fuse the time-series data of the different modalities for prediction. Due to the variable receiving frequency of the sequences from each modality, there usually exists inherent asynchrony across the collected multimodal streams. Towards an efficient fusion of asynchronous multimodal streams, we need to model the correlations between elements from different modalities. The recent Multimodal Transformer (MulT) approach extends the self-attention mechanism...

10.1109/iccv48922.2021.00804 article EN 2021 IEEE/CVF International Conference on Computer Vision (ICCV) 2021-10-01
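
A small numeric illustration of the asynchrony mentioned above (the sampling rates are hypothetical): the same clip yields sequences of very different lengths per modality, so element-wise alignment is unavailable and fusion must model cross-modal correlations directly.

    # Hypothetical sampling rates over the same 4-second clip:
    clip_seconds = 4
    rates = {"text (words/s)": 2, "audio (frames/s)": 100, "video (fps)": 30}
    lengths = {m: r * clip_seconds for m, r in rates.items()}
    print(lengths)  # {'text (words/s)': 8, 'audio (frames/s)': 400, 'video (fps)': 120}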

A botnet is one of the most grievous threats to network security since it can evolve into many attacks, such as Denial-of-Service (DoS), spam, and phishing. However, current detection methods are inefficient at identifying unknown botnets. The high-speed network environment makes botnet detection more difficult. To solve these problems, we build on advances in packet processing technologies such as the New Application Programming Interface (NAPI) and zero copy, and propose an efficient quasi-real-time intrusion detection system. Our work detects botnets using...

10.1155/2017/4934082 article EN cc-by Mathematical Problems in Engineering 2017-01-01

Fine-tuning pre-trained models for downstream tasks is mainstream in deep learning. However, pre-trained models are limited to being fine-tuned with data from a specific modality. For example, as a visual model, DenseNet cannot directly take textual data as its input. Hence, although large models such as DenseNet or BERT have great potential for recognition tasks, they have weaknesses in leveraging multimodal information, which is a new trend of deep learning. This work focuses on fine-tuning unimodal pre-trained models with multimodal inputs of image-text pairs and expanding them to multimodal recognition. To this...

10.1109/cvpr52688.2022.01505 article EN 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) 2022-06-01
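
For contrast with the approach above (whose mechanism is truncated here), a common late-fusion baseline simply concatenates features from a visual backbone and a text backbone and trains a small head on image-text pairs. The sketch below shows that baseline only, with assumed feature sizes (1024-d DenseNet features, 768-d BERT features).

    import torch
    import torch.nn as nn

    class LateFusionHead(nn.Module):
        # Concatenate image and text features and classify; a generic
        # baseline for multimodal recognition, not the method proposed above.
        def __init__(self, img_dim=1024, txt_dim=768, n_classes=10):
            super().__init__()
            self.fc = nn.Sequential(
                nn.Linear(img_dim + txt_dim, 256), nn.ReLU(), nn.Linear(256, n_classes))

        def forward(self, img_feat, txt_feat):
            return self.fc(torch.cat([img_feat, txt_feat], dim=1))

    head = LateFusionHead()
    logits = head(torch.randn(4, 1024), torch.randn(4, 768))  # -> (4, 10)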

Natural language BERTs are trained on text corpora in a self-supervised manner. Unlike natural language BERTs, vision-language BERTs need paired image-text data to train, which restricts the scale of VL-BERT pretraining. We propose a self-training approach that allows training VL-BERTs from unlabeled image data. The proposed method starts from our unified conditional model, a BERT model that can perform zero-shot conditional generation. Given different conditions, it can generate captions, dense captions, and even questions. We use the labeled data to train a teacher model and generate pseudo captions on...

10.1109/tcsvt.2023.3235704 article EN IEEE Transactions on Circuits and Systems for Video Technology 2023-01-10
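
The self-training loop described above can be outlined as follows; train_vl_bert and teacher.generate_caption are hypothetical placeholders standing in for the paper's actual training and captioning steps.

    def self_train(teacher, student, labeled_pairs, unlabeled_images):
        # 1) Train the teacher (a conditional caption generator) on labeled pairs.
        teacher = train_vl_bert(teacher, labeled_pairs)            # hypothetical helper
        # 2) Generate pseudo image-text pairs for the unlabeled images.
        pseudo = [(img, teacher.generate_caption(img))             # hypothetical method
                  for img in unlabeled_images]
        # 3) Train the student VL-BERT on labeled plus pseudo-labeled pairs.
        return train_vl_bert(student, labeled_pairs + pseudo)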

Multi-view unsupervised feature selection (MUFS) has been demonstrated as an effective technique to reduce the dimensionality of multi-view unlabeled data. The existing methods assume that all views are complete. However, multi-view data are usually incomplete, i.e., a part of the instances are present in some views but not in all views. Besides, learning a complete similarity graph, an important and promising technology in existing MUFS methods, cannot be achieved due to the missing views. In this paper, we propose a complementary and consensus learning-based...

10.1109/tkde.2023.3266595 article EN IEEE Transactions on Knowledge and Data Engineering 2023-04-12

Concurrent pollution of fine particulate matter (PM2.5) and ozone has been increasingly reported in China recently. Here, we further confirm widespread co-occurring summertime PM2.5-ozone extremes in southern China. The annual-average frequency of co-occurrence is above 50% from 2015 to 2022, especially in the Pearl River Delta region (72 ± 12%). The spatial extent (city numbers) and temporal persistence (co-occurrence days) for cities with >50% frequency increase at rates of two cities/year and 14 days/year,...

10.1029/2023gl106527 article EN cc-by-nc-nd Geophysical Research Letters 2024-01-19
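
An illustrative computation of a summertime co-occurrence frequency from daily city-level data (synthetic numbers; the exceedance thresholds and the paper's exact definition are assumptions):

    import numpy as np
    import pandas as pd

    rng = np.random.default_rng(0)
    days = pd.date_range("2022-06-01", "2022-08-31", freq="D")
    df = pd.DataFrame({"pm25": rng.uniform(10, 80, len(days)),   # ug/m3, synthetic
                       "o3": rng.uniform(80, 220, len(days))},   # ug/m3 (MDA8), synthetic
                      index=days)

    co_occur = (df["pm25"] > 35) & (df["o3"] > 160)              # assumed thresholds
    print("co-occurrence days:", int(co_occur.sum()))
    print("co-occurrence frequency:", round(100 * co_occur.mean(), 1), "%")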

10.1109/tmm.2025.3557612 article EN IEEE Transactions on Multimedia 2025-01-01

The automatic text categorization technique has gained significant attention among researchers because of the increasing availability of online information. Therefore, many different learning approaches have been designed in this field. Among them, a widely used method is the Centroid-Based Classifier (CBC) due to its theoretical simplicity and computational efficiency. However, the classification accuracy of CBC greatly depends on the data distribution. Thus it leads to a misfit model and also poor performance when...

10.1016/j.knosys.2017.08.020 article EN cc-by-nc-nd Knowledge-Based Systems 2017-08-30
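
For reference, a minimal Centroid-Based Classifier of the kind discussed above: average the tf-idf vectors of each class into a centroid and assign new documents to the nearest centroid by cosine similarity; the toy corpus is illustrative.

    import numpy as np
    from sklearn.feature_extraction.text import TfidfVectorizer
    from sklearn.preprocessing import normalize

    docs = ["cheap flights and hotels", "football match results",
            "hotel booking deals", "league standings and scores"]
    labels = np.array([0, 1, 0, 1])          # 0 = travel, 1 = sports (toy data)

    vec = TfidfVectorizer()
    X = vec.fit_transform(docs)
    centroids = normalize(np.vstack(
        [np.asarray(X[labels == c].mean(axis=0)) for c in np.unique(labels)]))

    def predict(text):
        v = normalize(vec.transform([text]))
        return int(np.argmax(v @ centroids.T))  # nearest centroid by cosine

    print(predict("discount hotel rooms"))      # -> 0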

Purpose: To train deep learning models to differentiate benign and malignant breast tumors in ultrasound images, we need to collect many training samples with clear labels. In general, biopsy results can be used as benign/malignant labels. However, most clinical samples generally do not have biopsy results. Previous works proposed generating labels according to Breast Imaging Reporting and Data System (BI-RADS) ratings. However, this approach will cause noisy labels, which means that the labels produced from BI-RADS diagnoses may...

10.1002/mp.13966 article EN Medical Physics 2019-12-14

Colorectal polyp segmentation (CPS), an essential problem in medical image analysis, has garnered growing research attention. Recently, deep learning-based models have completely overwhelmed traditional methods in the field of CPS, and more and more deep-learning-based CPS methods have emerged, bringing CPS into the deep learning era. To help researchers quickly grasp the main techniques, datasets, evaluation metrics, challenges, and trending methods, this paper presents a systematic and comprehensive review of deep-learning-based CPS methods from 2014 to 2023, covering a total of 115 technical...

10.48550/arxiv.2401.11734 preprint EN cc-by-nc-sa arXiv (Cornell University) 2024-01-01

We propose a new approach, called self-motivated pyramid curriculum domain adaptation (PyCDA), to facilitate the adaptation of semantic segmentation neural networks from synthetic source domains to real target domains. Our approach draws on an insight connecting two existing works: curriculum domain adaptation and self-training. Inspired by the former, PyCDA constructs a pyramid curriculum which contains various properties about the target domain. Those properties are mainly the desired label distributions over target images, image regions, and pixels. By enforcing the network to observe those...

10.48550/arxiv.1908.09547 preprint EN other-oa arXiv (Cornell University) 2019-01-01

Semantic segmentation, which aims to acquire pixel-level understanding of images, is among the key components in computer vision. To train a good segmentation model for real-world scenarios, it usually requires a huge amount of time and labor effort to obtain sufficient pixel-level annotations of images beforehand. To get rid of such a nontrivial burden, one can use simulators to automatically generate synthetic images that inherently contain full annotations. However, training with synthetic images alone cannot lead to satisfactory performance due to the domain difference between...

10.1109/tcsvt.2020.3040343 article EN IEEE Transactions on Circuits and Systems for Video Technology 2020-11-25