NFDI4DS | UHH-SEMS - Publication Details

Multi-scale Location-Aware Kernel Representation for Object Detection

OPENALEX - Publications

Hao Wang Qilong Wang Mingqi Gao Peihua Li Wangmeng Zuo

Although Faster R-CNN and its variants have shown promising performance in object detection, they only exploit a simple first-order representation of object proposals for final classification and regression. Recent methods demonstrate that the integration of high-order statistics into deep convolutional neural networks can achieve impressive improvement, but their goal is to model whole images by discarding location information, so they cannot be directly adopted for object detection. In this paper, we make an attempt...

10.1109/cvpr.2018.00136 article EN 2018-06-01

Liver Vessels Segmentation Based on 3d Residual U-NET

Wei Yu Bin Fang Yongqing Liu Mingqi Gao Shenhai Zheng and 1 more

Recently, the extraction of blood vessels has aroused widespread interest in medical image analysis. In this work, to accelerate convergence speed and enhance the representation of discriminative features, we introduce the residual block structure of ResNet into 3D U-Net and construct a new 3D Residual U-Net architecture to segment hepatic portal veins from abdominal CT volumes. In addition, we develop a weighted Dice loss function to cope with the challenges of pixel imbalance, vessel boundary segmentation and small vessel segmentation....

10.1109/icip.2019.8802951 article EN 2019 IEEE International Conference on Image Processing (ICIP) 2019-08-26

Filter pruning with uniqueness mechanism in the frequency domain for efficient neural networks

Shuo Zhang Mingqi Gao Qiang Ni Jungong Han

10.1016/j.neucom.2023.02.004 article EN Neurocomputing 2023-02-08

Feature fusion and non-negative matrix factorization based active contours for texture segmentation

Mingqi Gao Hengxin Chen Shenhai Zheng Bin Fang

10.1016/j.sigpro.2019.01.021 article EN Signal Processing 2019-01-30

Weakly-Supervised RGBD Video Object Segmentation

Jinyu Yang Mingqi Gao Feng Zheng Xiantong Zhen Rongrong Ji and 2 more

Depth information opens up opportunities for video object segmentation (VOS) to be more accurate and robust in complex scenes. However, RGBD VOS is still unexplored due to the high-cost collection and time-consuming annotation of data. In this work, we first introduce a new benchmark for RGBD VOS, named DepthVOS, which contains 350 videos (over 55k frames) annotated with masks and bounding boxes. Then, we propose a novel, strong baseline model, Fused Color-Depth Network (FusedCDNet), which can be learned merely under box...

10.1109/tip.2024.3374130 article EN IEEE Transactions on Image Processing 2024-01-01

A factorization based active contour model for texture segmentation

Mingqi Gao Hengxin Chen Shenhai Zheng Bin Fang

This paper presents a factorization-based active contour model for 2-phase texture segmentation. We utilize the local spectral histogram as texture features, and then establish a novel energy function based on the theory of matrix decomposition. Unlike existing methods, we only choose the combination weights from the object region and the background region to handle the motion of the curve. We compare the proposed method with recent methods, and experiments are performed on synthetic and real-world images. The experimental results show that our method is more robust...

10.1109/icip.2016.7533173 article EN 2016 IEEE International Conference on Image Processing (ICIP) 2016-08-17

Automatic liver tumour segmentation in CT combining FCN and NMF-based deformable model

Shenhai Zheng Bin Fang Laquan Li Mingqi Gao Yi Wang and 1 more

Automatic liver tumour segmentation is an important step towards digital medical research, clinical diagnosis and therapy planning. However, the existence of noise, low contrast and heterogeneity makes automatic liver tumour segmentation a remaining open challenge. In this work, we focus on a novel method to segment liver tumours in abdomen images from CT scans using fully convolutional networks (FCN) and a non-negative matrix factorization (NMF) based deformable model. We train the FCN for semantic segmentation on training data preprocessed by BM3D. The...

10.1080/21681163.2018.1493618 article EN Computer Methods in Biomechanics and Biomedical Engineering Imaging & Visualization 2019-06-27

Video Object Segmentation using Point-based Memory Network

Mingqi Gao Jungong Han Feng Zheng James J. Q. Yu Giovanni Montana

Recent years have witnessed the prevalence of memory-based methods for Semi-supervised Video Object Segmentation (SVOS), which utilise past frames efficiently for label propagation. When conducting feature matching, fine-grained multi-scale feature matching has typically been performed using all query points, which inevitably results in redundant computations and thus makes the fusion ineffective. In this paper, we develop a new Point-based Memory Network, termed PMNet, to perform matching on hard samples only, assuming...

10.1016/j.patcog.2022.109073 article EN cc-by Pattern Recognition 2022-09-26

Decoupling Multimodal Transformers for Referring Video Object Segmentation

Mingqi Gao Jinyu Yang Jungong Han Ke Lü Feng Zheng and 1 more

Referring Video Object Segmentation (RVOS) aims to segment the text-depicted object from video sequences. With excellent capabilities in long-range modelling and information interaction, transformers have been increasingly applied in existing RVOS architectures. To better leverage multimodal data, most efforts focus on the interaction between visual and textual features. However, they ignore the syntactic structures of the text during the interaction, where all components are intertwined, resulting in ambiguous vision-language...

10.1109/tcsvt.2023.3284979 article EN IEEE Transactions on Circuits and Systems for Video Technology 2023-06-09

A Novel Race Classification Method Based on Periocular Features Fusion

Hengxin Chen Mingqi Gao Karl Ricanek Wei Xu Bin Fang

Race identification is an essential ability for human eyes. Race classification by machine based on face images can be used in some practical application fields. Employing holistic analysis, local feature extraction and 3D models, many race classification methods have been introduced. In this paper, we propose a novel fusion of periocular region features for classifying East Asian from Caucasian. With the landmarks, we extract five textures or geometrical features from interesting regions which contain available discriminating...

10.1142/s0218001417500264 article EN International Journal of Pattern Recognition and Artificial Intelligence 2017-01-23

B-Spline based globally optimal segmentation combining low-level and high-level information

Shenhai Zheng Bin Fang Laquan Li Mingqi Gao Rui Chen and 1 more

10.1016/j.patcog.2017.08.011 article EN Pattern Recognition 2017-08-07

A variational approach to liver segmentation using statistics from multiple sources

Shenhai Zheng Bin Fang Laquan Li Mingqi Gao Yi Wang

Medical image segmentation plays an important role in digital medical research, and in therapy planning and delivery. However, the presence of noise and low contrast renders automatic liver segmentation an extremely challenging task. In this study, we focus on a variational approach to liver segmentation in computed tomography scan volumes in a semiautomatic, slice-by-slice manner. In this method, one slice is selected and its connected component liver region is determined manually to initialize the subsequent segmentation process. From this guiding slice, we execute the proposed method downward...

10.1088/1361-6560/aaa360 article EN Physics in Medicine and Biology 2017-12-21

Unveiling the Power of Visible-Thermal Video Object Segmentation

Jinyu Yang Mingqi Gao Runmin Cong Chengjie Wang Feng Zheng and 1 more

Despite recent progress, Video Object Segmentation (VOS) remains challenging in complex situations such as low light and dark scenes. In this paper, we tackle the visibility limitations by introducing thermal information as auxiliary information for VOS. Specifically, we generate a hybrid benchmark dataset for Visible-Thermal VOS, named VisT300, which contains 300 videos with visible frames and corresponding object mask annotations. Besides, a Visible-Thermal integration Network, VTiNet, is proposed to use both cross-modal and cross-frame...

10.1109/tcsvt.2023.3345852 article EN IEEE Transactions on Circuits and Systems for Video Technology 2023-12-21

Multi-scale B-spline level set segmentation based on Gaussian kernel equalization

Shenhai Zheng Bin Fang Patrick S. P. Wang Laquan Li Mingqi Gao

Images with weak contrast, overlapped noise and texture of the object and background cause many PDE-based methods to fail. To address these problems, this paper presents a novel combined multi-scale variational framework for a level set segmentation model. Its formulation consists of an edge-based term, a region-based term and a shape constraint term. The edge-based term is constructed using a newly defined edge stopping function. The region-based term is derived from a parameter-free Gaussian probability density function (pdf), and multiple kernels are used to...

10.1109/icip.2016.7533175 article EN 2016 IEEE International Conference on Image Processing (ICIP) 2016-08-17

Multi-scale Location-aware Kernel Representation for Object Detection

Hao Wang Qilong Wang Mingqi Gao Peihua Li Wangmeng Zuo

Although Faster R-CNN and its variants have shown promising performance in object detection, they only exploit a simple first-order representation of object proposals for final classification and regression. Recent methods demonstrate that the integration of high-order statistics into deep convolutional neural networks can achieve impressive improvement, but their goal is to model whole images by discarding location information, so they cannot be directly adopted for object detection. In this paper, we make an attempt...

10.48550/arxiv.1804.00428 preprint EN cc-by arXiv (Cornell University) 2018-01-01

Texture image segmentation using fused features and active contour

Mingqi Gao Hengxin Chen Shenhai Zheng Bin Fang Lin Zhang

This paper introduces an effective active contour model for texture segmentation. To improve the robustness against noise and illumination, a novel descriptor named local statistical variation degree (LSVD) is presented to express textural features, which uses corner point deletion and isolated region detection operations to eliminate image patches unrelated to object regions. Fused features combining LSVD with Gabor features can then be constructed to describe the texture structure in many scenes. During the segmentation stage,...

10.1109/icpr.2016.7899935 article EN 2016-12-01

A novel variational method for liver segmentation based on statistical shape model prior and enforced local statistical feature

Shenhai Zheng Bin Fang Laquan Li Mingqi Gao Hong-Suo Zhang and 2 more

Medical image segmentation plays an important role in digital medical research, therapy planning, and computer-aided diagnosis. However, the existence of noise and low contrast means that automatic liver segmentation remains an open challenge. In this work, we focus on a novel variational semi-automatic liver segmentation method. First, we used signed distance functions (SDF) representing the pattern shapes to build a statistical shape model. Then a global Gaussian fitting energy and an enforced local statistical feature were established to guide the PCA-based topological...

10.1109/isbi.2017.7950515 article EN 2017 IEEE 14th International Symposium on Biomedical Imaging (ISBI 2017) 2017-04-01

An improved active shape model method for facial landmarking based on relative position feature

Hengxin Chen Mingqi Gao Bin Fang

Active Shape Model (ASM) is a highly effective method of facial landmarking. It employs two models, a profile model and a shape model, to match the position of each landmark. In this paper, we introduce a new method based on the relative position feature (RPF) in a local region to improve ASM. We found that landmarks with larger matching error have more displacement. So, in our method, RPF is used to adjust the displacement in every iteration. STASM (Stacked ASM) is a practical standard ASM that has proved to be the best at locating face landmarks. Our experiments...

10.1142/s0219691317500084 article EN International Journal of Wavelets Multiresolution and Information Processing 2016-11-17

Experience the dougong construction in virtual reality

Jie Zhang Hengxin Chen Jiahui Wang Mingqi Gao

Dougong is a unique element of traditional Chinese architecture. At university, architecture students usually use videos, pictures, and even handmade crafts to learn about Dougong. However, making these complicated components by hand requires a lot of facilities. To solve these problems, this paper builds a learning application using Virtual Reality (VR) technology, where students can master how to construct Dougong by interacting with virtual models. In addition to the learning module, the application creates a simulated scene showing great...

10.1145/3281505.3281666 article EN 2018-11-28

SCN: Dilated silhouette convolutional network for video action recognition

Michelle Hua Mingqi Gao Zichun Zhong

10.1016/j.cagd.2021.101965 article EN Computer Aided Geometric Design 2021-02-01

Better than Random: Reliable NLG Human Evaluation with Constrained Active Sampling

Jie Ruan Xiao Pu Mingqi Gao Xiaojun Wan Yuesheng Zhu

Human evaluation is viewed as a reliable evaluation method for NLG, but it is expensive and time-consuming. To save labor and costs, researchers usually perform human evaluation on a small subset of data sampled from the whole dataset in practice. However, different selections of subsets will lead to different rankings of the systems. To give a more correct inter-system ranking and make the gold standard human evaluation more reliable, we propose a Constrained Active Sampling Framework (CASF) for reliable human judgment. CASF operates through a Learner, a Systematic Sampler and a Controller to select...

10.1609/aaai.v38i17.29857 article EN Proceedings of the AAAI Conference on Artificial Intelligence 2024-03-24

Better than Random: Reliable NLG Human Evaluation with Constrained Active Sampling

Jie Ruan Xiao Pu Mingqi Gao Xiaojun Wan Yuesheng Zhu

Human evaluation is viewed as a reliable evaluation method for NLG, but it is expensive and time-consuming. To save labor and costs, researchers usually perform human evaluation on a small subset of data sampled from the whole dataset in practice. However, different selections of subsets will lead to different rankings of the systems. To give a more correct inter-system ranking and make the gold standard human evaluation more reliable, we propose a Constrained Active Sampling Framework (CASF) for reliable human judgment. CASF operates through a Learner, a Systematic Sampler and a Controller to select...

10.48550/arxiv.2406.07967 preprint EN arXiv (Cornell University) 2024-06-12

Themis: Towards Flexible and Interpretable NLG Evaluation

Xinyu Hu Lin Li Mingqi Gao Xunjian Yin Xiaojun Wan

The evaluation of natural language generation (NLG) tasks is a significant and longstanding research issue. With the recent emergence of powerful large language models (LLMs), some studies have turned to LLM-based automatic evaluation methods, which demonstrate great potential to become a new evaluation paradigm following traditional string-based and model-based metrics. However, despite the improved performance of existing methods, they still possess some deficiencies, such as dependency on references and limited flexibility. Therefore, in this paper, we...

10.48550/arxiv.2406.18365 preprint EN arXiv (Cornell University) 2024-06-26