Mingqi Gao

ORCID: 0000-0002-8688-8228
Publications
Citations
Views
---
Saved
---
About
Contact & Profiles
Research Areas
  • Medical Image Segmentation Techniques
  • Advanced Neural Network Applications
  • Advanced Image and Video Retrieval Techniques
  • AI in cancer detection
  • Video Surveillance and Tracking Methods
  • Image Retrieval and Classification Techniques
  • Image and Object Detection Techniques
  • Visual Attention and Saliency Detection
  • Advanced Image Fusion Techniques
  • Target Tracking and Data Fusion in Sensor Networks
  • Control Systems and Identification
  • Face and Expression Recognition
  • Fault Detection and Control Systems
  • Image Enhancement Techniques
  • Face recognition and analysis
  • Image and Signal Denoising Methods
  • Fuzzy Logic and Control Systems
  • Automated Road and Building Extraction
  • Traffic Prediction and Management Techniques
  • Gait Recognition and Analysis
  • AI-based Problem Solving and Planning
  • Speech and dialogue systems
  • Human Pose and Action Recognition
  • Robotics and Sensor-Based Localization
  • Advanced Numerical Analysis Techniques

University of Warwick
2022-2024

Southern University of Science and Technology
2022-2024

Peking University
2024

Wayne State University
2021

Chongqing University
2016-2019

Harbin Institute of Technology
2018

Although Faster R-CNN and its variants have shown promising performance in object detection, they only exploit simple first-order representation of proposals for final classification regression. Recent methods demonstrate that the integration high-order statistics into deep convolutional neural networks can achieve impressive improvement, but their goal is to model whole images by discarding location information so cannot be directly adopted detection. In this paper, we make an attempt...

10.1109/cvpr.2018.00136 article EN 2018-06-01

Recently, extraction of blood vessels has aroused widespread interests in medical image analysis. In this work, to accelerate convergence speed and enhance the representation for discriminative features, we introduce residual block structure ResNet into 3D U-Net, construct a new Residual U-Net architect segment hepatic portal veins from abdominal CT volumes. addition, develop weighted Dice loss function cope with challenges pixel imbalance, vessel boundary segmentation small segmentation....

10.1109/icip.2019.8802951 article EN 2022 IEEE International Conference on Image Processing (ICIP) 2019-08-26

Depth information opens up opportunities for video object segmentation (VOS) to be more accurate and robust in complex scenes. However, RGBD VOS is still unexplored due the high-cost collection time-consuming annotation of data. In this work, we first introduce a new benchmark VOS, named DepthVOS, which contains 350 videos (over 55k frames) annotated with masks bounding boxes. Then, propose novel strong baseline model - Fused Color-Depth Network (FusedCDNet) can learned merely under box...

10.1109/tip.2024.3374130 article EN IEEE Transactions on Image Processing 2024-01-01

This paper presents a factorization based active contour model for 2-phase texture segmentation. We utilize the local spectral histogram as features, and then establish novel energy function on theory of matrix decomposition. Unlike existing methods, we only choose combination weights from object region background to handle motion curve. compare proposed method recently methods experiments are performed synthetic real-world images. The experimental results show that our is more robust...

10.1109/icip.2016.7533173 article EN 2022 IEEE International Conference on Image Processing (ICIP) 2016-08-17

Automatic liver tumour segmentation is an important step towards digital medical research, clinical diagnosis and therapy planning. However, the existence of noise, low contrast heterogeneity make automatic remaining open challenge. In this work, we focus on a novel method to segment in abdomen images from CT scans using fully convolutional networks (FCN) non-negative matrix factorization (NMF) based deformable model. We train FCN for semantic preprocessed training data by BM3D. The...

10.1080/21681163.2018.1493618 article EN Computer Methods in Biomechanics and Biomedical Engineering Imaging & Visualization 2019-06-27

Recent years have witnessed the prevalence of memory-based methods for Semi-supervised Video Object Segmentation (SVOS) which utilise past frames efficiently label propagation. When conducting feature matching, fine-grained multi-scale matching has typically been performed using all query points, inevitably results in redundant computations and thus makes fusion ineffective. In this paper, we develop a new Point-based Memory Network, termed as PMNet, to perform on hard samples only, assuming...

10.1016/j.patcog.2022.109073 article EN cc-by Pattern Recognition 2022-09-26

Referring Video Object Segmentation (RVOS) aims to segment the text-depicted object from video sequences. With excellent capabilities in long-range modelling and information interaction, transformers have been increasingly applied existing RVOS architectures. To better leverage multimodal data, most efforts focus on interaction between visual textual features. However, they ignore syntactic structures of text during where all components are intertwined, resulting ambiguous vision-language...

10.1109/tcsvt.2023.3284979 article EN IEEE Transactions on Circuits and Systems for Video Technology 2023-06-09

Race identification is an essential ability for human eyes. classification by machine based on face image can be used in some practical application fields. Employing holistic analysis, local feature extraction and 3D model, many race methods have been introduced. In this paper, we propose a novel fusion periocular region features classifying East Asian from Caucasian. With the landmarks, extract five textures or geometrical interesting regions which contain available discriminating...

10.1142/s0218001417500264 article EN International Journal of Pattern Recognition and Artificial Intelligence 2017-01-23

Medical image segmentation plays an important role in digital medical research, and therapy planning delivery. However, the presence of noise low contrast renders automatic liver extremely challenging task. In this study, we focus on a variational approach to computed tomography scan volumes semiautomatic slice-by-slice manner. method, one slice is selected its connected component region determined manually initialize subsequent process. From guiding slice, execute proposed method downward...

10.1088/1361-6560/aaa360 article EN Physics in Medicine and Biology 2017-12-21

Despite recent progress, Video Object Segmentation (VOS) remains challenging in complex situations such as low light and dark scenes. In this paper, we tackle the visibility limitations by introducing thermal information auxillary for VOS. Specifically, generate a hybrid benchmark dataset Visible-Thermal VOS, named VisT300, which contains 300 videos with visible frames corresponding object mask annotations. Besides, integration Network, VTiNet, is proposed to use both cross-modal cross-frame...

10.1109/tcsvt.2023.3345852 article EN IEEE Transactions on Circuits and Systems for Video Technology 2023-12-21

Images with weak contrast, overlapped noise and texture of the object background make many PDE based methods disabled. To address these problems, this paper presents a novel combined multi-scale variational framework level set segmentation model. Its formulation consists edge-based term, region-based term shape constraint term. The is constructed using newly defined edge stopping function. derived from parameter-free Gaussian probability density function (pdf) multiple kernel are used to...

10.1109/icip.2016.7533175 article EN 2022 IEEE International Conference on Image Processing (ICIP) 2016-08-17

Although Faster R-CNN and its variants have shown promising performance in object detection, they only exploit simple first-order representation of proposals for final classification regression. Recent methods demonstrate that the integration high-order statistics into deep convolutional neural networks can achieve impressive improvement, but their goal is to model whole images by discarding location information so cannot be directly adopted detection. In this paper, we make an attempt...

10.48550/arxiv.1804.00428 preprint EN cc-by arXiv (Cornell University) 2018-01-01

This paper introduces an effective active contour model for texture segmentation. To improve the robustness against noise and illumination, a novel descriptor named local statistical variation degree (LSVD) is presented to express textural features, which uses corner point deletion isolated region detection operations eliminate image patches unrelated with object regions. And then fused features combined LSVD Gabor can be constructed structure in many scene. During segmentation stage,...

10.1109/icpr.2016.7899935 article EN 2016-12-01

Medical image segmentation plays an important role in digital medical research, therapy planning, and computer aided diagnosis. However, the existence of noise low contrast make automatic liver remains open challenge. In this work we focus on a novel variational semi-automatic method. First, used signed distance functions (SDF) representing pattern shapes to build statistical shape model. Then global Gaussian fitting energy enforced local feature were established guide PCA-based topological...

10.1109/isbi.2017.7950515 article EN 2022 IEEE 19th International Symposium on Biomedical Imaging (ISBI) 2017-04-01

Active Shape Model (ASM) is a most effective method of facial landmarking. It employs two models, profile model and shape model, to match the position landmark. In this paper, we introduce new based on relative feature (RPF) in local region improve ASM. We found fact that landmarks with larger matching error have more displacement. So, our method, RPF used adjust displacement every iteration. STASM (Stacked ASM) practical standard ASM proved be best locating face landmarks. Our experiments...

10.1142/s0219691317500084 article EN International Journal of Wavelets Multiresolution and Information Processing 2016-11-17

Dougong is a unique culture in Chinese traditional architecture. In University, the Architectural students usually use video, pictures, and even handmade craft to learn knowledge about Dougong. However, making these complicated components by hands requires lot of facilities. To solve problems, this paper builds learning application using Virtual Reality (VR) technology, where can master how construct interacting with virtual models. addition module, creates simulated scene showing great...

10.1145/3281505.3281666 article EN 2018-11-28

Human evaluation is viewed as a reliable method for NLG which expensive and time-consuming. To save labor costs, researchers usually perform human on small subset of data sampled from the whole dataset in practice. However, different selection subsets will lead to rankings systems. give more correct inter-system ranking make gold standard reliable, we propose Constrained Active Sampling Framework (CASF) judgment. CASF operates through Learner, Systematic Sampler Controller select...

10.1609/aaai.v38i17.29857 article EN Proceedings of the AAAI Conference on Artificial Intelligence 2024-03-24

Human evaluation is viewed as a reliable method for NLG which expensive and time-consuming. To save labor costs, researchers usually perform human on small subset of data sampled from the whole dataset in practice. However, different selection subsets will lead to rankings systems. give more correct inter-system ranking make gold standard reliable, we propose Constrained Active Sampling Framework (CASF) judgment. CASF operates through Learner, Systematic Sampler Controller select...

10.48550/arxiv.2406.07967 preprint EN arXiv (Cornell University) 2024-06-12

The evaluation of natural language generation (NLG) tasks is a significant and longstanding research issue. With the recent emergence powerful large models (LLMs), some studies have turned to LLM-based automatic methods, which demonstrate great potential become new paradigm following traditional string-based model-based metrics. However, despite improved performance existing they still possess deficiencies, such as dependency on references limited flexibility. Therefore, in this paper, we...

10.48550/arxiv.2406.18365 preprint EN arXiv (Cornell University) 2024-06-26
Coming Soon ...