NFDI4DS | UHH-SEMS - Publication Details

Mingyu You

ORCID: 0000-0003-2758-167X

Contact & Profiles

A5010299064

Research Areas

Advanced Neural Network Applications
Face and Expression Recognition
Emotion and Mood Recognition
Human Pose and Action Recognition
Speech and Audio Processing
Multimodal Machine Learning Applications
Traditional Chinese Medicine Studies
Video Surveillance and Tracking Methods
Advanced Image and Video Retrieval Techniques
Domain Adaptation and Few-Shot Learning
Respiratory and Cough-Related Research
Face Recognition and Analysis
Voice and Speech Disorders
Robot Manipulation and Learning
Speech Recognition and Synthesis
Infant Health and Development
3D Shape Modeling and Analysis
Reinforcement Learning in Robotics
Image Retrieval and Classification Techniques
Biomedical Text Mining and Ontologies
Gait Recognition and Analysis
Advanced Vision and Imaging
Advanced Data Compression Techniques
Text and Document Classification Technologies
Music and Audio Processing

Tongji University
2016-2025

Shanghai Institute of Computing Technology
2024

Jiangsu Province Hospital
2023

Nanjing Medical University
2023

Nanjing University
2008-2011

Shanghai Dianji University
2009

Zhejiang University
2004-2008

Shanghai University of Engineering Science
2008

Self-Training With Progressive Augmentation for Unsupervised Cross-Domain Person Re-Identification

OPENALEX - Publications

Xinyu Zhang Jiewei Cao Chunhua Shen Mingyu You

Person re-identification (Re-ID) has achieved great improvement with deep learning and a large amount of labelled training data. However, it remains a challenging task to adapt a model trained on labelled source-domain data to a target domain where only unlabelled data is available. In this work, we develop a self-training method with a progressive augmentation framework (PAST) to promote the model performance progressively on the target dataset. Specifically, our PAST framework consists of two stages, namely, the conservative stage and the promoting stage. The conservative stage captures the local...

10.1109/iccv.2019.00831 article EN 2019 IEEE/CVF International Conference on Computer Vision (ICCV) 2019-10-01

Reading car license plates using deep neural networks

Hui Li Peng Wang Mingyu You Chunhua Shen

10.1016/j.imavis.2018.02.002 article EN Image and Vision Computing 2018-03-06

Mask Encoding for Single Shot Instance Segmentation

Rufeng Zhang Zhi Tian Chunhua Shen Mingyu You Youliang Yan

To date, instance segmentation is dominated by two-stage methods, as pioneered by Mask R-CNN. In contrast, one-stage alternatives cannot compete with Mask R-CNN in mask AP, mainly due to the difficulty of compactly representing masks, making the design of one-stage methods very challenging. In this work, we propose a simple single-shot instance segmentation framework, termed mask encoding based instance segmentation (MEInst). Instead of predicting the two-dimensional mask directly, MEInst distills it into a compact and fixed-dimensional representation vector, which allows the instance segmentation task to be...

10.1109/cvpr42600.2020.01024 preprint EN 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) 2020-06-01

Part-Guided Attention Learning for Vehicle Instance Retrieval

Xinyu Zhang Rufeng Zhang Jiewei Cao Dong Gong Mingyu You and 1 more

Vehicle instance retrieval (IR) often requires one to recognize the fine-grained visual differences between vehicles. Besides the holistic appearance of vehicles, which is easily affected by viewpoint variation and distortion, vehicle parts also provide crucial cues to differentiate near-identical vehicles. Motivated by these observations, we introduce a Part-Guided Attention Network (PGAN) to pinpoint the prominent part regions and effectively combine the global and local information for discriminative feature learning....

10.1109/tits.2020.3030301 article EN IEEE Transactions on Intelligent Transportation Systems 2020-10-29

Cough event classification by pretrained deep neural network

Jia‐Ming Liu Mingyu You Zheng Wang Guozheng Li Xianghuai Xu and 1 more

Cough is an essential symptom in respiratory diseases. In the measurement of cough severity, an accurate and objective cough monitor is expected by the respiratory disease society. This paper aims to introduce a better-performing algorithm, a pretrained deep neural network (DNN), to the cough classification problem, which is a key step in the cough monitor. The models are built in two steps, pretraining and fine-tuning, followed by a Hidden Markov Model (HMM) decoder to capture the temporal information of the audio signals. By unsupervised pretraining of a deep belief network, a good...

10.1186/1472-6947-15-s4-s2 article EN cc-by BMC Medical Informatics and Decision Making 2015-11-25

Diverse Knowledge Distillation for End-to-End Person Search

Xinyu Zhang Xinlong Wang Jia-Wang Bian Chunhua Shen Mingyu You

Person search aims to localize and identify a specific person from a gallery of images. Recent methods can be categorized into two groups, i.e., two-step and end-to-end approaches. The former views person search as two independent tasks and achieves dominant results using separately trained person detection and re-identification (Re-ID) models. The latter performs person search in an end-to-end fashion. Although the end-to-end approaches yield higher inference efficiency, they largely lag behind their two-step counterparts in terms of accuracy. In this paper, we argue that the gap...

10.1609/aaai.v35i4.16454 article EN Proceedings of the AAAI Conference on Artificial Intelligence 2021-05-18

Design of an Efficient CNN-Based Cough Detection System on Lightweight FPGA

Peng Peng Kai Jiang Mingyu You Jialin Xie Hongjun Zhou and 4 more

Precisely and automatically detecting the cough sound is of vital clinical importance. Nevertheless, due to privacy protection considerations, transmitting the raw audio data to the cloud is not permitted, and therefore there is a great demand for an efficient, accurate, and low-cost solution at the edge device. To address this challenge, we propose a semi-custom software-hardware co-design methodology to help build the cough detection system. Specifically, we first design a scalable and compact convolutional neural network (CNN) structure...

10.1109/tbcas.2023.3236976 article EN IEEE Transactions on Biomedical Circuits and Systems 2023-01-13

Emotion Recognition from Noisy Speech

Mingyu You Chun Chen Jiajun Bu Jia Liu Jianhua Tao

This paper presents an emotion recognition system from clean and noisy speech. Geodesic distance was adopted to preserve the intrinsic geometry of emotional speech. Based on geodesic distance estimation, an enhanced Lipschitz embedding was developed to embed the 64-dimensional acoustic features into a six-dimensional space. In order to avoid the problems brought by noise reduction, emotion recognition from noisy speech was performed directly. Linear discriminant analysis (LDA), principal component analysis (PCA) and feature selection by sequential forward selection (SFS) with support...

10.1109/icme.2006.262865 article EN 2006-07-01

A robust multimodal approach for emotion recognition

Mingli Song Mingyu You Na Li Chun Chen

10.1016/j.neucom.2007.07.041 article EN Neurocomputing 2008-03-13

Cough detection using deep neural networks

Jia‐Ming Liu Mingyu You Zheng Wang Guozheng Li Xianghuai Xu and 1 more

Cough detection and assessment have crucial clinical value for respiratory diseases. Subjective assessments are widely adopted in clinical measurement nowadays, but they are neither accurate nor reliable. An automatic and objective cough detection system is strongly expected. Automatic cough detection from audio signals has been studied in peer works, but these still face difficulties such as unsatisfactory accuracy or a lack of large-scale validation. In this paper, deep neural networks (DNN) are applied to model acoustic features in cough detection....

10.1109/bibm.2014.6999220 article EN 2014 IEEE International Conference on Bioinformatics and Biomedicine (BIBM) 2014-11-01

Embedded Feature Selection for Multi-label Classification of Music Emotions

Mingyu You Jiaming Liu Guozheng Li Yan Chen

When detecting emotions from music, many features are extracted from the original music data. However, there are redundant or irrelevant features, which will reduce the performance of classification models. Considering these feature problems, we propose an embedded feature selection method, called Multi-label Embedded Feature Selection (MEFS), to improve classification performance by selecting features. MEFS embeds a classifier and considers the label correlation. Three other representative multi-label feature selection methods, known as LP-Chi, max and avg, together with...

10.1080/18756891.2012.718113 article EN cc-by International Journal of Computational Intelligence Systems 2012-01-01

Adversarial Generation of Training Examples: Applications to Moving Vehicle License Plate Recognition

Xinlong Wang Zhipeng Man Mingyu You Chunhua Shen

Generative Adversarial Networks (GAN) have attracted much research attention recently, leading to impressive results for natural image generation. However, to date little success has been observed in using GAN-generated images for improving classification tasks. Here we attempt to explore, in the context of car license plate recognition, whether it is possible to generate synthetic training data using GAN to improve recognition accuracy. With a carefully-designed pipeline, we show that the answer is affirmative. First, a large-scale image set...

10.48550/arxiv.1707.03124 preprint EN other-oa arXiv (Cornell University) 2017-01-01

Cough signal recognition with Gammatone Cepstral Coefficients

Jia‐Ming Liu Mingyu You Guozheng Li Zheng Wang Xianghuai Xu and 4 more

Cough recognition is a valuable classification problem in healthcare. Generally, feature representation contributes a lot to the overall classification performance. In this paper, a novel feature extraction method, Gammatone Cepstral Coefficients (GTCC), is investigated for cough recognition. The accuracy of GTCC compared with MFCC is evaluated on a designed dataset following 10-fold cross-validation schemes. Considering the imbalance of that dataset, a weighted SVM is applied as the base classifier. The results indicate that GTCC surpass...

10.1109/chinasip.2013.6625319 article EN 2013-07-01

Intelligent ZHENG Classification of Hypertension Depending on ML-kNN and Information Fusion

Guozheng Li Shixing Yan Mingyu You Sheng Sun Aihua Ou

Hypertension is one of the major causes of heart cerebrovascular diseases. With a good accumulation of hypertension clinical data on hand, research on hypertension's ZHENG differentiation is an important and attractive topic, as Traditional Chinese Medicine (TCM) lies primarily in “treatment based on ZHENG differentiation.” From the view of data mining, ZHENG differentiation is modeled as a classification problem. In this paper, ML-kNN, a multilabel learning model, is used as the classification model for hypertension. Feature-level information fusion is also used for further utilization...

10.1155/2012/837245 article EN Evidence-based Complementary and Alternative Medicine 2012-01-01

Novel feature extraction method for cough detection using NMF

Mingyu You Huihui Wang Zeqin Liu Chong Chen Jia‐Ming Liu and 2 more

Cough is a common symptom in respiratory diseases. To provide valuable clinical information for cough diagnosis and monitoring, objectively evaluating the quantity and intensity of cough based on cough detection by pattern recognition technologies is needed. Cough detection aims to extract the boundaries of cough events from an audio stream. From spectral visualisation, it is found that the energy spectrum of the cough signal spreads widely over the whole frequency band, which is very different from the speech signal. However, almost all feature extraction methods in previous work...

10.1049/iet-spr.2016.0341 article EN IET Signal Processing 2017-01-13

Fully integer-based quantization for mobile convolutional neural network inference

Peng Peng Mingyu You Weisheng Xu Jiaxin Li

10.1016/j.neucom.2020.12.035 article EN Neurocomputing 2020-12-23

Dynamic dense CRF inference for video segmentation and semantic SLAM

Mingyu You Chaoxian Luo Hongjun Zhou Shaoqing Zhu

10.1016/j.patcog.2022.109023 article EN Pattern Recognition 2022-09-10

Self-Organised Sequential Multi-Agent Reinforcement Learning for Closely Cooperation Tasks

浩平川副 Mingyu You Hongjun Zhou Bin He

10.1109/lra.2025.3559828 article EN IEEE Robotics and Automation Letters 2025-01-01

Imagine: Image-Guided 3D Part Assembly with Structure Knowledge Graph

Weihao Wang Lan Yu Mingyu You Bin He

3D part assembly is a promising task in computer vision and robotics, focusing on assembling 3D parts together by predicting their 6-DoF poses. Like most 3D shape understanding tasks, existing methods primarily address this task by memorizing the poses of parts during the training process, leading to inaccuracies in complex assemblies and poor generalization to novel categories. In order to essentially improve the performance, structure knowledge of the target is indispensable before assembling, which abstracts potential composition...

10.1609/aaai.v39i8.32850 article EN Proceedings of the AAAI Conference on Artificial Intelligence 2025-04-11

Inquiry diagnosis of coronary heart disease in Chinese medicine based on symptom-syndrome interactions

Guozheng Li Sheng Sun Mingyu You Yalei Wang Guoping Liu

There is a long history of coronary heart disease (CHD) diagnosis and treatment in Chinese medicine (CM), but a formalized description of CM knowledge is still unavailable. This study aims to analyze a set of CM clinical data, which is important and urgent. Relative associated density (RAD) was used to analyze the one-way links between symptoms or syndromes or both. RAD results were further used for symptom selection. Analysis of the dataset of CHD revealed some significant relationships, not only between symptoms but also between syndromes. Using RAD to select symptoms based on different...

10.1186/1749-8546-7-9 article EN cc-by Chinese Medicine 2012-04-05

Systematic evaluation of deep face recognition methods

Mingyu You Xuan Han Yangliu Xu Li Li

10.1016/j.neucom.2020.01.023 article EN Neurocomputing 2020-01-13

MBFQuant: A Multiplier-Bitwidth-Fixed, Mixed-Precision Quantization Method for Mobile CNN-Based Applications

Peng Peng Mingyu You Kai Jiang Youzao Lian Weisheng Xu

Deploying Convolutional Neural Network (CNN)-based applications to mobile platforms can be challenging due to the conflict between the restricted computing capacity of mobile devices and the heavy computational overhead of running a CNN. Network quantization is a promising way of alleviating this problem. However, network quantization can result in accuracy degradation, especially in the case of compact CNN architectures that are designed for mobile applications. This paper presents a novel and efficient mixed-precision quantization pipeline, called MBFQuant. It redefines...

10.1109/tip.2023.3268562 article EN IEEE Transactions on Image Processing 2023-01-01

Speech Emotion Recognition using an Enhanced Co-Training Algorithm

Jia Liu Chun Chen Jiajun Bu Mingyu You Jianhua Tao

In previous systems of speech emotion recognition, supervised learning methods are frequently employed to train classifiers on lots of labeled examples. However, the labeling of abundant data requires much time and human effort. This paper presents an enhanced co-training algorithm to utilize a large amount of unlabeled speech utterances for building a semi-supervised learning system. It uses two conditionally independent attribute views (i.e. temporal features and statistic features) of unlabeled examples to augment a much smaller set of labeled examples. Our...

10.1109/icme.2007.4284821 article EN 2007-07-01