Ming Gao

ORCID: 0000-0001-5359-6213
Publications
Citations
Views
---
Saved
---
About
Contact & Profiles
Research Areas
  • Topic Modeling
  • Natural Language Processing Techniques
  • Advanced Graph Neural Networks
  • Text and Document Classification Technologies
  • Multimodal Machine Learning Applications
  • Speech Recognition and Synthesis
  • Advanced Computational Techniques and Applications
  • Service-Oriented Architecture and Web Services
  • Voice and Speech Disorders
  • Sentiment Analysis and Opinion Mining
  • Advanced Text Analysis Techniques
  • Phonetics and Phonology Research
  • Wireless Communication Security Techniques
  • Sensor Technology and Measurement Systems
  • Advanced Algorithms and Applications
  • Network Security and Intrusion Detection
  • Chinese history and philosophy
  • Data Management and Algorithms
  • Semantic Web and Ontologies
  • Security in Wireless Sensor Networks
  • Linguistics and Cultural Studies
  • Music and Audio Processing
  • Dysphagia Assessment and Management
  • Translation Studies and Practices
  • Biomedical Text Mining and Ontologies

University of Science and Technology of China
2024

East China Normal University
2015-2023

Dongbei University of Finance and Economics
2007-2022

Shanghai University
2019-2020

Shanghai University of Engineering Science
2019

Shanghai Key Laboratory of Trustworthy Computing
2015

Xizang Minzu University
2001-2006

Yuan Ze University
2003

The automatic construction of large-scale knowledge graphs has received much attention from both academia and industry in the past few years. Notable graph systems include Google Knowledge Graph, DBPedia, YAGO, NELL, Probase many others. organizes information a structured way by explicitly describing relations among entities. Since entity identification relation extraction are highly depending on language itself, data sources largely determine processed, extracted, ultimately how formed,...

10.1109/icdew.2015.7129545 article EN 2015-04-01

Relation Extraction (RE) is a vital step to complete Knowledge Graph (KG) by extracting entity relations from texts. However, it usually suffers the long-tail issue. This paper proposes novel approach learn relation prototypes unlabeled texts, facilitate RE transferring knowledge types with sufficient training data. We as an implicit factor between entities, which reflects meanings of and their proximities. construct co-occurrence graph capture both first-order second-order proximities for...

10.1109/tkde.2021.3096200 article EN IEEE Transactions on Knowledge and Data Engineering 2021-01-01

Relation extraction (RE) aims at extracting the relation between two entities from text corpora. It is a crucial task for Knowledge Graph (KG) construction. Most existing methods predict an entity pair by learning training sentences, which contain targeted pair. In contrast to distant supervision approaches that suffer insufficient corpora extract relations, our proposal of mining implicit mutual massive unlabeled transfers semantic information pairs into RE model, more expressive and...

10.1109/icde48307.2020.00093 article EN 2022 IEEE 38th International Conference on Data Engineering (ICDE) 2020-04-01

Dysarthric speech poses significant challenges for individuals with dysarthria, impacting their ability to communicate socially. Despite the widespread use of Automatic Speech Recognition (ASR), accurately recognizing dysarthric remains a formidable task, largely due limited availability data. To address this gap, we developed Chinese Dysarthria Database (CDSD), most extensive collection dysarthria data date, featuring 133 hours recordings from 44 speakers. Our benchmarks reveal best...

10.21437/interspeech.2024-1597 article EN Interspeech 2022 2024-09-01

10.1109/slt61566.2024.10832239 article EN 2022 IEEE Spoken Language Technology Workshop (SLT) 2024-12-02

Named Entity Recognition (NER) on Clinical Electronic Medical Records (CEMR) is a fundamental step in extracting disease knowledge by identifying specific entity terms such as diseases, symptoms, etc. However, the state-of-the-art NER methods based Long Short-Term Memory (LSTM) fail to exploit GPU parallelism fully under massive medical records. Although novel method Iterated Dilated CNNs (ID-CNNs) can accelerate network computing, it tends ignore word-order feature and semantic information...

10.3390/fi11090185 article EN cc-by Future Internet 2019-08-26

Relation Extraction (RE) is a vital step to complete Knowledge Graph (KG) by extracting entity relations from texts.However, it usually suffers the long-tail issue. The training data mainly concentrates on few types of relations, leading lackof sufficient annotations for remaining relations. In this paper, we propose general approach learn relation prototypesfrom unlabeled texts, facilitate extraction transferring knowledge with trainingdata. We prototypes as an implicit factor between...

10.48550/arxiv.2011.13574 preprint EN cc-by arXiv (Cornell University) 2020-01-01

Automatic text generation techniques with either extractive-based or generative-based methods are becoming increasingly popular and widely used in industry. In contrast to existing approaches that ignore the coherence, our proposed approach integrates both keyword coverage coherence into optimization framework. this paper, we employ semantics-based syntax-based metrics evaluate coherence. Extensive experiments on a real corpus demonstrate method outperforms baselines overall regrading ROUGE...

10.1109/ccis48116.2019.9073682 article EN 2019-12-01

A new method of building intelligent sentiment lexicon based on LDA and word clustering is put forward in this paper. In order to make seed words more representative universal, uses topic model build the term vectors select words. The improved SO-PMI algorithm has been used calculate emotional tendency each word. addition, domain lexicon's automatic extension update designed deal with dynamic corpus data. Experiments show that proposed can higher accuracy, reflect change words' real time. It...

10.1504/ijitm.2020.10028762 article EN International Journal of Information Technology and Management 2020-01-01

We study the problem of multimodal fusion in this paper. Recent exchanging-based methods have been proposed for vision-vision fusion, which aim to exchange embeddings learned from one modality other. However, most them project inputs multimodalities into different low-dimensional spaces and cannot be applied sequential input data. To solve these issues, paper, we propose a novel model MuSE text-vision based on Transformer. first use two encoders separately map spaces. Then employ decoders...

10.48550/arxiv.2309.02190 preprint EN cc-by arXiv (Cornell University) 2023-01-01

We present the Chinese Dysarthria Speech Database (CDSD) as a valuable resource for dysarthria research. This database comprises speech data from 24 participants with dysarthria. Among these participants, one recorded an additional 10 hours of data, while each hour, resulting in 34 material. To accommodate varying cognitive levels, our text pool primarily consists content AISHELL-1 dataset and speeches by primary secondary school students. When read texts, they must use mobile device or ZOOM...

10.48550/arxiv.2310.15930 preprint EN cc-by-nc-nd arXiv (Cornell University) 2023-01-01

This paper presents a topic-enhanced recurrent autoencoder model to improve the accuracy of sentiment classification short texts. First, concept is proposed tackle problems in recursive including 'increasing computation complexity' and 'ignoring natural word order'. Then, enhanced with topic information generated by joint sentiment-topic (JST) model. Besides, order identify negations ironies texts, lexicon utilised add feature dimensions for sentence representations. Experiments are...

10.1504/ijims.2020.10028655 article EN International Journal of Internet Manufacturing and Services 2020-01-01
Coming Soon ...