Yiming Xu

ORCID: 0000-0003-3011-700X
Publications
Citations
Views
---
Saved
---
About
Contact & Profiles
Research Areas
  • Anomaly Detection Techniques and Applications
  • Text and Document Classification Technologies
  • Domain Adaptation and Few-Shot Learning
  • Time Series Analysis and Forecasting
  • Network Security and Intrusion Detection
  • Spam and Phishing Detection
  • Multimodal Machine Learning Applications
  • Data Stream Mining Techniques
  • Data-Driven Disease Surveillance
  • Cancer-related molecular mechanisms research
  • Semantic Web and Ontologies
  • COVID-19 diagnosis using AI
  • Logic, programming, and type systems
  • Machine Learning and ELM
  • Topic Modeling
  • Biomedical Text Mining and Ontologies
  • Machine Learning and Data Classification
  • IoT-based Smart Home Systems
  • Viral Infections and Outbreaks Research
  • Emotion and Mood Recognition
  • Web Data Mining and Analysis
  • Advanced Image and Video Retrieval Techniques
  • Formal Methods in Verification
  • EEG and Brain-Computer Interfaces
  • Logic, Reasoning, and Knowledge

Tokyo Institute of Technology
2025

Bellevue Hospital Center
2023

Northwestern University
2020-2022

Meta (United States)
2021

Australian National University
2020

Xi’an University
2020

Fudan University
2019

10.1109/icassp49660.2025.10890602 article EN ICASSP 2022 - 2022 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) 2025-03-12

The goal of domain adaptation (DA) is to train a good model for target domain, with large amount labeled data in source but only limited the domain. Conventional closed set (CSDA) assumes and label spaces are same. However, this not quite practical real-world applications. In work, we study problem open (OSDA), which requires space partially overlap space. Consequently, solution OSDA unknown classes detection separation, normally achieved by introducing threshold prediction classes; however,...

10.1109/tnnls.2021.3105614 article EN IEEE Transactions on Neural Networks and Learning Systems 2021-08-30

In the cross-domain image classification scenario, domain adaption aims to address challenge of transferring knowledge obtained from source target that is regarded as similar but different domain. To get more reliable invariant representations, recent methods start consider class-level distribution alignment across and domains by adaptively assigning pseudo labels. However, these approaches are vulnerable error accumulation hence unable preserve category consistency. Because accuracy labels...

10.1145/3357384.3357918 article EN 2019-11-03

Ontology learning is a critical task in industry, dealing with identifying and extracting concepts captured text data such that these can be used different tasks, e.g. information retrieval. non-trivial due to several reasons limited amount of prior research work automatically learns domain specific ontology from data. In our work, we propose two-stage classification system learn an unstructured We first collect candidate concepts, which are classified into irrelevant collocates by...

10.5220/0009980100290039 preprint EN cc-by-nc-nd 2020-01-01

In model serving, having one fixed during the entire often life-long inference process is usually detrimental to performance, as data distribution evolves over time, resulting in lack of reliability trained on historical data. It important detect changes and retrain time. The existing methods generally have three weaknesses: 1) using only classification error rate signal, 2) assuming ground truth labels are immediately available after features from samples received 3) unable decide what use...

10.1109/bigdata52589.2021.9671279 article EN 2021 IEEE International Conference on Big Data (Big Data) 2021-12-15

Ontology learning is a critical task in industry, dealing with identifying and extracting concepts captured text data such that these can be used different tasks, e.g. information retrieval. non-trivial due to several reasons limited amount of prior research work automatically learns domain specific ontology from data. In our work, we propose two-stage classification system learn an unstructured We first collect candidate concepts, which are classified into irrelevant collocates by...

10.48550/arxiv.1903.04360 preprint EN other-oa arXiv (Cornell University) 2019-01-01

Common domain adaptation techniques assume that the source and target share an identical label space, which is problematic since when samples are unlabeled we have no knowledge on whether two domains same space. When this not case, existing methods fail to perform well because additional unknown classes also matched with during adaptation. In paper, tackle open set problem under assumption spaces only partially overlap, task becomes exist, how detect avoid aligning them domain. We propose...

10.1109/bigdata55660.2022.10020365 article EN 2021 IEEE International Conference on Big Data (Big Data) 2022-12-17

Abstract Aiming at the problem that definition of crowd abnormal behavior detection is ambiguous and difficult to combine with context semantics, an algorithm using OCC human emotion model combined entropy proposed. First calculate for crowd, determine whether value abnormal, if it further extract optical flow OF HOG. Then project into two-dimensional vector data, send CNN local feature extraction achieve description emotions. Finally, predict abnormality occurs according judgment factor....

10.1088/1742-6596/1622/1/012051 article EN Journal of Physics Conference Series 2020-09-01

We study the problem of visual question answering (VQA) in images by exploiting supervised domain adaptation, where there is a large amount labeled data source but only limited target with goal to train good model. A straightforward solution fine-tune pre-trained model using those data, it usually cannot work well due considerable difference between distributions and domains. Moreover, availability multiple modalities (i.e., images, questions answers) VQA poses further challenges...

10.48550/arxiv.1911.04058 preprint EN other-oa arXiv (Cornell University) 2019-01-01

At present, the detection of crowd abnormal behavior has problem unclear definition anomalies and mutual occlusion. Existing researches mostly detect appearance characteristics individuals, extract analyze movement groups. Anomaly is achieved through comparison parameters thresholds. However, context semantics cannot be effectively utilized consistent with actual situation. There a disconnection between description. In this paper, we use designed convolutional neural network to...

10.1145/3411016.3411166 article EN 2020-06-19

Practical natural language processing (NLP) tasks often exhibit long-tailed distributions accompanied by noisy labels, posing significant challenges to the generalization and robustness of complex models such as Deep Neural Networks (DNNs). Traditional resampling techniques like oversampling or undersampling, while commonly employed, can easily lead overfitting. A growing trend involves leveraging small amounts metadata learn data weights, alongside demonstrated benefits self-supervised...

10.1109/icmla58977.2023.00283 article EN 2023-12-15

In model serving, having one fixed during the entire often life-long inference process is usually detrimental to performance, as data distribution evolves over time, resulting in lack of reliability trained on historical data. It important detect changes and retrain time. The existing methods generally have three weaknesses: 1) using only classification error rate signal, 2) assuming ground truth labels are immediately available after features from samples received 3) unable decide what use...

10.48550/arxiv.2012.04759 preprint EN other-oa arXiv (Cornell University) 2020-01-01
Coming Soon ...