Chao Zeng

ORCID: 0000-0001-5705-8789
Publications
Citations
Views
---
Saved
---
About
Contact & Profiles
Research Areas
  • Multimodal Machine Learning Applications
  • Advanced Image and Video Retrieval Techniques
  • Domain Adaptation and Few-Shot Learning
  • Video Analysis and Summarization
  • Advanced Image Fusion Techniques
  • Natural Language Processing Techniques
  • Speech Recognition and Synthesis
  • Topic Modeling
  • Image and Video Quality Assessment
  • Visual Attention and Saliency Detection
  • ECG Monitoring and Analysis
  • Image Enhancement Techniques
  • Robotics and Sensor-Based Localization
  • Non-Invasive Vital Sign Monitoring
  • Industrial Vision Systems and Defect Detection
  • Advanced Numerical Analysis Techniques
  • Web Data Mining and Analysis
  • Video Surveillance and Tracking Methods
  • World Trade Organization Law
  • Advanced Measurement and Metrology Techniques
  • Spam and Phishing Detection
  • Image and Signal Denoising Methods
  • Advanced Image Processing Techniques
  • EEG and Brain-Computer Interfaces
  • Medical Image Segmentation Techniques

Xi'an Jiaotong University
2024

Hubei University
2023

City University of Hong Kong
2021-2022

Zhejiang University of Technology
2020-2022

Fukuoka Institute of Technology
2018

Cross-modal hashing has received widespread attentions on cross-modal retrieval task due to its superior efficiency and low storage cost. However, most existing methods learn binary codes directly from multimedia data, which cannot fully utilize the semantic knowledge of data. Furthermore, they ranking based similarity relevance data points with multi-label. And usually use a relax constraint hash code causes non-negligible quantization loss in optimization. In this paper, method called Deep...

10.1145/3372278.3390711 article EN 2020-06-02

With the rapid development of deep neural networks, cross-modal hashing has made great progress. However, information different types data is asymmetrical, that to say, if resolution an image high enough, it can reproduce almost 100% real-world scenes. text usually carries personal emotion and not objective so we generally think will be much richer than text. Although most existing methods unify semantic feature extraction hash function learning modules for end-to-end learning, they ignore...

10.1109/tnnls.2022.3174970 article EN IEEE Transactions on Neural Networks and Learning Systems 2022-05-23

QRS complex detection is usually the most important step for automated electrocardiogram (ECG) analysis. In this paper, we present a new approach of detection. The Mexican-hat wavelet and Morlet are used to transform ECG signal, according trait that modulus maxima two coefficients above correspond with R peaks detector unit waves which based on jump sequence in coefficient proposed. Traditional methods employs only one kind perform transformation ECG, whereas proposed method use at same...

10.4304/jcp.8.11.2951-2958 article EN Journal of Computers 2013-11-01

There is a huge demand on multilingual tourism information of Japan because the increasing number tourists from foreign countries. Most them may expect typical and stereotyped culture, nature, modern society Japan. However, people different backgrounds, cultures, languages might aspects Japan, as well. In this paper, we analyze these kinds differences cultural preference for We propose machine‐learning‐based method to figure out countries based comparing access logs site in languages. focus...

10.1002/tee.22841 article EN IEEJ Transactions on Electrical and Electronic Engineering 2018-12-04

Email classification methods based on the content general use Vector Space Model. The model is constructed frequency of every independent word appearing in content. Frequency VSM does not take context environment into account, thus feature vectors can accurately represent content, which will result inaccurate classification. This paper presents a new approach to Concept Model using WordNet. In our approach, WordNet we extract high-level information categories during training process by...

10.1109/fgcns.2008.7 article EN 2008-12-01

10.1007/s13042-022-01506-w article EN International Journal of Machine Learning and Cybernetics 2022-03-25

Image processing on surfaces has drawn significant interest in recent years, particularly the context of denoising. Salt-and-pepper noise is a special type which randomly sets portion image pixels to minimum or maximum intensity while keeping others unaffected. In this paper, We propose L$_p$TV models triangle meshes recover images corrupted by salt-and-pepper surfaces. establish lower bound for data fitting term recovered image. Motivated property, we corresponding algorithm based proximal...

10.48550/arxiv.2409.11139 preprint EN arXiv (Cornell University) 2024-09-17

<title>Abstract</title> The thermal error samples of the machine tool feed system are limited and highly nonlinear, making it difficult for a single prediction model to accurately predict errors under complex operating conditions. To improve accuracy, this paper proposes modeling method based on fusion MCNN-LSTM-SVM tailored small samples, combining deep learning with traditional algorithms. A Multi-Scale Convolutional Neural Network (MCNN) Long Short-Term Memory (LSTM) work in parallel...

10.21203/rs.3.rs-5377194/v1 preprint EN cc-by Research Square (Research Square) 2024-11-27

With the rapid growth in scale and complexity of large language models (LLMs), costs training inference have risen substantially. Model compression has emerged as a mainstream solution to reduce memory usage computational overhead. This paper presents Group Quantization Sparse Acceleration (\textbf{GQSA}), novel technique tailored for LLMs. Traditional methods typically focus exclusively on either quantization or sparsification, but relying single strategy often results significant...

10.48550/arxiv.2412.17560 preprint EN arXiv (Cornell University) 2024-12-23

<p indent=0mm>Cross-modal Hashing has received a lot of attentions in the field cross-modal retrieval due to its high efficiency and low storage cost. Most existing methods learn Hash codes directly from multimodal data cannot fully utilize semantic information data, so distribution consistency low-dimensional features across modalities be guaranteed. To this end, adversarial projection learning based for (APLH) is proposed, which uses training different ensure modalities. On basis, matching...

10.3724/sp.j.1089.2021.18599 article EN Journal of Computer-Aided Design & Computer Graphics 2021-06-01

The image captioning task is about to generate suitable descriptions from images. For this there can be several challenges such as accuracy, fluency and diversity. However are few metrics that cover all these properties while evaluating results of models.In paper we first conduct a comprehensive investigation on contemporary metrics. Motivated by the auto-encoder mechanism research advances word embeddings propose learning based for captioning, which call Intrinsic Image Captioning...

10.48550/arxiv.2012.07333 preprint EN other-oa arXiv (Cornell University) 2020-01-01

Image registration has become a hot topic in compute vision and shown its variety applications medical areas. The best alignment of images could be estimated by maximization cost function using an iterative optimization, thus optimization is one the most important parts image method. Local optimizations, such as Powell’s method, frequently failed due to existence local minima function, therefore, global methods are required. In this paper, new evolutionary algorithm, group search optimizer...

10.4156/jcit.vol8.issue6.119 article EN Journal of Convergence Information Technology 2013-03-31

Automatically evaluating the quality of image captions can be very challenging since human language is quite flexible that there various expressions for same meaning. Most current captioning metrics rely on token level matching between candidate caption and ground truth label sentences. It usually neglects sentence-level information. Motivated by auto-encoder mechanism contrastive representation learning advances, we propose a learning-based metric captioning, which call Intrinsic Image...

10.48550/arxiv.2106.15312 preprint EN other-oa arXiv (Cornell University) 2021-01-01
Coming Soon ...