NFDI4DS | UHH-SEMS - Publication Details

Xinyuan Zhou

ORCID: 0000-0003-2815-8857

Publications

Citations

Views

---

Saved

---

About

Contact & Profiles

A5102864167

Research Areas

Speech Recognition and Synthesis
Speech and Audio Processing
Natural Language Processing Techniques
Music and Audio Processing
Topic Modeling
Air Quality Monitoring and Forecasting
Industrial Technology and Control Systems
Robot Manipulation and Learning
Advanced Sensor Technologies Research
Air Quality and Health Impacts
Atmospheric chemistry and aerosols
Visual Attention and Saliency Detection
Advanced Power Generation Technologies
Evolutionary Algorithms and Applications
Mechanical and Thermal Properties Analysis
Autoimmune and Inflammatory Disorders Research
Multimodal Machine Learning Applications
Advanced Image and Video Retrieval Techniques
Regional Development and Environment
Advanced SAR Imaging Techniques
Time Series Analysis and Forecasting
Color perception and design
Reservoir Engineering and Simulation Methods
Lysosomal Storage Disorders Research
Robotic Path Planning Algorithms

Sichuan University
2023-2025

Qingdao Huanghai University
2018-2024

University College London
2019-2024

Beijing Jiaotong University
2022-2023

Shanghai Normal University
2020-2023

Army Medical University
2023

Northwestern Polytechnical University
2021

National University of Singapore
2020

China Nonferrous Metal Mining (China)
2012

China National Petroleum Corporation (China)
2003

Application of XGBoost algorithm in the optimization of pollutant concentration

OPENALEX - Publications

Jiangtao Li Xingqin An Qingyong Li Chao Wang Haomin Yu and 2 more

10.1016/j.atmosres.2022.106238 article EN Atmospheric Research 2022-05-13

Learn to Navigate: Cooperative Path Planning for Unmanned Surface Vehicles Using Deep Reinforcement Learning

OPENALEX - Publications

Xinyuan Zhou Peng Wu Haifeng Zhang Weihong Guo Yuanchang Liu

Unmanned surface vehicle (USV) has witnessed a rapid growth in the recent decade and been applied various practical applications both military civilian domains. USVs can either be deployed as single unit or multiple vehicles fleet to conduct ocean missions. Central control of USV formations, path planning is key technology that ensures navigation safety by generating collision free trajectories. Compared with conventional algorithms, deep reinforcement learning (RL) based algorithms provides...

10.1109/access.2019.2953326 article EN cc-by IEEE Access 2019-01-01

Multi-Encoder-Decoder Transformer for Code-Switching Speech Recognition

OPENALEX - Publications

Xinyuan Zhou Emre Yılmaz Yanhua Long Yijie Li Haizhou Li

Code-switching (CS) occurs when a speaker alternates words of two or more languages within single sentence across sentences.Automatic speech recognition (ASR) CS has to deal with at the same time.In this study, we propose Transformer-based architecture symmetric language-specific encoders capture individual language attributes, that improve acoustic representation each language.These representations are combined using multi-head attention mechanism in decoder module.Each encoder and its...

10.21437/interspeech.2020-2488 article EN Interspeech 2022 2020-10-25

Optimization research on air quality numerical model forecasting effects based on deep learning methods

OPENALEX - Publications

Wei Wang Xingqin An Qingyong Li Yangli‐ao Geng Haomin Yu and 1 more

10.1016/j.atmosres.2022.106082 article EN Atmospheric Research 2022-02-14

The USTC-NELSLIP Offline Speech Translation Systems for IWSLT 2022

OPENALEX - Publications

Weitai Zhang Zhongyi Ye Haitao Tang Xiaoxi Li Xinyuan Zhou and 8 more

Weitai Zhang, Zhongyi Ye, Haitao Tang, Xiaoxi Li, Xinyuan Zhou, Jing Yang, Jianwei Cui, Pan Deng, Mohan Shi, Yifan Song, Dan Liu, Junhua Lirong Dai. Proceedings of the 19th International Conference on Spoken Language Translation (IWSLT 2022). 2022.

10.18653/v1/2022.iwslt-1.15 article EN cc-by 2022-01-01

Bridging Modality Gap with Large Speech and Language Models for End-to-End Speech-to-Text Translation

OPENALEX - Publications

Weitai Zhang Simran Naagar Zhongyi Ye Peiwang Tang Xinyuan Zhou and 2 more

10.1109/icassp49660.2025.10890787 article EN ICASSP 2022 - 2022 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) 2025-03-12

SAR Ship Detector Using Cross-stage Feature Fusion and Decoupled Head with Mutual Guidance

OPENALEX - Publications

Yixin Qiao Xiaoxiao Yin Xinyuan Zhou Shiyong Lan Guangming Deng

10.1109/icassp49660.2025.10890187 article EN ICASSP 2022 - 2022 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) 2025-03-12

Design of a high-precision exhaled breath detection circuit for 24-channel gas concentration sampling

OPENALEX - Publications

Wen-Jun Liu Kun Li Shuo Feng Yan Pu Xinyuan Zhou

10.1117/12.3067148 article EN 2025-05-09

Lithium ameliorates Niemann-Pick C1 disease phenotypes by impeding STING/SREBP2 activation

OPENALEX - Publications

Shiqian Han Qijun Wang Yongfeng Song Mao Pang Chunguang Ren and 10 more

Niemann-Pick disease type C (NP-C) is a genetic lysosomal disorder associated with progressive neurodegenerative phenotypes. Its therapeutic options are very limited. Here, we show that lithium treatment improves ataxia and feeding phenotypes, attenuates cerebellar inflammation degeneration, extends survival in Npc1 mouse models. In addition, suppresses STING activation, SREBP2 processing to its mature form the expression of target genes mice Npc1-deficient fibroblasts. Lithium impedes...

10.1016/j.isci.2023.106613 article EN cc-by-nc-nd iScience 2023-04-08

Multi-Channel Target Speech Extraction with Channel Decorrelation and Target Speaker Adaptation

OPENALEX - Publications

Jiangyu Han Xinyuan Zhou Yanhua Long Yijie Li

The end-to-end approaches for single-channel target speech extraction have attracted widespread attention. However, the studies multi-channel are still relatively limited. In this work, we propose two methods exploiting spatial information to extract speech. first one is using a adaptation layer in parallel encoder architecture. second designing channel decorrelation mechanism inter-channel differential enhance representation. We compare proposed with strong state-of-the-art baselines....

10.1109/icassp39728.2021.9414244 article EN ICASSP 2022 - 2022 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) 2021-05-13

Data-Centric Financial Large Language Models

OPENALEX - Publications

Zhixuan Chu Huaiyu Guo Xinyuan Zhou Yijia Wang Fei Yu and 7 more

Large language models (LLMs) show promise for natural tasks but struggle when applied directly to complex domains like finance. LLMs have difficulty reasoning about and integrating all relevant information. We propose a data-centric approach enable better handle financial tasks. Our key insight is that rather than overloading the LLM with everything at once, it more effective preprocess pre-understand data. create (FLLM) using multitask prompt-based finetuning achieve data pre-processing...

10.48550/arxiv.2310.17784 preprint EN other-oa arXiv (Cornell University) 2023-01-01

Pre-Trained Acoustic-and-Textual Modeling for End-To-End Speech-To-Text Translation

OPENALEX - Publications

Weitai Zhang Hanyi Zhang Chenxuan Liu Zhongyi Ye Xinyuan Zhou and 2 more

End-to-end paradigm has aroused more and interests attention for improving speech-to-text translation (ST) recently. Existing end-to-end models mainly attributes attempts to address the problem of modeling burden data scarcity, while always fail maintain both cross-modal cross-lingual mapping well at same time. In this work, we investigate methods endto-end ST with pre-trained acoustic-and-textual models. Our acoustic encoder decoder begins processing source speech sequence as usual. A...

10.1109/icassp48485.2024.10446635 article EN ICASSP 2022 - 2022 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) 2024-03-18

Self-and-Mixed Attention Decoder with Deep Acoustic Structure for Transformer-Based LVCSR

OPENALEX - Publications

Xinyuan Zhou Grandee Lee Emre Yılmaz Yanhua Long Jiaen Liang and 1 more

Transformer has shown impressive performance in automatic speech recognition.It uses an encoder-decoder structure with self-attention to learn the relationship between high-level representation of source inputs and embedding target outputs.In this paper, we propose a novel decoder that features self-and-mixed attention (SMAD) deep acoustic (DAS) improve Transformer-based LVCSR.Specifically, introduce mechanism multi-layer for multiple levels abstraction.We also design mixed learns alignment...

10.21437/interspeech.2020-2556 article EN Interspeech 2022 2020-10-25

CGF: A Category Guidance Based PM$_{2.5}$ Sequence Forecasting Training Framework

OPENALEX - Publications

Haomin Yu Jilin Hu Xinyuan Zhou Chenjuan Guo Bin Yang and 1 more

PM <inline-formula><tex-math notation="LaTeX">$_{2.5}$</tex-math></inline-formula> concentration forecasting is important yet challenging. First, complicated local fluctuations in concentrations disturb modeling global trends. Second, errors are often accumulated through an autoregressive process. To contend with the two challenges, we propose a C ategory G uidance based notation="LaTeX">${_{2.5}}$</tex-math></inline-formula> sequence F orecasting training framework...

10.1109/tkde.2023.3253703 article EN IEEE Transactions on Knowledge and Data Engineering 2023-03-08

Speech-and-Text Transformer: Exploiting Unpaired Text for End-to-End Speech Recognition

OPENALEX - Publications

Qinyi Wang Xinyuan Zhou Haizhou Li

10.1561/116.00000001 article EN cc-by-nc APSIPA Transactions on Signal and Information Processing 2023-01-01

Submission of USTC’s System for the IWSLT 2023 - Offline Speech Translation Track

OPENALEX - Publications

Xinyuan Zhou Jianwei Cui Zhongyi Ye Yi‐Chi Wang Luzhen Xu and 3 more

This paper describes the submissions of research group USTC-NELSLIP to 2023 IWSLT Offline Speech Translation competition, which involves translating spoken English into written Chinese. We utilize both cascaded models and end-to-end for this task. To improve performance models, we introduce Whisper reduce errors in intermediate source language text, achieving a significant improvement ASR recognition performance. For propose Stacked Acoustic-and-Textual En- coding extension (SATE-ex), feeds...

10.18653/v1/2023.iwslt-1.15 article EN cc-by 2023-01-01

Cognition, willingness, and behavior towards human papillomavirus vaccination in Chinese university students: Planned behavior, health beliefs, and media influence

OPENALEX - Publications

Xinyuan Zhou Thomas William Whyke Aiqing Wang

This study assessed Human papillomavirus (HPV) vaccination knowledge, willingness, and status among University of Nottingham Ningbo undergraduate students, utilizing the Theory Planned Behaviour (TPB) Health Belief Model (HBM). Self-administered questionnaires covered demographics, sexual behavior, factors influencing intentions. Quantitative qualitative analyses included t-tests, ANOVA, Pearson correlation, logistic regression, linear regression. Of 373 surveyed HPV rate was notably higher...

10.1177/20594364241230860 article EN cc-by-nc Global Media and China 2024-02-12

Decoupled Hyperbolic Graph Attention Network for Modeling Substitutable and Complementary Item Relationships

OPENALEX - Publications

Zhiheng Zhou Tao Wang Linfang Hou Xinyuan Zhou Mian Ma and 1 more

Modeling substitutable and complementary item relationships is a fundamental important topic for recommendation in e-commerce online scenarios. In the real world, are usually coupled, heterogeneous they also have abundant side information hierarchical data structures. Recently, to take full advantage of both sides topological structure, graph neural networks widely explored relationship modeling. However, existing methods crude decoupling relationships. Their model designs lack deep insight...

10.1145/3511808.3557281 article EN Proceedings of the 31st ACM International Conference on Information & Knowledge Management 2022-10-16

Multi-Encoder-Decoder Transformer for Code-Switching Speech Recognition

OPENALEX - Publications

Xinyuan Zhou Emre Yılmaz Yanhua Long Yijie Li Haizhou Li

Code-switching (CS) occurs when a speaker alternates words of two or more languages within single sentence across sentences. Automatic speech recognition (ASR) CS has to deal with at the same time. In this study, we propose Transformer-based architecture symmetric language-specific encoders capture individual language attributes, that improve acoustic representation each language. These representations are combined using multi-head attention mechanism in decoder module. Each encoder and its...

10.48550/arxiv.2006.10414 preprint EN other-oa arXiv (Cornell University) 2020-01-01

CNN-based Discriminative Training for Domain Compensation in Acoustic Event Detection with Frame-wise Classifier

OPENALEX - Publications

Tiantian Tang Xinyuan Zhou Yanhua Long Yijie Li Jiaen Liang

Domain mismatch is a noteworthy issue in acoustic event detection tasks, as the target domain data difficult to access most real applications. In this study, we propose novel CNN-based discriminative training framework compensation method handle issue. It uses parallel discriminator learn pair of high-level intermediate representations. Together with binary loss, discriminators are forced maximally exploit discrimination heterogeneous information each audio clip events, which results robust...

10.48550/arxiv.2103.14297 preprint EN other-oa arXiv (Cornell University) 2021-01-01

A Novel Method for Pairwise Alignment Based on an Ant Colony Algorithm

OPENALEX - Publications

LI Gang-cheng Xinyuan Zhou Jie Yang Huanwen Chen Qiong Cai and 1 more

10.1166/jctn.2010.1577 article EN Journal of Computational and Theoretical Nanoscience 2010-07-24

An Ant Colony Pairwise Alignment Based on the Simplified Grid

OPENALEX - Publications

Xinyuan Zhou Dachao Li Jiawei Luo Ping Zhang

10.1166/jctn.2010.1359 article EN Journal of Computational and Theoretical Nanoscience 2010-01-01

Self-and-Mixed Attention Decoder with Deep Acoustic Structure for Transformer-based LVCSR

OPENALEX - Publications

Xinyuan Zhou Grandee Lee Emre Yılmaz Yanhua Long Jiaen Liang and 1 more

The Transformer has shown impressive performance in automatic speech recognition. It uses the encoder-decoder structure with self-attention to learn relationship between high-level representation of source inputs and embedding target outputs. In this paper, we propose a novel decoder that features self-and-mixed attention (SMAD) deep acoustic (DAS) improve Transformer-based LVCSR. Specifically, introduce mechanism multi-layer for multiple levels abstraction. We also design mixed learns...

10.48550/arxiv.2006.10407 preprint EN other-oa arXiv (Cornell University) 2020-01-01

Coming Soon ...