- Speech Recognition and Synthesis
- Speech and Audio Processing
- Natural Language Processing Techniques
- Topic Modeling
- Music and Audio Processing
- Particle physics theoretical and experimental studies
- Distributed and Parallel Computing Systems
- Advanced Data Compression Techniques
- Traffic Prediction and Management Techniques
- Domain Adaptation and Few-Shot Learning
- Speech and dialogue systems
- Internet Traffic Analysis and Secure E-voting
- Mathematics, Computing, and Information Processing
- Anomaly Detection Techniques and Applications
- Advanced Optical Network Technologies
- Text Readability and Simplification
- Opportunistic and Delay-Tolerant Networks
- Advanced Data Storage Technologies
- Advanced Graph Neural Networks
- Explainable Artificial Intelligence (XAI)
- Bluetooth and Wireless Communication Technologies
- COVID-19 diagnosis using AI
- Computational Physics and Python Applications
- Advanced SAR Imaging Techniques
- Advanced Adaptive Filtering Techniques
- University of Manchester (2025)
- Northwestern Polytechnical University (2023-2024)
- National University of Defense Technology (2024)
- East Sussex County Council (2023)
- Microsoft (Finland) (2023)
- Fuzhou University (2023)
- Beihang University (2017-2022)
- Shenyang Jianzhu University (2022)
- Tongji University (2021)
- Dalian Maritime University (2021)
In collider experiments, the kinematic reconstruction of heavy, short-lived particles is vital for precision tests of the Standard Model and in searches for physics beyond it. Performing this reconstruction in events with many final-state jets, such as the all-hadronic decay of top-antitop quark pairs, is challenging. We present HyPER, a novel architecture based on graph neural networks that uses hypergraph representation learning to build more powerful and efficient representations of events. HyPER is used to reconstruct parent particles from sets...
Low-rank adaptation is a popular parameter-efficient fine-tuning method for large language models. In this paper, we analyze the impact of low-rank updating, as implemented in LoRA. Our findings suggest that the low-rank updating mechanism may limit the ability of LLMs to effectively learn and memorize new knowledge. Inspired by this observation, we propose a new method called MoRA, which employs a square matrix to achieve high-rank updating while maintaining the same number of trainable parameters. To achieve it, we introduce corresponding non-parameter...
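As a rough illustration of the contrast described in this abstract (not the paper's implementation), the sketch below compares a LoRA-style two-factor update with a single square-matrix update of a comparable trainable-parameter budget. The hidden size, rank, and initialisation are illustrative assumptions, and the non-parameterized input/output operators mentioned above are not sketched.

```python
import numpy as np

d, r = 1024, 8                      # hidden size and LoRA rank (illustrative)
rng = np.random.default_rng(0)

# LoRA-style update: two low-rank factors, so the weight delta has rank <= r.
A = rng.normal(size=(r, d)) * 0.01  # (r, d)
B = np.zeros((d, r))                # (d, r), initialised to zero
delta_lora = B @ A                  # (d, d), rank at most r
params_lora = A.size + B.size       # 2 * d * r trainable parameters

# MoRA-style idea: one square matrix with roughly the same parameter budget,
# whose rank is not capped at r.
m = int(np.sqrt(2 * d * r))         # side length so that m * m ~= 2 * d * r
M = np.zeros((m, m))
params_mora = M.size

print(params_lora, params_mora)                 # comparable budgets
print(np.linalg.matrix_rank(delta_lora) <= r)   # True: the LoRA delta rank is capped
```

The only point of the comparison is that the factorised update caps the rank of the weight change at r, while a square matrix of similar size does not.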
Personalized conversation models (PCMs) generate responses according to speaker preferences. Existing personalized tasks typically require extracting preferences from user descriptions or conversation histories, which are scarce for newcomers and inactive users. In this paper, we propose a few-shot personalized conversation task with an auxiliary social network. The task requires generating responses given only a few conversations, whereas existing methods are mainly designed to incorporate conversation histories and can hardly model speakers from so few conversations, let alone the connections between speakers. To better...
In the ICASSP 2023 Speech Signal Improvement Challenge, we developed a dual-stage neural model which improves speech quality degraded by different distortions in a stage-wise divide-and-conquer fashion. Specifically, in the first stage, the network focuses on recovering the missing components of the spectrum, while in the second stage, our model aims to further suppress noise, reverberation, and artifacts introduced by the first-stage model. Achieving a 0.446 final score and a 0.517 P.835 score, our system ranks 4th in the non-real-time track.
Ziheng Li, Shaohan Huang, Zihan Zhang, Zhi-Hong Deng, Qiang Lou, Haizhen Huang, Jian Jiao, Furu Wei, Weiwei Deng, Qi Zhang. Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers). 2023.
Accurate and efficient measures for evaluating the similarity between mathematical formulae play an important role in information retrieval. Most previous studies have focused on representing formulae of different types to capture their features, combined with traditional structure-matching algorithms. This paper presents a new unsupervised model called the N-ary Tree-based Formula Embedding Model (NTFEM) for the task of formula similarity evaluation. Using an n-ary tree to represent a formula, we convert the formula into a linear sequence that can...
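To make the tree-to-sequence step concrete, here is a minimal sketch of representing a formula as an n-ary tree and linearising it by pre-order traversal so a sequence model can consume it. The node labels, example formula, and traversal order are illustrative assumptions, not details from NTFEM.

```python
# Minimal sketch: formula as an n-ary tree, linearised by pre-order traversal.
class Node:
    def __init__(self, label, children=None):
        self.label = label
        self.children = children or []

def linearise(node):
    """Pre-order traversal: the operator first, then its operands."""
    tokens = [node.label]
    for child in node.children:
        tokens.extend(linearise(child))
    return tokens

# Example: a^2 + b^2 as an n-ary tree rooted at '+'
formula = Node("+", [
    Node("^", [Node("a"), Node("2")]),
    Node("^", [Node("b"), Node("2")]),
])

print(linearise(formula))   # ['+', '^', 'a', '2', '^', 'b', '2']
```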
Real-world data distributions are essentially long-tailed, which poses a great challenge to deep models. In this work, we propose a new method, Gradual Balanced Loss and Adaptive Feature Generator (GLAG), to alleviate the imbalance. GLAG first learns a balanced and robust feature model with Gradual Balanced Loss, then fixes the feature model and augments under-represented tail classes at the feature level with knowledge from well-represented head classes. The generated samples are mixed up with real training samples during training epochs. As a general loss, it can be combined with different...
The ROOT I/O (RIO) subsystem is foundational to most HEP experiments - it provides a file format, a set of APIs and semantics, and a reference implementation in C++. It is often found at the base of an experiment's framework and used to serialize data; in the case of an LHC experiment, this may be hundreds of petabytes of files! Individual physicists will further use RIO to perform their end-stage analysis, reading from intermediate files they generate from experiment data.
Packet loss is a common and unavoidable problem in voice over internet protocol (VoIP) systems. To deal with the problem, we propose a band-split packet loss concealment network (BS-PLCNet). Specifically, we split the full-band signal into a wide-band part (0-8 kHz) and a high-band part (8-24 kHz). The wide-band signals are processed by a gated convolutional recurrent network (GCRN), while the high-band counterpart is processed by a simple GRU network. To ensure high speech quality and automatic speech recognition (ASR) compatibility, a multi-task learning (MTL) framework including fundamental...
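The band split itself is straightforward to illustrate. The sketch below separates a 48 kHz signal into 0-8 kHz and 8-24 kHz STFT bins; the frame length, hop size, and window are illustrative assumptions, not the configuration used by BS-PLCNet.

```python
import numpy as np

sr, n_fft, hop = 48000, 960, 480
x = np.random.randn(sr)                         # 1 s of placeholder audio

frames = np.lib.stride_tricks.sliding_window_view(x, n_fft)[::hop]
spec = np.fft.rfft(frames * np.hanning(n_fft), axis=-1)    # (frames, n_fft//2 + 1)

freqs = np.fft.rfftfreq(n_fft, d=1.0 / sr)
wide_band = spec[:, freqs < 8000]    # 0-8 kHz -> larger model (GCRN in the paper)
high_band = spec[:, freqs >= 8000]   # 8-24 kHz -> lightweight GRU in the paper
print(wide_band.shape, high_band.shape)
```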
Language models (LMs) have recently shown superior performance in various speech generation tasks, demonstrating their powerful ability for semantic context modeling. Given the intrinsic similarity between speech generation and speech enhancement, harnessing semantic information is advantageous for speech enhancement tasks. In light of this, we propose SELM, a novel speech enhancement paradigm that integrates discrete tokens and leverages language models. SELM comprises three stages: encoding, modeling, and decoding. We transform continuous waveform signals...
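As a toy illustration of the encode-model-decode structure only, the sketch below maps a continuous waveform to discrete tokens and back with a uniform scalar quantiser; SELM itself uses a learned tokenizer and a language model for the middle stage, so every component here is a stand-in.

```python
import numpy as np

def encode(wave, n_tokens=256):
    """Continuous waveform -> discrete token ids (toy uniform quantiser)."""
    wave = np.clip(wave, -1.0, 1.0)
    return np.round((wave + 1.0) / 2.0 * (n_tokens - 1)).astype(int)

def decode(tokens, n_tokens=256):
    """Discrete token ids -> approximate waveform."""
    return tokens / (n_tokens - 1) * 2.0 - 1.0

wave = np.sin(2 * np.pi * 5 * np.linspace(0, 1, 1600))
tokens = encode(wave)                    # encoding stage
# ... a language model would operate on `tokens` here (modeling stage) ...
restored = decode(tokens)                # decoding stage
print(np.max(np.abs(wave - restored)))   # small reconstruction error
```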
In collider experiments, the kinematic reconstruction of heavy, short-lived particles is vital for precision tests of the Standard Model and in searches for physics beyond it. Performing this reconstruction in events with many final-state jets, such as the all-hadronic decay of top-antitop quark pairs, is challenging. We present HyPER, a graph neural network that uses blended graph-hypergraph representation learning to reconstruct parent particles from sets of final-state objects. HyPER is tested on simulation and shown to perform favorably when compared to existing...
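A minimal sketch of the hypergraph idea, under loose assumptions: final-state objects are nodes, a candidate parent (for example a hadronically decaying top quark) corresponds to a hyperedge joining three jets, and each candidate hyperedge gets a score. The random scoring function below is only a placeholder for HyPER's learned network.

```python
import itertools
import numpy as np

rng = np.random.default_rng(0)
jets = rng.normal(size=(6, 4))        # 6 jets, 4 features each (placeholder kinematics)
W = rng.normal(size=(4,))             # placeholder "model" weights

def score(hyperedge):
    """Score a candidate hyperedge from the summed features of its nodes."""
    return float(jets[list(hyperedge)].sum(axis=0) @ W)

candidates = list(itertools.combinations(range(6), 3))   # all 3-jet hyperedges
best = max(candidates, key=score)
print(best)   # the jet triplet the toy scorer assigns to one parent candidate
```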
Multimodal large language models (MLLMs) have shown promising advancements in general visual and language understanding. However, the representation of multimodal information using MLLMs remains largely unexplored. In this work, we introduce a new framework, E5-V, designed to adapt MLLMs to achieve universal multimodal embeddings. Our findings highlight the significant potential of MLLMs in representing multimodal inputs compared to previous approaches. By leveraging MLLMs with prompts, E5-V effectively bridges the modality gap between different types...
Multi-distribution learning (MDL), which seeks to learn a shared model that minimizes the worst-case risk across $k$ distinct data distributions, has emerged as a unified framework in response to the evolving demand for robustness, fairness, multi-group collaboration, etc. Achieving data-efficient MDL necessitates adaptive sampling, also called on-demand sampling, throughout the learning process. However, there exist substantial gaps between the state-of-the-art upper and lower bounds on the optimal sample complexity. Focusing...
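For reference, the worst-case objective described in this abstract can be written as a minimax risk; the notation below (hypothesis class $\mathcal{H}$, loss $\ell$, per-distribution risk $R_i$) is standard but chosen here for illustration rather than taken from the paper.

```latex
% Worst-case (minimax) risk over k data distributions D_1, ..., D_k:
% the learner picks one shared hypothesis h that is good for every distribution.
\[
  h^\star \;=\; \arg\min_{h \in \mathcal{H}} \; \max_{1 \le i \le k} \; R_i(h),
  \qquad
  R_i(h) \;=\; \mathbb{E}_{(x,y) \sim \mathcal{D}_i}\!\left[\ell\big(h(x), y\big)\right].
\]
```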
Graph neural networks have been widely used in recent recommender systems, where negative sampling plays an important role. Existing methods restrict the relationship between nodes to either hard positive pairs or hard negative pairs. This leads to a loss of structural information and lacks a mechanism for generating positive pairs for nodes with few neighbors. To overcome these limitations, we propose a novel soft link-based sampling method, namely MixDec Sampling, which consists of a Mixup Sampling module and a Decay Sampling module. The Mixup Sampling module augments node features by...
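In the spirit of the mixup idea mentioned above (a sketch, not the paper's module), one can interpolate two node embeddings and give the synthetic sample a correspondingly soft label instead of a hard 0/1 label. The embedding size and Beta parameters below are illustrative assumptions.

```python
import numpy as np

rng = np.random.default_rng(0)
pos_emb = rng.normal(size=(64,))        # embedding of a positive item
neg_emb = rng.normal(size=(64,))        # embedding of a sampled negative item

lam = rng.beta(0.5, 0.5)                # mixing coefficient
mixed_emb = lam * pos_emb + (1 - lam) * neg_emb   # soft, "mixed" sample
soft_label = lam                                  # label interpolated the same way
print(round(float(soft_label), 3), mixed_emb.shape)
```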
When processing large amounts of data, the rate at which reading and writing can take place is a critical factor. High energy physics data processing relying on ROOT is no exception. The recent parallelisation of the LHC experiments' software frameworks and the analysis of the ever increasing amount of collision data collected by the experiments have further emphasised this issue and the underlying need for implicit parallelism expressed within the I/O.
Research on the upgrading and modification of existing railway signal systems has found that the line basic datasheet used in data engineering cannot satisfy application requirements, so it is necessary to study its optimization. This paper presents a method for its generation process and verification. Based on an analysis of the update requirements, a hierarchical block storage model is established by expanding the datasheet, together with a design of the generation process. Formal verification combines UML with a NuSMV model, which verifies the activity, certainty...
The aim of this paper is to solve some specific integrals such as , and . The traditional method is that people normally decompose the polynomial into several partial fractions first. This process involves adding everything up, expanding brackets, and doing matrix computation, which takes too many steps of calculation. The fraction part requires using Euler's formula and large amounts of brackets to prove that the product of those denominators equals the denominator given in the original equation. Once one little mistake is made, the next...
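The specific integrals treated in the paper are not reproduced in the extract above, so as a generic illustration of the traditional partial-fraction route being contrasted here:

```latex
% Decompose the integrand into partial fractions, then integrate term by term.
\[
  \int \frac{1}{(x-1)(x+2)}\,dx
  \;=\; \int \left( \frac{1/3}{x-1} - \frac{1/3}{x+2} \right) dx
  \;=\; \frac{1}{3}\ln\left|\frac{x-1}{x+2}\right| + C .
\]
```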
Recent studies have shown that dual encoder models trained with the sentence-level translation ranking task are effective methods for cross-lingual sentence embedding. However, our research indicates that token-level alignment is also crucial in multilingual scenarios, which has not been fully explored previously. Based on our findings, we propose a dual-alignment pre-training (DAP) framework for cross-lingual sentence embedding that incorporates both sentence-level and token-level alignment. To achieve this, we introduce a novel representation translation learning (RTL)...
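For context on the sentence-level part only, here is a minimal sketch of the translation ranking objective dual encoders are typically trained with: within a batch of parallel pairs, each source embedding should rank its own translation above the other targets. The embeddings are random placeholders, and the token-level RTL objective from the paper is not sketched.

```python
import numpy as np

rng = np.random.default_rng(0)
B, d = 8, 32
src = rng.normal(size=(B, d))
tgt = src + 0.1 * rng.normal(size=(B, d))     # pretend-aligned translations

def l2_normalise(x):
    return x / np.linalg.norm(x, axis=-1, keepdims=True)

sim = l2_normalise(src) @ l2_normalise(tgt).T            # (B, B) similarity matrix
log_prob = sim - np.log(np.exp(sim).sum(axis=1, keepdims=True))
ranking_loss = -np.mean(np.diag(log_prob))               # InfoNCE-style ranking loss
print(float(ranking_loss))
```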