NFDI4DS | UHH-SEMS - Publication Details

Jiayao Sun

ORCID: 0009-0007-1002-6879

Publications

Citations

Views

---

Saved

---

About

Contact & Profiles

A5032074032

Research Areas

Speech and Audio Processing
Speech Recognition and Synthesis
Music and Audio Processing
Advanced Adaptive Filtering Techniques
Advanced Fiber Optic Sensors
Luminescence Properties of Advanced Materials
Luminescence and Fluorescent Materials
Microfluidic and Bio-sensing Technologies
Orbital Angular Momentum in Optics
Optical Coherence Tomography Applications
Photonic and Optical Devices
Video Analysis and Summarization
Molecular Sensors and Ion Detection
Near-Field Optical Microscopy
X-ray Diffraction in Crystallography
Organic Light-Emitting Diodes Research
Optical and Acousto-Optic Technologies
Electronic and Structural Properties of Oxides
Nanoplatforms for cancer theranostics
Mechanical and Optical Resonators
Mechanical stress and fatigue analysis
Advanced Data Compression Techniques
Aerodynamics and Fluid Dynamics Research
Structural Load-Bearing Analysis
Advanced Optical Network Technologies

Northwestern Polytechnical University
2022-2024

Northeast Petroleum University
2023-2024

Beijing University of Chemical Technology
2021-2022

Soochow University
2021

Zhangjiagang First People's Hospital
2021

Jiangsu University
2012-2014

Lanzhou University
2012

S-DCCRN: Super Wide Band DCCRN with Learnable Complex Feature for Speech Enhancement

OPENALEX - Publications

Shubo Lv Yihui Fu Mengtao Xing Jiayao Sun Lei Xie and 3 more

In speech enhancement, complex neural network has shown promising performance due to their effectiveness in processing complex-valued spectrum. Most of the recent enhancement approaches mainly focus on wide-band signal with a sampling rate 16K Hz. However, research super wide band (e.g., 32K Hz) or even full-band (48K) denoising using deep learning is still its infancy difficulty modeling more frequency bands and particularly high components. this paper, we extend our previous convolution...

10.1109/icassp43922.2022.9747029 article EN ICASSP 2022 - 2022 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) 2022-04-27

Carbazole&benzoindole-based purely organic phosphors: a comprehensive phosphorescence mechanism, tunable lifetime and an advanced encryption system

OPENALEX - Publications

Chen Qian Zhimin Ma Bingxin Yang Xianjiang Li Jiayao Sun and 5 more

The same molecule synthesized from different carbazoles may show various properties, which originate the trace isomer in purchased carbazole. By changing content of isomers, phosphorescence lifetime can be quantitatively adjusted.

10.1039/d1tc03020e article EN Journal of Materials Chemistry C 2021-01-01

Multi-Task Deep Residual Echo Suppression with Echo-Aware Loss

OPENALEX - Publications

Shimin Zhang Ziteng Wang Jiayao Sun Yihui Fu Biao Tian and 2 more

This paper introduces the NWPU Team's entry to ICASSP 2022 AEC Challenge. We take a hybrid approach that cascades linear with neural post-filter. The former is used deal echo components while latter suppresses residual non-linear components. use gated convolutional F-T-LSTM network (GFTNN) as backbone and shape post-filter by multi-task learning (MTL) framework, where voice activity detection (VAD) module adopted an auxiliary task along suppression, aim avoid over suppression may cause...

10.1109/icassp43922.2022.9746733 article EN ICASSP 2022 - 2022 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) 2022-04-27

ICMC-ASR: The ICASSP 2024 In-Car Multi-Channel Automatic Speech Recognition Challenge

OPENALEX - Publications

Wang He Pengcheng Guo Yue Li Ao Zhang Jiayao Sun and 11 more

10.1109/icasspw62465.2024.10627712 article EN 2024-04-14

Compact single fiber optical tweezer–micropipette system for completely noninvasive cell sorting

OPENALEX - Publications

Yunkai Wang Lu Yan Yongqiang Sun Taiji Dong Yekun Zhou and 4 more

Bridging optical tweezers and microfluidics can form a multifunctional platform, which overcome the difficulties of precise manipulation in hydrodynamic flow noninvasive method. However, when integrated into microfluidic chip, fiber optic tweezer loses its flexibility. Here, we propose compact single tweezer–micropipette system. It sort particles by differences shape refractive index completely way while retaining flexibility, high selectivity, precision tweezer. Compact channels are formed...

10.1063/5.0139071 article EN Applied Physics Letters 2023-06-05

The synthesis and afterglow luminescence properties of a novel red afterglow phosphor: SnO2:Sm3+,Zr4+

OPENALEX - Publications

Jiachi Zhang Xinlong Ma Qingsong Qin Liurong Shi Jiayao Sun and 3 more

10.1016/j.matchemphys.2012.08.033 article EN Materials Chemistry and Physics 2012-09-04

Compressive Behavior of Stainless Steel–Concrete–Carbon Steel Double-Skin Tubular (SCCDST) Members Subjected to External Hydraulic Pressure

OPENALEX - Publications

Jiantao Wang Yang Kai-lin Jiayao Sun

The new-type stainless steel–concrete–carbon steel double-skin tubular (SCCDST) members, characterized by their exceptional corrosion resistance and mechanical bearing capacity, have promising applications in ocean engineering, particularly deep-water engineering. external hydraulic pressure interfacial action of various materials intensify the complexity composite performance SCCDST members. This paper describes an analytical investigation on concentric compressive members under pressure....

10.3390/jmse12030406 article EN cc-by Journal of Marine Science and Engineering 2024-02-26

A host-guest organic afterglow system with significant guest induced enhancement of phosphorescence

OPENALEX - Publications

Jiayao Sun Chen Qian Zhimin Ma Shitao Wang Zhiyong Ma

10.1016/j.dyepig.2022.110196 article EN Dyes and Pigments 2022-02-25

Multi-Task Sub-Band Network For Deep Residual Echo Suppression

OPENALEX - Publications

Jiayao Sun Dawei Luo Zhaoxia Li Jindong Li Yukai Ju and 1 more

This paper introduces the SWANT team’s entry to ICASSP 2023 AEC Challenge. We submit a system that cascades linear filter with neural post-filter. Particularly, we adopt sub-band processing handle full-band signals and shape network multi-task learning, where dual signal voice activity detection (DSVAD) echo estimation are adopted as auxiliary tasks. Moreover, particularly improve time frequency convolution module (TFCM) increase receptive field using small kernels. Finally, our has ranked...

10.1109/icassp49357.2023.10095137 article EN ICASSP 2022 - 2022 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) 2023-05-05

Highly sensitive Fabry-Perot acoustic sensor based on optic fiber spherical end surface

OPENALEX - Publications

Jiayao Sun Lun Yan Chunlei Jiang Yunkai Wang Yan Lu and 3 more

10.1016/j.yofte.2023.103440 article EN Optical Fiber Technology 2023-07-20

Micro-Nano Fiber Flexible Multimodal Sensors for Fingerprint Recognition

OPENALEX - Publications

Yunkai Wang Xianli Yu Chunlei Jiang Tao Wang Taiji Dong and 5 more

Multimodal biometric sensing and processing systems can significantly improve the success rate of identification authentication compared to traditional unimodal techniques. We propose a flexible micro-nano fiber (MNF) multimodal sensor for fingerprint recognition. used polydimethylsiloxane (PDMS) as substrate placed MNF in s-shape on PDMS. The surface PDMS is covered with film thickness only 2 <inline-formula xmlns:mml="http://www.w3.org/1998/Math/MathML"...

10.1109/jsen.2023.3347201 article EN IEEE Sensors Journal 2024-01-03

An Audio-Quality-Based Multi-Strategy Approach For Target Speaker Extraction in the Misp 2023 Challenge

OPENALEX - Publications

Runduo Han Xiaopeng Yan Weiming Xu Pengcheng Guo Jiayao Sun and 4 more

10.1109/icasspw62465.2024.10627638 article EN 2024-04-14

Up-Conversion Photostimulated Luminescence of Mg 2 SnO 4 for Optical Storage

OPENALEX - Publications

Jiachi Zhang Qingsong Qin Minghui Yu Jiayao Sun Liurong Shi and 1 more

We report the first observation of up-conversion photostimulated luminescence in non-doped Mg2SnO4. Stimulated by 980 nm infrared laser (reading) after ultraviolet irradiation (writing), phosphor shows emission band covering 470–550 nm, which is due to recombination F centers with holes. After ceasing irradiation, storage intensity would rapidly decrease 59% its original 2.5 h and then not degrade anymore. It suggested that Mg2SnO4 has potential applications for optical storage. Accordingly,...

10.1088/0256-307x/28/2/027802 article EN Chinese Physics Letters 2011-02-01

Listening to the Underwater Acoustic Based on Fiber-Optic Tweezers Technology

OPENALEX - Publications

Jiayao Sun Lun Yan Chunlei Jiang Yunkai Wang Tao Wang and 5 more

10.1109/jsen.2024.3361175 article EN IEEE Sensors Journal 2024-02-16

A cascade splicing-based multimode fiber-tapered single-mode fiber structure for pressure sensing

OPENALEX - Publications

Yang Zhang Bingkun Gao Chunlei Jiang Yunkai Wang Taiji Dong and 5 more

10.1016/j.yofte.2023.103549 article EN Optical Fiber Technology 2023-10-20

A semi-supervised incremental learning method based on adaptive probabilistic hypergraph for video semantic detection

OPENALEX - Publications

Yongzhao Zhan Jiayao Sun Dejiao Niu Qirong Mao Jianping Fan

10.1007/s11042-014-1866-9 article EN Multimedia Tools and Applications 2014-01-23

U-shaped optical microfiber-based liquid viscosity measurement

OPENALEX - Publications

Dong Li Xiufang Wang Yunkai Wang Chunlei Jiang Taiji Dong and 3 more

10.1016/j.yofte.2023.103502 article EN Optical Fiber Technology 2023-09-01

S-DCCRN: Super Wide Band DCCRN with learnable complex feature for speech enhancement

OPENALEX - Publications

Shubo Lv Yihui Fu Mengtao Xing Jiayao Sun Lei Xie and 3 more

In speech enhancement, complex neural network has shown promising performance due to their effectiveness in processing complex-valued spectrum. Most of the recent enhancement approaches mainly focus on wide-band signal with a sampling rate 16K Hz. However, research super wide band (e.g., 32K Hz) or even full-band (48K) denoising is still lacked difficulty modeling more frequency bands and particularly high components. this paper, we extend our previous deep convolution recurrent (DCCRN)...

10.48550/arxiv.2111.08387 preprint EN cc-by arXiv (Cornell University) 2021-01-01

ICMC-ASR: The ICASSP 2024 In-Car Multi-Channel Automatic Speech Recognition Challenge

OPENALEX - Publications

He Wang Pengcheng Guo Yue Li Ao Zhang Jiayao Sun and 11 more

To promote speech processing and recognition research in driving scenarios, we build on the success of Intelligent Cockpit Speech Recognition Challenge (ICSRC) held at ISCSLP 2022 launch ICASSP 2024 In-Car Multi-Channel Automatic (ICMC-ASR) Challenge. This challenge collects over 100 hours multi-channel data recorded inside a new energy vehicle 40 noise for augmentation. Two tracks, including automatic (ASR) diarization (ASDR) are set up, using character error rate (CER) concatenated minimum...

10.48550/arxiv.2401.03473 preprint EN other-oa arXiv (Cornell University) 2024-01-01

An audio-quality-based multi-strategy approach for target speaker extraction in the MISP 2023 Challenge

OPENALEX - Publications

Runduo Han Xiaopeng Yan Weiming Xu Pengcheng Guo Jiayao Sun and 4 more

This paper describes our audio-quality-based multi-strategy approach for the audio-visual target speaker extraction (AVTSE) task in Multi-modal Information based Speech Processing (MISP) 2023 Challenge. Specifically, adopts different strategies on audio quality, striking a balance between interference removal and speech preservation, which benifits back-end automatic recognition (ASR) systems. Experiments show that achieves character error rate (CER) of 24.2% 33.2% Dev Eval set,...

10.48550/arxiv.2401.03697 preprint EN other-oa arXiv (Cornell University) 2024-01-01

BS-PLCNet: Band-split Packet Loss Concealment Network with Multi-task Learning Framework and Multi-discriminators

OPENALEX - Publications

Zihan Zhang Jiayao Sun Xianjun Xia Chuanzeng Huang Yijian Xiao and 1 more

Packet loss is a common and unavoidable problem in voice over internet phone (VoIP) systems. To deal with the problem, we propose band-split packet concealment network (BS-PLCNet). Specifically, split full-band signal into wide-band (0-8kHz) high-band (8-24kHz). The signals are processed by gated convolutional recurrent (GCRN), while counterpart simple GRU network. ensure high speech quality automatic recognition (ASR) compatibility, multi-task learning (MTL) framework including fundamental...

10.48550/arxiv.2401.03687 preprint EN other-oa arXiv (Cornell University) 2024-01-01

Fiber-Optic Probes for Ring-Shaped Multiparticle Capture

OPENALEX - Publications

Tao Wang Linzhi Yao Bingkun Gao Chunlei Jiang Yunkai Wang and 5 more

Conventional single-fiber optical tweezers usually capture particles at the front or side end of tip a fiber probe, thus enabling manipulation in limited range. In this paper, we design and fabricate novel which is prepared by integrating common single-mode (SMF) silica capillary microtubular (COF), excitation LP <sub xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink">21</sub> higher-order modes fiber-optic probes, its output light field has multiple...

10.1109/jsen.2023.3345372 article EN IEEE Sensors Journal 2024-01-17

Bs-Plcnet: Band-Split Packet Loss Concealment Network with Multi-Task Learning Framework and Multi-Discriminators

OPENALEX - Publications

Zihan Zhang Jiayao Sun Xianjun Xia Chuanzeng Huang Yijian Xiao and 1 more

10.1109/icasspw62465.2024.10627343 article EN 2024-04-14

DualSep: A Light-weight dual-encoder convolutional recurrent network for real-time in-car speech separation

OPENALEX - Publications

Ziqian Wang Jiayao Sun Zihan Zhang Xingchen Li Jie Liu and 1 more

Advancements in deep learning and voice-activated technologies have driven the development of human-vehicle interaction. Distributed microphone arrays are widely used in-car scenarios because they can accurately capture voices passengers from different speech zones. However, increase number audio channels, coupled with limited computational resources low latency requirements systems, presents challenges for multi-channel separation. To migrate problems, we propose a lightweight framework...

10.48550/arxiv.2409.08610 preprint EN arXiv (Cornell University) 2024-09-13

Dualsep: A Light-Weight Dual-Encoder Convolutional Recurrent Network For Real-Time In-Car Speech Separation

OPENALEX - Publications

Ziqian Wang Jiayao Sun Zihan Zhang Xingchen Li Jie Liu and 1 more

10.1109/slt61566.2024.10832223 article EN 2022 IEEE Spoken Language Technology Workshop (SLT) 2024-12-02

Coming Soon ...