NFDI4DS | UHH-SEMS - Publication Details

Jianyuan Wang

ORCID: 0000-0002-6467-4018

A5102021424

Research Areas

Speech and Audio Processing
Music and Audio Processing
Hearing Loss and Rehabilitation
Advanced Vision and Imaging
Digital Media Forensic Detection
Multilevel Inverters and Converters
Advanced Algorithms and Applications
Environmental remediation with nanomaterials
Advanced Sensor and Control Systems
Recycling and Waste Management Techniques
Toxic Organic Pollutants Impact
Robotics and Sensor-Based Localization
Advanced Neural Network Applications
Industrial Automation and Control Systems
Domain Adaptation and Few-Shot Learning
Advancements in Semiconductor Devices and Circuit Design
Electrical Fault Detection and Protection
Microbial bioremediation and biosurfactants
Computer Graphics and Visualization Techniques
Energy Load and Power Forecasting
3D Shape Modeling and Analysis
Higher Education and Teaching Methods
Occupational Health and Safety Research
Synthesis and biological activity
Infrared Target Detection Methodologies

Xi'an University of Technology
2004-2025

University of Oxford
2023-2024

Northeast Electric Power University
2012-2023

Oxford Research Group
2023

Australian National University
2023

Xiamen University
2020-2022

Chinese Research Academy of Environmental Sciences
2020-2022

Group Sense (China)
2022

Learning Audio-Visual Source Localization via False Negative Aware Contrastive Learning

OPENALEX - Publications

Weixuan Sun Jiayi Zhang Jianyuan Wang Zheyuan Liu Yiran Zhong and 4 more

Self-supervised audio-visual source localization aims to locate sound-source objects in video frames without extra annotations. Recent methods often approach this goal with the help of contrastive learning, which assumes that only the audio and visual contents from the same video are positive samples for each other. However, this assumption would suffer from false negative samples in real-world training. For example, for an audio sample, treating the frames from the same audio class as negative samples may mislead the model and therefore harm the learned representations (e.g., a siren wailing...

10.1109/cvpr52729.2023.00621 article EN 2023 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) 2023-06-01

PoseDiffusion: Solving Pose Estimation via Diffusion-aided Bundle Adjustment

OPENALEX - Publications

Jianyuan Wang Christian Rupprecht David Novotný

Camera pose estimation is a long-standing computer vision problem that to date often relies on classical methods, such as handcrafted keypoint matching, RANSAC and bundle adjustment. In this paper, we propose to formulate the Structure from Motion (SfM) problem inside a probabilistic diffusion framework, modelling the conditional distribution of camera poses given input images. This novel view of an old problem has several advantages. (i) The nature of the diffusion framework mirrors the iterative procedure of bundle adjustment. (ii) The formulation allows a seamless...

10.1109/iccv51070.2023.00896 article EN 2023 IEEE/CVF International Conference on Computer Vision (ICCV) 2023-10-01

MUNet: Motion uncertainty-aware semi-supervised video object segmentation

OPENALEX - Publications

Jiadai Sun Yuxin Mao Yuchao Dai Yiran Zhong Jianyuan Wang

10.1016/j.patcog.2023.109399 article EN Pattern Recognition 2023-02-07

Audio-Visual Segmentation with Semantics

OPENALEX - Publications

Jinxing Zhou Xuyang Shen Jianyuan Wang Jiayi Zhang Weixuan Sun and 6 more

10.1007/s11263-024-02261-x article EN International Journal of Computer Vision 2024-10-15

Vicinity Vision Transformer

OPENALEX - Publications

Weixuan Sun Zhen Qin Hui Deng Jianyuan Wang Yi Zhang and 5 more

Vision transformers have shown great success on numerous computer vision tasks. However, their central component, softmax attention, prohibits vision transformers from scaling up to high-resolution images, due to both the computational complexity and the memory footprint being quadratic. Linear attention, which reorders the self-attention mechanism, was introduced in natural language processing (NLP) to mitigate a similar issue, but directly applying existing linear attention to vision may not lead to satisfactory results. We investigate this...

10.1109/tpami.2023.3285569 article EN IEEE Transactions on Pattern Analysis and Machine Intelligence 2023-06-13
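
The abstract above says that linear attention "reorders" the self-attention computation. The following is a minimal sketch of that general idea only, not the Vicinity Vision Transformer method itself. It assumes a kernelised form of attention with the feature map elu(x) + 1, which is an illustrative choice; all function names are hypothetical. The sketch computes the same output in two orders: one builds an N x N score matrix, the other first contracts keys with values and never builds it.

```python
import math


def feature_map(x):
    """elu(x) + 1, a positive feature map (an illustrative assumption)."""
    return x + 1.0 if x > 0 else math.exp(x)


def apply_map(m):
    """Apply the feature map to every entry of a matrix (list of rows)."""
    return [[feature_map(x) for x in row] for row in m]


def matmul(a, b):
    """Plain matrix product of two matrices given as lists of rows."""
    return [[sum(x * y for x, y in zip(row, col)) for col in zip(*b)] for row in a]


def transpose(m):
    return [list(col) for col in zip(*m)]


def attention_quadratic(q, k, v):
    """(phi(Q) phi(K)^T) V with row normalisation: builds an N x N matrix."""
    fq, fk = apply_map(q), apply_map(k)
    scores = matmul(fq, transpose(fk))      # N x N
    out = matmul(scores, v)                 # N x d_v
    return [[o / sum(s) for o in o_row] for s, o_row in zip(scores, out)]


def attention_linear(q, k, v):
    """phi(Q) (phi(K)^T V): same result without the N x N matrix."""
    fq, fk = apply_map(q), apply_map(k)
    kv = matmul(transpose(fk), v)           # d x d_v
    k_sum = [sum(col) for col in zip(*fk)]  # length d, used for normalisation
    out = matmul(fq, kv)                    # N x d_v
    norms = [sum(x * y for x, y in zip(row, k_sum)) for row in fq]
    return [[o / n for o in o_row] for o_row, n in zip(out, norms)]


# Toy inputs: N = 3 tokens, d = 2 features.
Q = [[0.1, -0.4], [1.2, 0.3], [-0.7, 0.5]]
K = [[0.9, 0.2], [-0.3, 0.8], [0.4, -1.1]]
V = [[1.0, 0.0], [0.0, 1.0], [2.0, -1.0]]

slow = attention_quadratic(Q, K, V)
fast = attention_linear(Q, K, V)
```

Both orders agree up to floating-point error because matrix multiplication is associative; only the second avoids cost that grows quadratically with the number of tokens.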

Audio-Visual Segmentation with Semantics

OPENALEX - Publications

Jinxing Zhou Xuyang Shen Jianyuan Wang Jiayi Zhang Weixuan Sun and 6 more

We propose a new problem called audio-visual segmentation (AVS), in which the goal is to output a pixel-level map of the object(s) that produce sound at the time of the image frame. To facilitate this research, we construct the first audio-visual segmentation benchmark, i.e., AVSBench, providing pixel-wise annotations for sounding objects in audible videos. It contains three subsets: AVSBench-object (Single-source subset, Multi-sources subset) and AVSBench-semantic (Semantic-labels subset). Accordingly, three settings are studied: 1)...

10.48550/arxiv.2301.13190 preprint EN other-oa arXiv (Cornell University) 2023-01-01

Research on double transistor open circuit fault diagnosis of T‐type three level rectifier based on mixed logical dynamical model

OPENALEX - Publications

Jianyuan Wang Yuxiang Liu Dongsheng Yuan Cong Liu Kai Zuo

The T‐type three‐level rectifier has garnered significant attention due to its ability to enhance the voltage waveform quality in power systems and reduce electromagnetic interference with other equipment. To ensure the high reliability of high‐power wind and photovoltaic generation systems, conducting fault diagnosis for these rectifiers is crucial. This paper first analyzes the input current characteristics of both in‐phase and out‐of‐phase dual transistor open‐circuit faults. A current‐extended observer...

10.1049/elp2.12536 article EN cc-by-nc-nd IET Electric Power Applications 2025-01-01

FLARE: Feed-forward Geometry, Appearance and Camera Estimation from Uncalibrated Sparse Views

OPENALEX - Publications

Shangzhan Zhang Jianyuan Wang Yinghao Xu Nan Xue Christian Rupprecht and 3 more

We present FLARE, a feed-forward model designed to infer high-quality camera poses and 3D geometry from uncalibrated sparse-view images (i.e., as few as 2-8 inputs), which is a challenging yet practical setting in real-world applications. Our solution features a cascaded learning paradigm with camera pose serving as the critical bridge, recognizing its essential role in mapping 3D structures onto 2D image planes. Concretely, FLARE starts with camera pose estimation, whose results condition the subsequent learning of geometric structure...

10.48550/arxiv.2502.12138 preprint EN arXiv (Cornell University) 2025-02-17

Enhancing the interface stability of Li1.3Al0.3Ti1.7(PO4)3 and lithium metal by amorphous Li1.5Al0.5Ge1.5(PO4)3 modification

OPENALEX - Publications

Lianchuan Li Ziqi Zhang Linshan Luo Run You Jinlong Jiao and 5 more

10.1007/s11581-020-03503-x article EN Ionics 2020-05-16

Pathways and influential factors study on the formation of PBDD/Fs during co-processing BDE-209 in cement kiln simulation system

OPENALEX - Publications

Jinzhong Yang Haibin Yu Zhen Xie Yufei Yang Xiaoyan Zheng and 4 more

10.1016/j.ecoenv.2020.110246 article EN Ecotoxicology and Environmental Safety 2020-02-03

Spatial Steerability of GANs via Self-Supervision from Discriminator

OPENALEX - Publications

Jianyuan Wang Lalit Bhagat Ceyuan Yang Yinghao Xu Yujun Shen and 2 more

Generative models have made huge progress in photorealistic image synthesis in recent years. To enable humans to steer the image generation process and customize the output, many works explore the interpretable dimensions of the latent space in GANs. Existing methods edit the attributes of the output image, such as orientation or color scheme, by varying the latent code along certain directions. However, these methods usually require additional human annotations for each pretrained model, and they mostly focus on editing global attributes. In this work, we...

10.1109/tpami.2024.3422820 article EN IEEE Transactions on Pattern Analysis and Machine Intelligence 2024-07-03

Occurrence and formation pathways analysis of PBDD/Fs from 2,4,6-tribromophenol under thermal reaction conditions

OPENALEX - Publications

Qingqi Die Jinzhong Yang Jianyuan Wang Jian Wang Yufei Yang and 2 more

Polybrominated dibenzo-p-dioxins and dibenzofurans (PBDD/Fs) are highly toxic persistent compounds that have attracted wide public attention. Bromophenols are important precursors for forming PBDD/Fs, and their reaction path has always been a research hotspot. In this study, the formation characteristics of PBDD/Fs from 2,4,6-TBP were studied. The yields of 2,3,7,8-substituted PBDD/Fs and 2,4,6,8-TBDF in different thermal products ranged from 0.067 to 10.3 ng/g and from 0.207 to 9.68 ng/g, respectively. The effects of adding Cu, Fe, and Sb2O3 were investigated...

10.1016/j.ecoenv.2022.113449 article EN cc-by-nc-nd Ecotoxicology and Environmental Safety 2022-03-28

A series fault arc detection method based on denoising autoencoder and deep residual network

OPENALEX - Publications

Jianyuan Wang Xue Li Yuhui Zhang

Existing series arc fault identification methods use features such as the time-frequency domain of the current signal as the basis for identification, which results in relatively limited detection solutions, while directly extracting features using deep learning algorithms leads to insufficient feature extraction. To address these problems, a new series arc fault detection method based on a denoising autoencoder (DAE) and a deep residual network (ResNet) is proposed. First, a large number of training samples are obtained through sliding window and data normalization methods,...

10.3389/fenrg.2024.1341281 article EN cc-by Frontiers in Energy Research 2024-03-14

Green synthesis and antitumor activity of (E)-diethyl 2-styrylquinoline-3,4-dicarboxylates

OPENALEX - Publications

Hong Zhang Jianyuan Wang Cheng Li Di Zhao T. Liang and 1 more

Application of an environmentally benign and non-toxic eutectic mixture DMU/LTA for the green synthesis of (E)-diethyl 2-arylvinylquinoline-3,4-dicarboxylates is described. A preliminary antitumor evaluation was then carried out.

10.1039/d4ra04588b article EN cc-by-nc RSC Advances 2024-01-01

PartGen: Part-level 3D Generation and Reconstruction with Multi-View Diffusion Models

OPENALEX - Publications

Minghao Chen Роман Шаповалов Iro Laina Tom Monnier Jianyuan Wang and 2 more

Text- or image-to-3D generators and 3D scanners can now produce 3D assets with high-quality shapes and textures. These assets typically consist of a single, fused representation, like an implicit neural field, a Gaussian mixture, or a mesh, without any useful structure. However, most applications and creative workflows require assets to be made of several meaningful parts that can be manipulated independently. To address this gap, we introduce PartGen, a novel approach that generates 3D objects composed of meaningful parts starting from text, an image, or an unstructured...

10.48550/arxiv.2412.18608 preprint EN arXiv (Cornell University) 2024-12-24

Non-intrusive load identification method based on GAF and RAN networks

OPENALEX - Publications

Jianyuan Wang Yibo Sun

Non-intrusive load identification can improve the interaction efficiency between the power supply side and the user side of the grid. Applying this technology to alleviate the problem of energy shortage is a key technique for achieving efficient management on the user side. In response to the cumbersome process of manually selecting features and the low accuracy of traditional machine learning algorithms in non-intrusive load identification, this paper proposes a method that transforms the one-dimensional reactive electric signal into a two-dimensional image...

10.3389/fenrg.2023.1330690 article EN cc-by Frontiers in Energy Research 2023-12-29
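
The abstract above describes turning a one-dimensional signal into a two-dimensional image, and the title names GAF. As a minimal sketch, assuming GAF here means the Gramian angular field and assuming its summation variant (the truncated abstract does not say which variant or which preprocessing the authors use), the transform can be written as follows. The function name and the toy input are hypothetical.

```python
import math


def gramian_angular_summation_field(series):
    """Encode a 1-D series as a 2-D matrix with entries cos(phi_i + phi_j)."""
    lo, hi = min(series), max(series)
    if hi == lo:
        raise ValueError("series must not be constant")
    # Rescale to [-1, 1] so that arccos is defined for every sample.
    scaled = [(2 * x - hi - lo) / (hi - lo) for x in series]
    # Clip to guard against floating-point values slightly outside [-1, 1].
    scaled = [max(-1.0, min(1.0, s)) for s in scaled]
    # Polar encoding: each sample becomes an angle.
    angles = [math.acos(s) for s in scaled]
    # Pairwise sums of angles give a symmetric matrix.
    return [[math.cos(a + b) for b in angles] for a in angles]


# Toy input standing in for a sampled electrical signal.
image = gramian_angular_summation_field([0.0, 1.0, 2.0])
```

The resulting matrix is symmetric and keeps temporal order along its diagonal, which is why such encodings are commonly paired with image classifiers.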

The Research of Feature Extraction Methods in the Tomatoes Detection

OPENALEX - Publications

Xiaoliang Wang Xiaohui Guan Jianyuan Wang Yang Yong-qing Haiyan Liu

Feature selection can reduce the dimension of the feature space and improve recognition. To discriminate the degree of freshness of tomatoes with an electronic nose, three kinds of feature extraction methods were used and compared: sheath coefficient characteristics, similitude entropy characteristics, and energy characteristics. The results showed that [...] has its advantages in electronic nose detection.

10.1109/imccc.2012.170 article EN 2012-12-01

Linear Video Transformer with Feature Fixation

OPENALEX - Publications

Kaiyue Lu Zexiang Liu Jianyuan Wang Weixuan Sun Zhen Qin and 6 more

Vision Transformers have achieved impressive performance in video classification, while suffering from the quadratic complexity caused by the Softmax attention mechanism. Some studies alleviate the computational costs by reducing the number of tokens in the attention calculation, but the complexity is still quadratic. Another promising way is to replace Softmax attention with linear attention, which has linear complexity but presents a clear performance drop. We find that such a drop results from the lack of attention concentration on critical features. Therefore, we propose a feature fixation module to reweight...

10.48550/arxiv.2210.08164 preprint EN cc-by arXiv (Cornell University) 2022-01-01

A New Optimal Space-Vector Modulation Technique for Three-Phase Voltage Source Inverters

OPENALEX - Publications

Shaoliang An Jianyuan Wang Xiangdong Sun Yanru Zhong

In this paper, a new optimal space-vector pulse-width modulation (SVPWM) technique is presented for three-phase voltage source inverters. The 6 sectors of SVPWM are redivided into 12, and, combined with the local over-modulation method, discontinuous SVPWM strategies called DSVPWMx, including DSVPWMP, DSVPWMN, DSVPWMPN1 and DSVPWMPN3, are proposed. The principle of DSVPWMx is developed, and the essential relations among the different strategies are discussed. The simulation and experimental results verify that the technique is correct and feasible.

10.1109/appeec.2012.6306870 article EN 2012-03-01

Learning Audio-Visual Source Localization via False Negative Aware Contrastive Learning

OPENALEX - Publications

Weixuan Sun Jiayi Zhang Jianyuan Wang Zheyuan Liu Yiran Zhong and 4 more

10.48550/arxiv.2303.11302 preprint EN cc-by arXiv (Cornell University) 2023-01-01

Effects of increasing chlorine concentration in feedstock on the emission and distribution characteristic of dioxins in circular fluidized bed boiler

OPENALEX - Publications

Changhao Cui Meijia Liu Li Li Dahai Yan Chao Chen and 3 more

Field studies were conducted on the emission and distribution characteristics of dioxins by elevating the chlorine concentration in the feedstock of a 600 MW circular fluidized bed (CFB) boiler. The total equivalent quantity of polychlorinated dibenzo-p-dioxins and dibenzofurans (PCDD/Fs) in all flue gas, electrostatic ash, cloth bag ash and boiler samples under the blank condition (i.e., the feedstock was normal coal) and the chlorine labelling condition (i.e., the feedstock was coal mixed with a chlorine-containing agent) was analyzed. Results illustrated that...

10.21203/rs.3.rs-1667662/v1 preprint EN cc-by Research Square (Research Square) 2022-06-27