NFDI4DS | UHH-SEMS - Publication Details

Rui Zhao

ORCID: 0000-0003-2993-2023

Publications

Citations

Views

---

Saved

---

About

Contact & Profiles

A5100684043

Research Areas

Advanced Neural Network Applications
Domain Adaptation and Few-Shot Learning
Topic Modeling
Multimodal Machine Learning Applications
Human Pose and Action Recognition
Natural Language Processing Techniques
Data Quality and Management
Speech Recognition and Synthesis
Reinforcement Learning in Robotics
Video Surveillance and Tracking Methods
Digital Innovation in Industries
Scientific Computing and Data Management
Generative Adversarial Networks and Image Synthesis
Face recognition and analysis
Advanced Image and Video Retrieval Techniques
Business Process Modeling and Analysis
Text Readability and Simplification
Linguistic research and analysis
COVID-19 diagnosis using AI
Robot Manipulation and Learning
Speech and Audio Processing
Music and Audio Processing
Human Motion and Animation
Educational Technology and Pedagogy
3D Shape Modeling and Analysis

First Affiliated Hospital of Jiangxi Medical College
2025

Nanchang University
2025

Sir Run Run Shaw Hospital
2024

Zhejiang University
2024

Microsoft (United States)
2024

University of Oxford
2024

Tencent (China)
2023-2024

Liaoning Technical University
2024

Guilin University of Technology
2023

Guilin University
2023

QANet: Combining Local Convolution with Global Self-Attention for Reading Comprehension

OPENALEX - Publications

Adams Wei Yu D. Dohan Minh-Thang Luong Rui Zhao Kai Chen and 2 more

Current end-to-end machine reading and question answering (Q\&A) models are primarily based on recurrent neural networks (RNNs) with attention. Despite their success, these often slow for both training inference due to the sequential nature of RNNs. We propose a new Q\&A architecture called QANet, which does not require networks: Its encoder consists exclusively convolution self-attention, where local interactions self-attention global interactions. On SQuAD dataset, our model is 3x 13x...

10.48550/arxiv.1804.09541 preprint EN other-oa arXiv (Cornell University) 2018-01-01

Investigation of a transformer-based hybrid artificial neural networks for climate data prediction and analysis

OPENALEX - Publications

Shangke Liu Ke Liu Zheng Wang Yuanyuan Liu Bin Bai and 1 more

Introduction Climate change isone of the major challenges facing world today, causing frequent extreme weather events that significantly impact human production, life, and ecological environment. Traditional climate prediction models largely rely on simulation physical processes. While they have achieved some success, these still face issues such as complexity, high computational cost, insufficient handling multivariable nonlinear relationships. Methods In light this, this paper proposes a...

10.3389/fenvs.2024.1464241 article EN cc-by Frontiers in Environmental Science 2025-01-22

Skin-interfaced Sweat Monitoring Patch Constructed by Flexible Microfluidic Capillary Pump and Cu-MOF Sensitized Electrochemical Sensor

OPENALEX - Publications

Weizheng Xu Yu Cao Huanhuan Shi Xuanhao Jia Yun Zheng and 3 more

10.1016/j.talanta.2025.127895 article EN Talanta 2025-03-04

HumanBench: Towards General Human-Centric Perception with Projector Assisted Pretraining

OPENALEX - Publications

Shixiang Tang Cheng Chen Qingsong Xie Meilin Chen Yizhou Wang and 7 more

Human-centric perceptions include a variety of vision tasks, which have widespread industrial applications, including surveillance, autonomous driving, and the metaverse. It is desirable to general pretrain model for versatile human-centric downstream tasks. This paper forges ahead along this path from aspects both benchmark pretraining methods. Specifically, we propose HumanBench based on existing datasets comprehensively evaluate common ground generalization abilities different methods 19...

10.1109/cvpr52729.2023.02104 article EN 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) 2023-06-01

The impact of nighttime car body lighting on pedestrians’ distraction: A virtual reality simulation based on bottom-up attention mechanism

OPENALEX - Publications

Xiangwei Yi Rui Zhao Yandan Lin

10.1016/j.ssci.2024.106633 article EN Safety Science 2024-08-30

‘You are you and the app. There’s nobody else.’: Building Worker-Designed Data Institutions within Platform Hegemony

OPENALEX - Publications

Jake M L Stein Vidminas Vizgirda Max Van Kleek Reuben Binns Jun Zhao and 5 more

Information asymmetries create extractive, often harmful relationships between platform workers (e.g., Uber or Deliveroo drivers) and their algorithmic managers. Recent HCI studies have put forward more equitable designs but leave open questions about the social technical infrastructures required to support them without cooperation of platforms. We conducted a participatory design study in which deconstructed re-imagined Uber's schema for driver data. analyzed data structures institutions...

10.1145/3544548.3581114 article EN 2023-04-19

Explore the Power of Synthetic Data on Few-shot Object Detection

OPENALEX - Publications

Shaobo Lin Kun Wang Xingyu Zeng Rui Zhao

Few-shot object detection (FSOD) aims to expand an detector for novel categories given only a few instances training. The training samples restrict the performance of FSOD model. Recent text-to-image generation models have shown promising results in generating high-quality images. How applicable these synthetic images are tasks remains under-explored. This work extensively studies how generated from state-of-the-art generators benefit tasks. We focus on two perspectives: (1) use data FSOD?...

10.1109/cvprw59228.2023.00071 article EN 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops (CVPRW) 2023-06-01

An Effective Crop-Paste Pipeline for Few-shot Object Detection

OPENALEX - Publications

Shaobo Lin Kun Wang Xingyu Zeng Rui Zhao

Few-shot object detection (FSOD) aims to expand an detector for novel categories given only a few instances training. However, detecting with samples usually leads the problem of misclassification. In FSOD, we notice false positive (FP) is prominent, in which base are often recognized as ones. To address this issue, data augmentation pipeline that Crops Novel and Pastes them on selected Base images, called CNPB, proposed. There two key questions be answered: (1) How select useful images? (2)...

10.1109/cvprw59228.2023.00510 article EN 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops (CVPRW) 2023-06-01

Graph Pooling via Dropping Task-Irrelevant Nodes

OPENALEX - Publications

Zhong Cheng Shaofeng Zhang Feng Zhu Rui Zhao Xiaokang Yang and 1 more

10.1109/icassp49660.2025.10889769 article ICASSP 2022 - 2022 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) 2025-03-12

Micropumps and Microvalves for Biomedical Applications

OPENALEX - Publications

Yun Zheng Huanhuan Shi Zhongjian Tan Weizheng Xu Rui Zhao and 4 more

10.1016/j.trac.2025.118236 article EN TrAC Trends in Analytical Chemistry 2025-03-01

Arrangement of High-standard Basic Farmland Construction Based on Village-region Cultivated Land Quality Uniformity

OPENALEX - Publications

Wen Song Kening Wu Huafu Zhao Rui Zhao Ting Li

As an important constitute of land consolidation, high-standard basic farmland construction is means to protect the quantity, quality and ecological environment cultivated land. Its target not only lies in increase but also improvement quality, agricultural production conditions ecosystem environments. In present study, evaluation method arrangement were explored facilitate process decision-making implementation for (HSBFC) with administrative village as unit. Taking comprehensive project...

10.1007/s11769-018-1011-1 article EN Chinese Geographical Science 2018-11-05

Learning from Future: A Novel Self-Training Framework for Semantic Segmentation

OPENALEX - Publications

Ye Du Yujun Shen Haochen Wang Jingjing Fei Wei Li and 4 more

Self-training has shown great potential in semi-supervised learning. Its core idea is to use the model learned on labeled data generate pseudo-labels for unlabeled samples, and turn teach itself. To obtain valid supervision, active attempts typically employ a momentum teacher pseudo-label prediction yet observe confirmation bias issue, where incorrect predictions may provide wrong supervision signals get accumulated training process. The primary cause of such drawback that prevailing...

10.48550/arxiv.2209.06993 preprint EN cc-by arXiv (Cornell University) 2022-01-01

LongFNT: Long-Form Speech Recognition with Factorized Neural Transducer

OPENALEX - Publications

Xun Gong Yu Wu Jinyu Li Shujie Liu Rui Zhao and 2 more

Traditional automatic speech recognition (ASR) systems usually focus on individual utterances, without considering long-form with useful historical information, which is more practical in real scenarios. Simply attending longer transcription history for a vanilla neural transducer model shows no much gain our preliminary experiments, since the prediction network not pure language model. This motivates us to leverage factorized structure, containing model, vocabulary predictor. We propose...

10.1109/icassp49357.2023.10096900 article EN ICASSP 2022 - 2022 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) 2023-05-05

Human Preference Score: Better Aligning Text-to-Image Models with Human Preference

OPENALEX - Publications

Xiaoshi Wu Keqiang Sun Feng Zhu Rui Zhao Hongsheng Li

Recent years have witnessed a rapid growth of deep generative models, with text-to-image models gaining significant attention from the public. However, existing often generate images that do not align well human preferences, such as awkward combinations limbs and facial expressions. To address this issue, we collect dataset choices on generated Stable Foundation Discord channel. Our experiments demonstrate current evaluation metrics for correlate choices. Thus, train preference classifier...

10.48550/arxiv.2303.14420 preprint EN cc-by arXiv (Cornell University) 2023-01-01

Energy-Based Hindsight Experience Prioritization

OPENALEX - Publications

Rui Zhao Volker Tresp

In Hindsight Experience Replay (HER), a reinforcement learning agent is trained by treating whatever it has achieved as virtual goals. However, in previous work, the experience was replayed at random, without considering which episode might be most valuable for learning. this paper, we develop an energy-based framework prioritizing hindsight robotic manipulation tasks. Our approach inspired work-energy principle physics. We define trajectory energy function sum of transition target object...

10.48550/arxiv.1810.01363 preprint EN other-oa arXiv (Cornell University) 2018-01-01

SocialGenPod: Privacy-Friendly Generative AI Social Web Applications with Decentralised Personal Data Stores

OPENALEX - Publications

Vidminas Vizgirda Rui Zhao Naman Goel

We present SocialGenPod, a decentralised and privacy-friendly way of deploying generative AI Web applications. Unlike centralised data architectures that keep user tied to application service providers, we show how one can use Solid - specification decouple from demonstrate SocialGenPod using prototype allows users converse with different Large Language Models, optionally leveraging Retrieval Augmented Generation generate answers grounded in private documents stored any Pod the is allowed...

10.1145/3589335.3651251 preprint EN cc-by 2024-05-12

CEO turnover, political connections, and firm performance: Evidence from China

OPENALEX - Publications

Xiaoyan Liu Rui Zhao Mengmeng Guo

10.1016/j.ememar.2022.100965 article EN Emerging Markets Review 2022-09-17

PUnifiedNER: A Prompting-Based Unified NER System for Diverse Datasets

OPENALEX - Publications

Jinghui Lu Rui Zhao Brian Mac Namee Fei Tan

Much of named entity recognition (NER) research focuses on developing dataset-specific models based data from the domain interest, and a limited set related types. This is frustrating as each new dataset requires model to be trained stored. In this work, we present ``versatile'' model---the Prompting-based Unified NER system (PUnifiedNER)---that works with different domains can recognise up 37 types simultaneously, theoretically it could many possible. By using prompt learning, PUnifiedNER...

10.1609/aaai.v37i11.26564 article EN Proceedings of the AAAI Conference on Artificial Intelligence 2023-06-26

Maximum Entropy Population-Based Training for Zero-Shot Human-AI Coordination

OPENALEX - Publications

Rui Zhao Jinming Song Yufeng Yuan Haifeng Hu Yang Gao and 3 more

We study the problem of training a Reinforcement Learning (RL) agent that is collaborative with humans without using human data. Although such agents can be obtained through self-play training, they suffer significantly from distributional shift when paired unencountered partners, as humans. In this paper, we propose Maximum Entropy Population-based (MEP) to mitigate shift. MEP, in population are trained our derived Population bonus promote pairwise diversity between and individual...

10.1609/aaai.v37i5.25758 article EN Proceedings of the AAAI Conference on Artificial Intelligence 2023-06-26

Zero-Shot Text-to-Parameter Translation for Game Character Auto-Creation

OPENALEX - Publications

Rui Zhao Wei Li Zhipeng Hu Lincheng Li Zhengxia Zou and 2 more

Recent popular Role-Playing Games (RPGs) saw the great success of character auto-creation systems. The bone-drivenface model controlled by continuous parameters (like position bones) and discrete hairstyles) makes it possible for users to personalize customize in-game characters. Previous systems are mostly image-driven, where facial optimized so that rendered looks similar reference face photo. This paper proposes a novel text-to-parameter translation method (T2P) achieve zero-shot...

10.1109/cvpr52729.2023.02013 article EN 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) 2023-06-01

Hulk: A Universal Knowledge Translator for Human-Centric Tasks

OPENALEX - Publications

Yizhou Wang Yixuan Wu Shixiang Tang Weizhen He Xun Guo and 6 more

Human-centric perception tasks, e.g., pedestrian detection, skeleton-based action recognition, and pose estimation, have wide industrial applications, such as metaverse sports analysis. There is a recent surge to develop human-centric foundation models that can benefit broad range of tasks. While many achieved success, they did not explore 3D vision-language tasks for required task-specific finetuning. These limitations restrict their application more downstream situations. To tackle these...

10.48550/arxiv.2312.01697 preprint EN other-oa arXiv (Cornell University) 2023-01-01

Synthesis and biological activity evaluation of novel peroxo-bridged derivatives as potential anti-hepatitis B virus agents

OPENALEX - Publications

Menglu Jia Rui Zhao Bing Xu Wenqiang Yan Fuhao Chu and 7 more

Previous studies have demonstrated that natural steroid compounds containing a peroxide bridge exhibited potential anti-hepatitis B virus activity. To continue our research, simple and regioselective methodology, using Eosin Y as clean photosensitized oxidation catalyst, was developed for the synthesis of in steroids. The method catalyst exposed to visible light furbished high yields, did not involve tedious work-up or purification, avoided environmentally hazardous solvents. It can be...

10.1039/c6md00344c article EN MedChemComm 2016-10-19

What Makes Pre-trained Language Models Better Zero-shot Learners?

OPENALEX - Publications

Jinghui Lu Dongsheng Zhu Weidong Han Rui Zhao Brian Mac Namee and 1 more

Current methods for prompt learning in zero-shot scenarios widely rely on a development set with sufficient human-annotated data to select the best-performing template posteriori. This is not ideal because real-world scenario of practical relevance, no labelled available. Thus, we propose simple yet effective method screening reasonable templates text classification: Perplexity Selection (Perplection). We hypothesize that language discrepancy can be used measure efficacy templates, and...

10.18653/v1/2023.acl-long.128 article EN cc-by 2023-01-01

TPTU-v2: Boosting Task Planning and Tool Usage of Large Language Model-based Agents in Real-world Systems

OPENALEX - Publications

Yilun Kong Jingqing Ruan Yihong Chen Bin Zhang Tianpeng Bao and 7 more

Large Language Models (LLMs) have demonstrated proficiency in addressing tasks that necessitate a combination of task planning and the usage external tools require blend utilization tools, such as APIs. However, real-world complex systems present three prevalent challenges concerning tool usage: (1) The real system usually has vast array APIs, so it is impossible to feed descriptions all APIs prompt LLMs token length limited; (2) designed for handling tasks, base can hardly plan correct...

10.48550/arxiv.2311.11315 preprint EN other-oa arXiv (Cornell University) 2023-01-01

Perennial Semantic Data Terms of Use for Decentralized Web

OPENALEX - Publications

Rui Zhao Jun Zhao

In today's digital landscape, the Web has become increasingly centralized, raising concerns about user privacy violations.Decentralized architectures, such as Solid, offer a promising solution by empowering users with better control over their data in personal 'Pods'.However, significant challenge remains: must navigate numerous applications to decide which application can be trusted access Pods.This often involves reading lengthy and complex Terms of Use agreements, process that find...

10.1145/3589334.3645631 article EN Proceedings of the ACM Web Conference 2022 2024-05-08

Coming Soon ...