NFDI4DS | UHH-SEMS - Publication Details

xTrimoPGLM: Unified 100B-Scale Pre-trained Transformer for Deciphering the Language of Protein

OPENALEX - Publications

Bo Chen Xingyi Cheng Li Pan Yangli‐ao Geng Jing Gong and 10 more

Protein language models have shown remarkable success in learning biological information from protein sequences. However, most existing are limited by either autoencoding or autoregressive pre-training objectives, which makes them struggle to handle understanding and generation tasks concurrently. We propose a unified model, xTrimoPGLM, address these two types of simultaneously through an innovative framework. Our key technical contribution is exploration the compatibility potential for...

10.1101/2023.07.05.547496 preprint EN cc-by-nc-nd bioRxiv (Cold Spring Harbor Laboratory) 2023-07-06

A Coarse-to-Fine Model for Rail Surface Defect Detection

OPENALEX - Publications

Haomin Yu Qingyong Li Yunqiang Tan Jinrui Gan Jianzhu Wang and 2 more

Computer vision systems have attracted much attention in recent years for use detecting surface defects on rails; however, accurate and efficient recognition of possible remains challenging due to the variations shown by also noise. This paper proposes a coarse-to-fine model (CTFM) identify at different scales. The works three scales from coarse fine: subimage level, region pixel level. At background subtraction exploits row consistency longitudinal direction, strongly filters defect-free...

10.1109/tim.2018.2853958 article EN IEEE Transactions on Instrumentation and Measurement 2018-08-02

Exemplar-free class incremental learning via discriminative and comparable parallel one-class classifiers

OPENALEX - Publications

Wenju Sun Qingyong Li Jing Zhang Danyu Wang Wen Wang and 1 more

10.1016/j.patcog.2023.109561 article EN Pattern Recognition 2023-03-29

RecDCL: Dual Contrastive Learning for Recommendation

OPENALEX - Publications

Dan Zhang Yangli‐ao Geng Wenwen Gong Zhongang Qi Zhiyu Chen and 4 more

Self-supervised learning (SSL) has recently achieved great success in mining the user-item interactions for collaborative filtering. As a major paradigm, contrastive (CL) based SSL helps address data sparsity Web platforms by contrasting embeddings between raw and augmented data. However, existing CL-based methods mostly focus on batch-wise way, failing to exploit potential regularity feature dimension. This leads redundant solutions during representation of users items. In this work, we...

10.1145/3589334.3645533 preprint EN arXiv (Cornell University) 2024-01-28

Trajectory-Joint Clustering Algorithm for Time-Varying Channel Modeling

OPENALEX - Publications

Chen Huang Andreas F. Molisch Yangli‐ao Geng Ruisi He Bo Ai and 1 more

Clustering of multipath components (MPCs) is an important aspect propagation channel modeling. When a time series measurements, based on movement transmitter and/or receiver, available, the temporal evolution MPCs can be used as basis for clustering. We present algorithm that bases clustering not only distance in delay/angle space, but also how similar their parameters are. Sample results obtained from vehicle-to-vehicle measurement campaign show good performance proposed algorithm.

10.1109/tvt.2019.2951374 article EN IEEE Transactions on Vehicular Technology 2019-11-04

Optimization research on air quality numerical model forecasting effects based on deep learning methods

OPENALEX - Publications

Wei Wang Xingqin An Qingyong Li Yangli‐ao Geng Haomin Yu and 1 more

10.1016/j.atmosres.2022.106082 article EN Atmospheric Research 2022-02-14

RecDCL: Dual Contrastive Learning for Recommendation

OPENALEX - Publications

Dan Zhang Yangli‐ao Geng Wenwen Gong Zhongang Qi Zhiyu Chen and 4 more

Self-supervised learning (SSL) has recently achieved great success in mining the user-item interactions for collaborative filtering. As a major paradigm, contrastive (CL) based SSL helps address data sparsity Web platforms by contrasting embeddings between raw and augmented data. However, existing CL-based methods mostly focus on batch-wise way, failing to exploit potential regularity feature dimension. This leads redundant solutions during representation of users items. In this work, we...

10.1145/3589334.3645533 article EN Proceedings of the ACM Web Conference 2022 2024-05-08

LightNet

OPENALEX - Publications

Yangli‐ao Geng Qingyong Li Tianyang Lin Lei Jiang Liangtao Xu and 4 more

Lightning as a natural phenomenon poses serious threats to human life, aviation and electrical infrastructures. prediction plays vital role in lightning disaster reduction. Existing methods, usually based on numerical weather models, rely parameterization schemes for forecasting. These however, have two drawbacks. Firstly, simulations of the models deviations space time domains, which introduces irreparable biases subsequent processes. Secondly, are designed manually by experts meteorology,...

10.1145/3292500.3330717 article EN 2019-07-25

A deep learning framework for lightning forecasting with multi‐source spatiotemporal data

OPENALEX - Publications

Yangli‐ao Geng Qingyong Li Tianyang Lin Wen Yao Liangtao Xu and 5 more

Abstract Weather forecasting requires comprehensive analysis of a variety meteorological data. Recent decades have witnessed the advance weather observation and simulation technologies, triggering an explosion data which are collected from multiple sources (e.g., radar, automatic stations numerical prediction) usually characterized by spatiotemporal (ST) structure. As result, adequate exploition these multi‐source ST emerges as promising but challenging topic for forecasting. To address this...

10.1002/qj.4167 article EN Quarterly Journal of the Royal Meteorological Society 2021-09-28

xTrimoPGLM: Unified 100B-Scale Pre-trained Transformer for Deciphering the Language of Protein

OPENALEX - Publications

Bo Chen Xingyi Cheng Pan Li Yangli‐ao Geng Jing Gong and 10 more

Protein language models have shown remarkable success in learning biological information from protein sequences. However, most existing are limited by either autoencoding or autoregressive pre-training objectives, which makes them struggle to handle understanding and generation tasks concurrently. We propose a unified model, xTrimoPGLM, address these two types of simultaneously through an innovative framework. Our key technical contribution is exploration the compatibility potential for...

10.48550/arxiv.2401.06199 preprint EN cc-by-nc-sa arXiv (Cornell University) 2024-01-01

Multi-Condition Remaining Useful Life Prediction Based on Mixture of Encoders

OPENALEX - Publications

Yan Liu Bihe Xu Yangli‐ao Geng

Accurate Remaining Useful Life (RUL) prediction is vital for effective prognostics in and the health management of industrial equipment, particularly under varying operational conditions. Existing approaches to multi-condition RUL often treat each working condition independently, failing effectively exploit cross-condition knowledge. To address this limitation, paper introduces MoEFormer, a novel framework that combines Mixture Encoders (MoE) with Transformer-based architecture achieve...

10.3390/e27010079 article EN cc-by Entropy 2025-01-17

Task Arithmetic in Trust Region: A Training-Free Model Merging Approach to Navigate Knowledge Conflicts

OPENALEX - Publications

Wenju Sun Qingyong Li Wen Wang Yangli‐ao Geng Boyang Li

Multi-task model merging offers an efficient solution for integrating knowledge from multiple fine-tuned models, mitigating the significant computational and storage demands associated with multi-task training. As a key technique in this field, Task Arithmetic (TA) defines task vectors by subtracting pre-trained ($\theta_{\text{pre}}$) models parameter space, then adjusting weight between these $\theta_{\text{pre}}$ to balance task-generalized task-specific knowledge. Despite promising...

10.48550/arxiv.2501.15065 preprint EN arXiv (Cornell University) 2025-01-24

Towards Plastic and Stable Incremental Learning: A Dual-Learner Framework with Cumulative Parameter Averaging

OPENALEX - Publications

Wenju Sun Qingyong Li Siyu Zhang Wen Wang Yangli‐ao Geng

10.2139/ssrn.5157877 preprint EN 2025-01-01

xTrimoPGLM: unified 100-billion-parameter pretrained transformer for deciphering the language of proteins

OPENALEX - Publications

Bo Chen Xingyi Cheng Li Pan Yangli‐ao Geng Jing Gong and 10 more

10.1038/s41592-025-02636-z article EN Nature Methods 2025-04-03

Uncertainty Quantification for Joint Demand Prediction of Multi-Mode Ride-Sourcing Using Spatiotemporal Mixture-of-Expert Neural Network

OPENALEX - Publications

Xiaobing Liu Yu Duan Yangli‐ao Geng Yun Wang Qingyong Li and 2 more

10.2139/ssrn.5214484 preprint EN 2025-01-01

DivGCL: A Graph Contrastive Learning Model for Diverse Recommendation

OPENALEX - Publications

Wenwen Gong Yangli‐ao Geng Dan Zhang Yifan Zhu Xiaolong Xu and 4 more

Graph Contrastive Learning (GCL), as a primary paradigm of graph self-supervised learning, spurs fruitful line research in tackling the data sparsity issue by maximizing consistency user/item embeddings between different augmented views with random perturbations. However, diversity, crucial metric for recommendation performance and user satisfaction, has received rather little attention. In fact, there exists challenging dilemma balancing accuracy diversity. To address these issues, we...

10.1609/aaai.v39i16.33852 article EN Proceedings of the AAAI Conference on Artificial Intelligence 2025-04-11

Attention-Based Dual-Source Spatiotemporal Neural Network for Lightning Forecast

OPENALEX - Publications

Tianyang Lin Qingyong Li Yangli‐ao Geng Lei Jiang Liangtao Xu and 4 more

Accurate lightning forecast is significant for disaster prevention and reduction. However, the mainstream methods, which mainly rely on numerical simulations parameterizations, can hardly cope with spatiotemporal deviations. Meanwhile, rapid complex evolution of regions go beyond traditional extrapolation-based methods. In this work, we propose a data-driven neural network model hourly forecast, exploits both recent historical observations. The two kinds data complement each other play...

10.1109/access.2019.2950328 article EN cc-by IEEE Access 2019-01-01

A Power-Angle-Spectrum Based Clustering and Tracking Algorithm for Time-Varying Radio Channels

OPENALEX - Publications

Chen Huang Ruisi He Zhangdui Zhong Bo Ai Yangli‐ao Geng and 4 more

Radio channel modeling has been an important research topic, since the performance of any communication system depends on characteristics. So far, most existing clustering algorithms are conducted based multipath components (MPCs) extracted by using a high-resolution parameter estimation approach, e.g., SAGE or MUSIC, etc. However, approaches require prior information to extract MPCs. Moreover, usually result in relatively high complexity, and thus, clusters can only be identified offline...

10.1109/tvt.2018.2878049 article EN IEEE Transactions on Vehicular Technology 2018-10-25

A Deep Calibration Method for Low-Cost Air Monitoring Sensors With Multilevel Sequence Modeling

OPENALEX - Publications

Haomin Yu Qingyong Li Rao Wang Zechuan Chen Yingjun Zhang and 4 more

Air pollution is growing ever more serious as a result of rising consumption energy and other natural resources. Generally, governmental static monitoring stations provide accurate air data, but they are sparsely distributed in the space. In contrast, microstations kind low-cost equipment can be densely though their accuracy relatively low. This article proposes deep calibration method (DeepCM) for sensors equipped microstations, which consists an encoder decoder. encoding stage, multilevel...

10.1109/tim.2020.2978596 article EN IEEE Transactions on Instrumentation and Measurement 2020-03-05

A Novel Tracking-Based Multipath Component Clustering Algorithm

OPENALEX - Publications

Chen Huang Ruisi He Zhangdui Zhong Yangli‐ao Geng Qingyong Li and 1 more

In mobile communications, wireless channel has been widely considered to be time-variant. To accurately model the time-variant channels, a tracking-based dynamic multipath components (MPCs) clustering algorithm is proposed in this letter. The tracking problem as total probability maximization estimation and solved by using Kuhn-Munkres algorithm. MPCs are further clustered based on results of tracking. validated simulated channels compared with distance-based method. Simulation show good performance

10.1109/lawp.2017.2740622 article EN IEEE Antennas and Wireless Propagation Letters 2017-01-01

Decoupling Learning and Remembering: a Bilevel Memory Framework with Knowledge Projection for Task-Incremental Learning

OPENALEX - Publications

Wenju Sun Qingyong Li Jing Zhang Wen Wang Yangli‐ao Geng

The dilemma between plasticity and stability arises as a common challenge for incremental learning. In contrast, the human memory system is able to remedy this owing its multilevel structure, which motivates us propose Bilevel Memory with Knowledge Projection (BMKP) BMKP decouples functions of learning remembering via bilevel-memory design: working responsible adaptively model learning, ensure plasticity; long-term in charge enduringly storing knowledge incorporated within learned model,...

10.1109/cvpr52729.2023.01933 article EN 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) 2023-06-01

LightNet+: A dual-source lightning forecasting network with bi-direction spatiotemporal transformation

OPENALEX - Publications

Xinyuan Zhou Yangli‐ao Geng Haomin Yu Qingyong Li Liangtao Xu and 3 more

10.1007/s10489-021-03089-5 article EN Applied Intelligence 2022-01-20

A zero-shot learning framework via cluster-prototype matching

OPENALEX - Publications

Jing Zhang Qingyong Li Yangli‐ao Geng Wen Wang Wenju Sun and 2 more

10.1016/j.patcog.2021.108469 article EN Pattern Recognition 2021-11-29

Local-Density Subspace Distributed Clustering for High-Dimensional Data

OPENALEX - Publications

Yangli‐ao Geng Qingyong Li Mingfei Liang Chong‐Yung Chi Juan Jim Tan and 1 more

Distributed clustering is emerging along with the advent of era big data. However, most existing established distributed methods focus on problems caused by a large amount data rather than dimension Consequently, they suffer "curse" dimensionality (e.g., poor performance and heavy network overhead) when high-dimensional (HD) are clustered. In this article, we propose algorithm, referred to as Local Density Subspace Clustering (LDSDC) cluster large-scale HD data, motivated idea that local...

10.1109/tpds.2020.2975550 article EN IEEE Transactions on Parallel and Distributed Systems 2020-02-25