Rui Xu

ORCID: 0000-0002-5549-236X
Publications
Citations
Views
---
Saved
---
About
Contact & Profiles
Research Areas
  • Advanced Neural Network Applications
  • Domain Adaptation and Few-Shot Learning
  • COVID-19 diagnosis using AI
  • Radiomics and Machine Learning in Medical Imaging
  • Lung Cancer Diagnosis and Treatment
  • Anomaly Detection Techniques and Applications
  • Advanced Vision and Imaging
  • Machine Learning and ELM
  • Advanced Image and Video Retrieval Techniques
  • 3D Shape Modeling and Analysis
  • Speech and Audio Processing
  • Computer Graphics and Visualization Techniques
  • Remote-Sensing Image Classification
  • Robotics and Sensor-Based Localization
  • AI in cancer detection
  • Face recognition and analysis
  • Vibration Control and Rheological Fluids
  • Text and Document Classification Technologies
  • Aeroelasticity and Vibration Control
  • Face and Expression Recognition
  • Multimodal Machine Learning Applications
  • Indoor and Outdoor Localization Technologies
  • Sparse and Compressive Sensing Techniques
  • Robotic Path Planning Algorithms
  • Topic Modeling

Shanghai Jiao Tong University
2018-2025

Beijing Friendship Hospital
2025

Fujian University of Technology
2022-2025

Capital Medical University
2025

China University of Petroleum, East China
2019-2024

Changchun University of Science and Technology
2024

Jinling Institute of Technology
2024

Longhua Hospital Shanghai University of Traditional Chinese Medicine
2024

Shanghai University of Traditional Chinese Medicine
2024

Chinese Research Academy of Environmental Sciences
2023-2024

We develop a general approach to distill symbolic representations of learned deep model by introducing strong inductive biases. focus on Graph Neural Networks (GNNs). The technique works as follows: we first encourage sparse latent when train GNN in supervised setting, then apply regression components the extract explicit physical relations. find correct known equations, including force laws and Hamiltonians, can be extracted from neural network. our method non-trivial cosmology example-a...

10.48550/arxiv.2006.11287 preprint EN other-oa arXiv (Cornell University) 2020-01-01

Lip reading has witnessed unparalleled development in recent years thanks to deep learning and the availability of large-scale datasets. Despite encouraging results achieved, performance lip reading, unfortunately, remains inferior one its counterpart speech recognition, due ambiguous nature actuations that makes it challenging extract discriminant features from movement videos. In this paper, we propose a new method, termed as by Speech (LIBS), which goal is strengthen recognizers. The...

10.1609/aaai.v34i04.6174 article EN Proceedings of the AAAI Conference on Artificial Intelligence 2020-04-03

FPGA-based CNN accelerators have advantages in flexibility and power efficiency so are being deployed by a number of cloud computing service providers, including Microsoft, Amazon, Tencent, Alibaba. Given the increasing complexity neural networks, however, it is becoming challenging to efficiently map CNNs multi-FPGA platforms. In this work, we present scalable framework, FPDeep, which helps engineers specific CNN's training logic cluster or build RTL implementations for target network. With...

10.1109/fccm.2018.00021 article EN 2018-04-01

Lip reading aims at decoding texts from the movement of a speaker's mouth. In recent years, lip methods have made great progress for English, both word-level and sentence-level. Unlike however, Chinese Mandarin is tone-based language relies on pitches to distinguish lexical or grammatical meaning, which significantly increases ambiguity task. this paper, we propose Cascade Sequence-to-Sequence Model (CSSMCM) reading, explicitly models tones when predicting sentence. Tones are modeled based...

10.1145/3338533.3366579 preprint EN 2019-12-15

In recent years, researchers pay growing attention to the few-shot learning (FSL) task address data-scarce problem. A standard FSL framework is composed of two components: i) Pre-train. Employ base data generate a CNN-based feature extraction model (FEM). ii) Meta-test. Apply trained FEM novel (category different from data) acquire embeddings and recognize them. Although have made remarkable breakthroughs in FSL, there still exists fundamental Since with usually cannot adapt class...

10.1109/tcsvt.2021.3135023 article EN IEEE Transactions on Circuits and Systems for Video Technology 2021-12-16

Few-shot learning (FSL), purposing to resolve the problem of data-scarce, has attracted considerable attention in recent years. A popular FSL framework contains two phases: (i) pre-train phase employs base data train a CNN-based feature extractor. (ii) meta-test applies frozen extractor novel (novel different categories from data) and designs classifier for recognition. To correct few-shot distribution, researchers propose Semi-Supervised Few-Shot Learning (SSFSL) by introducing unlabeled...

10.1109/tcsvt.2022.3196550 article EN IEEE Transactions on Circuits and Systems for Video Technology 2022-08-04

In this work, we propose a voxel-based single-stage fine-grained and efficient point cloud 3D object detection algorithm to address the inadequate granularity in feature extraction tasks imbalance between efficiency accuracy scenarios. We develop lightweight multibranch cross-sparse convolution network (LMCCN) that is designed preserve of original while achieving enhanced efficiency. Additionally, introduce compact self-attention augmented bird's eye view (BEV) module (CFSAM). This aims...

10.1109/tits.2024.3373227 article EN IEEE Transactions on Intelligent Transportation Systems 2024-03-29

Stereo matching is a key technique for metric depth estimation in computer vision and robotics. Real-world challenges like occlusion non-texture hinder accurate disparity from binocular cues. Recently, monocular relative has shown remarkable generalization using foundation models. Thus, to facilitate robust stereo with cues, we incorporate model into the recurrent stereo-matching framework, building new framework model-based stereo-matching, DEFOM-Stereo. In feature extraction stage,...

10.48550/arxiv.2501.09466 preprint EN arXiv (Cornell University) 2025-01-16

Extracting buildings from high-resolution remote sensing images is currently a research hotspot in the field of applications. Deep learning methods have significantly improved accuracy building extraction, but there are still deficiencies such as blurred edges, incomplete structures and loss details extraction results. To obtain accurate contours clear boundaries buildings, this article proposes novel method utilizing multi-scale attention gate enhanced positional information. By employing...

10.7717/peerj-cs.2826 article EN cc-by PeerJ Computer Science 2025-04-21

10.1016/j.engappai.2023.106640 article EN Engineering Applications of Artificial Intelligence 2023-06-22

The rapid development of antibiotic resistance is occurring at a global scale. We therefore stride into the post-antibiotic era and have to battle in Anthropocene. Metals are widely used their pollution widespread worldwide. More importantly, metal-induced co-selection greatly expands environmental resistomes increases health risk environments. Here, we reviewed increasingly important roles resistance. In particular, highlight metal-rich environments that maintain reservoirs for high-risk...

10.1007/s42832-024-0244-4 article EN other-oa Soil Ecology Letters 2024-04-12

Gossamer space structures technology have gained widely applications in missions. However, the vibration problem is a great challenge which makes complicated. The overall motivation of this work to develop control system for gossamer structures. In study, membrane structure with piezoelectric stack actuators bracketed on its support frame considered. First, description smart and dynamic model are presented. Then, decentralized adaptive fuzzy method developed vibration. Finally, experimental...

10.1177/1045389x16679023 article EN Journal of Intelligent Material Systems and Structures 2016-12-02

This paper deals with the active vibration control of smart truss structure. First, electro-mechanical coupled dynamic model structure is constructed. Then, first-order ordinary differential equation system presented. After that, an online learning fuzzy (OLFC) algorithm proposed to vibrations. The OLFC composed a reward function, Q algorithm, rule base generator and conventional controller. learns by interaction plant, changes generate policy via evaluative signal realize goal. only needs...

10.1177/0954410016640823 article EN Proceedings of the Institution of Mechanical Engineers Part G Journal of Aerospace Engineering 2016-04-06

Terrestrial gross primary productivity (GPP) is the major carbon input to terrestrial ecosystem. The Yangtze River Basin (YRB) holds a key role in shaping China’s economic and social progress, as well ecological environmental protection. However, how GPP YRB responds climate factors remain unclear. In this research, we applied Vegetation Photosynthesis Model (VPM) data explore spatial temporal variations of during 2000–2018. Based on China Meteorological Forcing Dataset (CMFD), partial least...

10.3390/f14091898 article EN Forests 2023-09-18
Coming Soon ...