Xiaodong He

ORCID: 0000-0001-9379-6762
Publications
Citations
Views
---
Saved
---
About
Contact & Profiles
Research Areas
  • Retinal Imaging and Analysis
  • Medical Image Segmentation Techniques
  • Emotion and Mood Recognition
  • Reservoir Engineering and Simulation Methods
  • Retinal Diseases and Treatments
  • Hydraulic Fracturing and Reservoir Analysis
  • Liver physiology and pathology
  • Brain Tumor Detection and Classification
  • Oil and Gas Production Techniques
  • Structural Response to Dynamic Loads
  • Mesenchymal stem cell research
  • Dermatology and Skin Diseases
  • Speech and Audio Processing
  • COVID-19 diagnosis using AI
  • Lattice Boltzmann Simulation Studies
  • 3D Surveying and Cultural Heritage
  • Caching and Content Delivery
  • Hematopoietic Stem Cell Transplantation
  • Speech Recognition and Synthesis
  • Integrated Circuits and Semiconductor Failure Analysis
  • Advanced Graph Neural Networks
  • Advanced Image Processing Techniques
  • Machine Learning in Materials Science
  • Fluid Dynamics and Vibration Analysis
  • Structural Behavior of Reinforced Concrete

Dian Diagnostics (China)
2020

BOE Technology Group (China)
2020

Inception Institute of Artificial Intelligence
2019

PetroChina Southwest Oil and Gas Field Company (China)
2008-2011

Research Institute of Petroleum Exploration and Development
2008-2011

Shanghai Stomatological Hospital
2005

Medical image analysis has two important research areas: disease grading and fine-grained lesion segmentation. Although the former problem often relies on latter, are usually studied separately. Disease severity can be treated as a classification problem, which only requires image-level annotations, while segmentation stronger pixel-level annotations. However, pixel-wise data annotation for medical images is highly time-consuming domain experts. In this paper, we propose collaborative...

10.1109/cvpr.2019.00218 article EN 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) 2019-06-01

Diabetic retinopathy (DR) is a complication of diabetes that severely affects eyes. It can be graded into five levels severity according to international protocol. However, optimizing grading model have strong generalizability requires large amount balanced training data, which difficult collect, particularly for the high levels. Typical data augmentation methods, including random flipping and rotation, cannot generate with diversity. In this paper, we propose diabetic generative adversarial...

10.1109/jbhi.2020.3045475 article EN IEEE Journal of Biomedical and Health Informatics 2020-12-17

This paper introduces a novel neural network - flow completion (FCN) to infer the fluid dynamics, includ-ing field and force acting on body, from incomplete data based Graph Convolution AttentionNetwork. The FCN is composed of several graph convolution layers spatial attention layers. It designed inferthe velocity vortex contribution when combined with map (VFM)method. Compared other networks adopted in capable dealing bothstructured unstructured data. performance proposed assessed by...

10.1063/5.0097688 article EN cc-by Physics of Fluids 2022-07-19

SimRank is a classic measure of the similarities nodes in graph. Given node u graph $G =(V, E)$, \em single-source query returns $s(u, v)$ between and each $v \in V$. This type queries has numerous applications web search social networks analysis, such as link prediction, mining, spam detection. Existing methods for queries, however, incur cost at least linear to number n, which renders them inapplicable real-time interactive analysis. paper proposes \prsim, an algorithm that exploits...

10.1145/3299869.3319873 preprint EN Proceedings of the 2022 International Conference on Management of Data 2019-06-18

Speech emotion recognition (SER) has attracted great attention in recent years due to the high demand for emotionally intelligent speech interfaces. Deriving speaker-invariant representations is crucial. In this paper, we propose apply adversarial training SER learn representations. Our model consists of three parts: a representation learning sub-network with time-delay neural network (TDNN) and LSTM statistical pooling, an classification speaker network. Both take output as input. Two...

10.48550/arxiv.1903.09606 preprint EN other-oa arXiv (Cornell University) 2019-01-01

The morphological ECG features for arrhythmia diagnosis are usually identified and combined on different scales.For example, can be the scale of length or amplitude QRS waves. Professionals then make a based combination these features. Attention-based deep neural networks have been proved to boost meaningful scales suppress weak To combine classification, we proposed MADNN: multi-scale attention network classification. Our was designed combining kernel-wise branch-wise modules backbone...

10.22489/cinc.2020.282 article EN Computing in cardiology 2020-12-30

CAPTCHAs based on reading text are susceptible to machine-learning-based attacks due recent significant advances in deep learning (DL). To address this, this paper promotes image/visual captioning CAPTCHAs, which is robust against attacks. develop image/visual-captioning-based proposes a new image architecture by exploiting tensor product representations (TPR), structured neural-symbolic framework developed cognitive science over the past 20 years, with aim of integrating DL explicit...

10.48550/arxiv.1710.11475 preprint EN other-oa arXiv (Cornell University) 2017-01-01
Coming Soon ...