Haoxiang Gao

ORCID: 0000-0003-4581-1723
Publications
Citations
Views
---
Saved
---
About
Contact & Profiles
Research Areas
  • Single-cell and spatial transcriptomics
  • Cell Image Analysis Techniques
  • Gene Regulatory Network Analysis
  • Human Pose and Action Recognition
  • Extracellular vesicles in disease
  • Osteoarthritis Treatment and Mechanisms
  • Knee injuries and reconstruction techniques
  • Anomaly Detection Techniques and Applications
  • Context-Aware Activity Recognition Systems
  • Traffic Prediction and Management Techniques
  • Gene expression and cancer classification
  • Artificial Intelligence in Law
  • Law, Economics, and Judicial Systems
  • Metallurgy and Material Forming
  • Cardiac Fibrosis and Remodeling
  • BIM and Construction Integration
  • Immune cells in cancer
  • Machine Learning and ELM
  • Metal Forming Simulation Techniques
  • Traffic control and management
  • Congenital heart defects research
  • Internet Traffic Analysis and Secure E-voting
  • Advanced Surface Polishing Techniques
  • European and International Contract Law
  • Energy Load and Power Forecasting

Institute of Economics
2025

Tsinghua University
2018-2024

Carnegie Mellon University
2018

Institute of Bioinformatics
2018

Imperial College London
2016

Deep neural networks, including recurrent have been successfully applied to human activity recognition. Unfortunately, the final representation learned by networks might encode some noise (irrelevant signal components, unimportant sensor modalities, etc.). Besides, it is difficult interpret gain insight into models' behavior. To address these issues, we propose two attention models for recognition: temporal and attention. These mechanisms adaptively focus on important signals modalities....

10.1145/3267242.3267286 preprint EN 2018-10-04

The accumulation of massive single-cell omics data provides growing resources for building biomolecular atlases all cells human organs or the whole body. true assembly a cell atlas should be cell-centric rather than file-centric. We developed unified informatics framework seamless and built Ensemble Cell Atlas (hECA) from scattered data. hECA v1.0 assembled 1,093,299 labeled 116 published datasets, covering 38 11 systems. invented three new methods applications based on assembly: "in data"...

10.1016/j.isci.2022.104318 article EN cc-by iScience 2022-04-28

Idiopathic pulmonary fibrosis (IPF) is a chronic interstitial lung disease with high mortality rate and unclarified aetiology. Immune response elaborately regulated during the progression of IPF, but immune cells subsets are complicated which has not been detailed described IPF progression. Therefore, in current study, we sought to investigate role regulation by characterize heterogeneous IPF. To this end, performed single-cell profiling isolated from four stages bleomycin-induced fibrosis-a...

10.3389/fimmu.2023.1230266 article EN cc-by Frontiers in Immunology 2023-09-13

Abstract Profiling spatial variations of cellular composition and transcriptomic characteristics is important for understanding the physiology pathology tissues. Spatial transcriptomics (ST) data depict gene expression but currently dominating high-throughput technology yet not at single-cell resolution. Single-cell RNA-sequencing (SC) provide information level lack information. Integrating these two types would be ideal revealing landscapes We develop method STEM (SpaTially aware EMbedding)...

10.1038/s42003-023-05640-1 article EN cc-by Communications Biology 2024-01-06

Large language models (LLMs) have demonstrated significant potential in transforming real estate transactions through advanced natural processing capabilities. This paper proposes a method leveraging LLMs to extract information from contracts and facilitate interactive querying by users. By emphasizing the use of few-shot learning, can effectively analyze contract documents respond specific questions users regarding details. explores development this method, its implementation workflows,...

10.31219/osf.io/kb9cu preprint EN 2024-05-10

Real estate sales contracts contain crucial information for property transactions, but manual extraction of data can be time-consuming and error-prone. This paper explores the application large language models, specifically transformer-based architectures, automated from real contracts. We discuss challenges, techniques, future directions in leveraging these models to improve efficiency accuracy contract analysis.

10.48550/arxiv.2404.18043 preprint EN arXiv (Cornell University) 2024-04-27

This article explores the application of Large Lan-guage Models (LLMs) and multi-task learning inevaluating validity contract formation sce-narios. The breaks down formationinto four essential elements: offer, acceptance, con-sideration, defenses then assesses pres-ence each element within a hypothetical case todetermine viability formation.

10.31219/osf.io/d5jsn preprint EN 2024-06-06

Background: Musculoskeletal tissue degeneration impairs the life quality and function of many people. Meniscus is a major origin knee osteoarthritis common threat to athletic ability, but its cellular mechanism remains elusive. Methods: We built cell atlas 12 healthy or degenerated human meniscus samples from inner outer meniscal zones 8 patients using scRNA-seq investigate microenvironment homeostasis changes in process verified findings with immunofluorescent imaging. Results: identified...

10.7554/elife.79585 article EN cc-by eLife 2022-12-22

This perspective discusses the need and directions for development of a unified information framework to enable assembly cell atlases revolution in medical research on virtual body assembled systems.

10.1093/nsr/nwab179 article EN cc-by National Science Review 2021-09-24

A universal coordinate system that can ensemble the huge number of cells and capture their heterogeneities is vital importance for constructing large-scale cell atlases as references molecular cellular studies. Studies have shown exhibit multifaceted in transcriptomic features at multiple resolutions. This nature complexity makes it hard to design a fixed through combination known features. It desirable build learnable model major serve controlled generative data augmentation. We developed...

10.1038/s42003-024-06564-0 article EN cc-by-nc-nd Communications Biology 2024-08-12

Understanding perturbations at the single-cell level is essential for unraveling cellular mechanisms and their implications in health disease. The growing availability of biological data has driven development a variety silico perturbation methods designed analysis, which offer means to address many inherent limitations experimental approaches. However, these computational are often tailored specific scenarios validated on limited datasets metrics, making evaluation comparison challenging....

10.1101/2024.12.20.629581 preprint EN cc-by-nc bioRxiv (Cold Spring Harbor Laboratory) 2024-12-22

The use of Finite Element (FE) simulation software to adequately predict the outcome sheet metal forming processes is crucial enhancing efficiency and lowering development time such processes, whilst reducing costs involved in trial-and-error prototyping. Recent focus on substitution steel components with aluminum alloy alternatives automotive aerospace sectors has increased need simulate behavior alloys for ever more complex component geometries. However these alloys, particular their high...

10.3791/53957-v article EN Journal of Visualized Experiments 2016-12-13

Expectations of machine learning (ML) are high for discovering new patterns in high-throughput biological data, but most such practices accustomed to relying on existing knowledge conditions design experiments. Investigations the power and limitation ML revealing complex from data without guide have been lacking. In this study, we conducted systematic experiments ab initio discovery with methods single-cell RNA-sequencing early embryonic development. Results showed that a strategy combining...

10.1016/j.patter.2020.100071 article EN cc-by-nc-nd Patterns 2020-07-10

SUMMARY Single-cell omics data can characterize multifaceted features of massive cells and bring significant insights to biomedical researches. The accumulation single-cell provides growing resources for constructing atlases all a human organ or the whole body. true assembly cell atlas should be cell-centric rather than file-centric. We proposed unified information framework enabling seamless developed Ensemble Cell Atlas (hECA) as an instance. hECA version 1.0 assembled scRNA-seq across...

10.1101/2021.07.21.453289 preprint EN bioRxiv (Cold Spring Harbor Laboratory) 2021-07-23

Abstract Profiling spatial variations of cellular composition and transcriptomic characteristics is important for understanding the physiology pathology tissues in health or diseases. Spatial transcriptomics (ST) data are powerful depicting gene expression but currently dominating high-throughput technology yet not at single-cell resolution. On other hand, RNA-sequencing (SC) provide information level lack information. Integrating these two types would be ideal revealing landscapes We...

10.1101/2022.09.23.509186 preprint EN cc-by-nc-nd bioRxiv (Cold Spring Harbor Laboratory) 2022-09-26

Abstract A universal coordinate system that can ensemble the huge number of cells and capture their heterogeneities is vital importance for constructing large-scale cell atlases as references future molecular cellular studies. Studies have shown in complex organs exhibit multifaceted transcriptomic features at multiple resolutions. This nature complexity makes it hard to design a fixed through combination known features. It desirable build learnable model major serve controlled generative...

10.1101/2021.09.09.459281 preprint EN cc-by-nc-nd bioRxiv (Cold Spring Harbor Laboratory) 2021-09-10

Abstract Multiple steps of bioinformatics processing are needed to convert the raw scRNA-seq data information that can be used in downstream analyses and building cell atlases. Dozens software packages have been developed different labs tend preferences on choices workflow. Such diversity cause difficulties future efforts aggregating from multiple labs, also for new start this field. A few pipelines help integrating into a whole, but fixed architecture makes it hard developers add features...

10.1101/456772 preprint EN cc-by-nc bioRxiv (Cold Spring Harbor Laboratory) 2018-10-30

Deep neural networks, including recurrent have been successfully applied to human activity recognition. Unfortunately, the final representation learned by networks might encode some noise (irrelevant signal components, unimportant sensor modalities, etc.). Besides, it is difficult interpret gain insight into models' behavior. To address these issues, we propose two attention models for recognition: temporal and attention. These mechanisms adaptively focus on important signals modalities....

10.48550/arxiv.1810.04038 preprint EN other-oa arXiv (Cornell University) 2018-01-01

Abstract Background Musculoskeletal tissue degeneration impairs the life quality and function of many people. Meniscus is a major origin knee osteoarthritis common threat to athletic ability, but its cellular mechanism remains elusive. Methods We built cell atlas healthy/degenerated human meniscus using scRNA-seq investigate meniscal microenvironment homeostasis changes in process verified findings with immunofluorescent imaging. Results identified localized types inner outer meniscus, found...

10.1101/2022.05.19.22275311 preprint EN cc-by medRxiv (Cold Spring Harbor Laboratory) 2022-05-20
Coming Soon ...