Haoyu Xie

ORCID: 0000-0001-8833-0933
Publications
Citations
Views
---
Saved
---
About
Contact & Profiles
Research Areas
  • Advanced Neural Network Applications
  • Advanced Image and Video Retrieval Techniques
  • Face recognition and analysis
  • Generative Adversarial Networks and Image Synthesis
  • COVID-19 diagnosis using AI
  • Biometric Identification and Security
  • Domain Adaptation and Few-Shot Learning
  • Topic Modeling
  • Video Surveillance and Tracking Methods
  • Multimodal Machine Learning Applications
  • Anomaly Detection Techniques and Applications
  • Gait Recognition and Analysis
  • Direction-of-Arrival Estimation Techniques
  • Face and Expression Recognition
  • Imbalanced Data Classification Techniques
  • Rough Sets and Fuzzy Logic
  • Seismic Performance and Analysis
  • Adversarial Robustness in Machine Learning
  • AI in cancer detection
  • Data Mining Algorithms and Applications
  • Traffic Prediction and Management Techniques
  • Advanced Optical Sensing Technologies
  • Natural Language Processing Techniques
  • Robotics and Automated Systems
  • Autonomous Vehicle Technology and Safety

New York University
2024

Northeastern University
2020-2024

Alibaba Group (Cayman Islands)
2023

Universidad del Noreste
2021

Anhui University of Finance and Economics
2021

Southwest University
2019

National University of Defense Technology
2017

In the background of information age, importance data resources can be imagined, and use means—data mining has also emerged. current situation, all industries are in a relatively equal stage, should make good resources, Apriori algorithm to mine association rules, formulate marketing strategies, promote sales growth slow down loss national GDP. Countries predictions support decisions. Briefly describe basic concepts rules. Only with understanding rules we better understand algorithm, why it...

10.4236/jss.2021.94034 article EN Open Journal of Social Sciences 2021-01-01

Vision Transformers (ViTs) have demonstrated powerful representation ability in various visual tasks thanks to their intrinsic data-hungry nature. However, we unexpectedly find that ViTs perform vulnerably when applied face recognition (FR) scenarios with extremely large datasets. We investigate the reasons for this phenomenon and discover existing data augmentation approach hard sample mining strategy are incompatible ViTs-based FR backbone due lack of tailored consideration on preserving...

10.1109/iccv51070.2023.01887 article EN 2021 IEEE/CVF International Conference on Computer Vision (ICCV) 2023-10-01

Semi-Supervised Semantic Segmentation (S4) aims to train a segmentation model with limited labeled images and substantial volume of unlabeled images. To improve the robustness representations, powerful methods introduce pixel-wise contrastive learning approach in latent space (i.e., representation space) that aggregates representations their prototypes fully supervised manner. However, previous contrastive-based S4 merely rely on supervision from model's output (logits) logit during...

10.1109/iccv51070.2023.00092 article EN 2021 IEEE/CVF International Conference on Computer Vision (ICCV) 2023-10-01

Scene completion and forecasting are two popular perception problems in research for mobile agents like autonomous vehicles. Existing approaches treat the isolation, resulting a separate of aspects. In this paper, we introduce novel LiDAR task Occupancy Completion Forecasting (OCF) context driving to unify these aspects into cohesive framework. This requires new algorithms address three challenges altogether: (1) sparse-to-dense reconstruction, (2) partial-to-complete hallucination, (3)...

10.48550/arxiv.2310.11239 preprint EN cc-by-nc-nd arXiv (Cornell University) 2023-01-01

Tremendous breakthroughs have been developed in Semi-Supervised Semantic Segmentation (S4) through contrastive learning. However, due to limited annotations, the guidance on unlabeled images is generated by model itself, which inevitably exists noise and disturbs unsupervised training process. To address this issue, we propose a robust contrastive-based S4 framework, termed Probabilistic Representation Contrastive Learning (PRCL) framework enhance robustness of We pixel-wise representation...

10.48550/arxiv.2402.18117 preprint EN arXiv (Cornell University) 2024-02-28

In the field of human-centric personalized image generation, adapter-based method obtains ability to customize and generate portraits by text-to-image training on facial data. This allows for identity-preserved personalization without additional fine-tuning in inference. Although there are improvements efficiency fidelity, is often a significant performance decrease test following ability, controllability, diversity generated faces compared base model. this paper, we analyze that degradation...

10.48550/arxiv.2410.12312 preprint EN arXiv (Cornell University) 2024-10-16

10.1109/iros58592.2024.10801302 article EN 2021 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS) 2024-10-14

Vision Transformers (ViTs) have demonstrated powerful representation ability in various visual tasks thanks to their intrinsic data-hungry nature. However, we unexpectedly find that ViTs perform vulnerably when applied face recognition (FR) scenarios with extremely large datasets. We investigate the reasons for this phenomenon and discover existing data augmentation approach hard sample mining strategy are incompatible ViTs-based FR backbone due lack of tailored consideration on preserving...

10.48550/arxiv.2308.10133 preprint EN other-oa arXiv (Cornell University) 2023-01-01

Recent advancement in personalized image generation have unveiled the intriguing capability of pre-trained text-to-image models on learning identity information from a collection portrait images. However, existing solutions are vulnerable producing truthful details, and usually suffer several defects such as (i) The generated face exhibit its own unique characteristics, \ie facial shape feature positioning may not resemble key characteristics input, (ii) synthesized contain warped, blurred...

10.48550/arxiv.2308.14256 preprint EN other-oa arXiv (Cornell University) 2023-01-01

10.1016/j.nima.2017.12.030 article EN Nuclear Instruments and Methods in Physics Research Section A Accelerators Spectrometers Detectors and Associated Equipment 2017-12-23

Adaptive filters based on the maximum correntropy criterion (MCC) exhibit good robustness against impulsive interferences. The commonly used kernel function employed in MCC is Gaussian although it may not be best option all cases. To this end, a generalization of kernel, called q-Gaussian combined with developed using gradient, however leading slow convergence rate. overcome defect rate, paper presents novel conjugate gradient (QMCC-CG) algorithm. proposed QMCC-CG algorithm accelerates and...

10.1109/cac48633.2019.8996428 article EN 2019-11-01

RoIPool/RoIAlign is an indispensable process for the typical two-stage object detection algorithm, it used to rescale proposal cropped from feature pyramid generate a fixed size map. However, these maps of local receptive fields will heavily lose global context information. To tackle this problem, we propose novel end-to-end trainable framework, called Global Context Aware (GCA) RCNN, aiming at assisting neural network in strengthening spatial correlation between background and foreground by...

10.48550/arxiv.2012.02637 preprint EN other-oa arXiv (Cornell University) 2020-01-01

The input ground motion for the seismic design of nuclear power plant (NPP) equipment is required to envelop both response spectrum (RRS) and target spectral density (PSD). However, no unified algorithm exists generate PSD, whether at home or abroad. Based on generation methodology PSD suggested in Section 3.7.1 Standard Review Plan (SRP) U.S. Nuclear Regulatory Commission, this paper proposes an improved method synthesis with original iteration process optimized. proposed applied...

10.13832/j.jnpe.2021.05.0128 article EN 核动力工程 2021-09-30
Coming Soon ...