Chenghao Li

ORCID: 0009-0003-0404-3825
Publications
Citations
Views
---
Saved
---
About
Contact & Profiles
Research Areas
  • Advanced Neural Network Applications
  • Context-Aware Activity Recognition Systems
  • Video Surveillance and Tracking Methods
  • Domain Adaptation and Few-Shot Learning
  • Multimodal Machine Learning Applications
  • Visual Attention and Saliency Detection
  • Advanced Image and Video Retrieval Techniques
  • Robot Manipulation and Learning
  • Human Pose and Action Recognition
  • Robotic Path Planning Algorithms
  • Chaos-based Image/Signal Encryption
  • Engineering Applied Research
  • Electronic Health Records Systems
  • IoT and Edge/Fog Computing
  • Remote Sensing and LiDAR Applications
  • Adversarial Robustness in Machine Learning
  • Advanced Vision and Imaging
  • Healthcare Technology and Patient Monitoring
  • Cloud Computing and Resource Management
  • Non-Invasive Vital Sign Monitoring
  • Optimization and Search Problems
  • Advanced Steganography and Watermarking Techniques
  • Wireless Body Area Networks
  • Optimization and Packing Problems
  • Advanced Manufacturing and Logistics Optimization

Xi’an Jiaotong-Liverpool University
2025

Japan Advanced Institute of Science and Technology
2024

Shandong Normal University
2024

Korea Advanced Institute of Science and Technology
2024

Brigham Young University
2023

Dalian University of Technology
2023

University of Southern California
2023

Southwest University of Science and Technology
2022

Dalian Maritime University
2022

Changchun Institute of Optics, Fine Mechanics and Physics
2022

Segment anything model (SAM) developed by Meta AI Research has recently attracted significant attention. Trained on a large segmentation dataset of over 1 billion masks, SAM is capable segmenting any object certain image. In the original work, authors turned to zero-short transfer tasks (like edge detection) for evaluating performance SAM. Recently, numerous works have attempted investigate in various scenarios recognize and segment objects. Moreover, projects emerged show versatility as...

10.48550/arxiv.2306.06211 preprint EN cc-by-nc-sa arXiv (Cornell University) 2023-01-01

The emotional response of robotics is crucial for promoting the socially intelligent level human–robot interaction (HRI). development machine learning has extensively stimulated research on recognition robots. Our focuses gaits, a type simple modality that stores series joint coordinates and easy humanoid robots to execute. However, limited amount investigates HRI systems based indicating an existing gap in human emotion gait robotic response. To address this challenge, we propose...

10.3390/s25030734 article EN cc-by Sensors 2025-01-25

Aiming at the problem of low detection accuracy grasping algorithm based on RGB information as input, this paper proposes a CSP-ResNet to improve algorithm. that chessboard effect will be produced when transposed convolution restores image fractional variability, which affect prediction model, designs fusion nearest neighbor interpolation upsampling and restore resolution feature map. To alleviate checkerboard by convolution. Namely FCG-Net (Fuse CSPS Grasp Net). This method takes first...

10.1109/icicml57342.2022.10009877 article EN 2022 International Conference on Image Processing, Computer Vision and Machine Learning (ICICML) 2022-10-28

The concern over data and model privacy in machine learning inference as a service (MLaaS) has led to the development of private (PI) techniques. However, existing PI frameworks, especially those designed for large models such vision transformers (ViT), suffer from high computational communication overheads caused by expensive multi-party computation (MPC) protocols. encrypted attention module that involves softmax operation contributes significantly this overhead. In work, we present family...

10.1109/iccad57390.2023.10323702 article EN 2015 IEEE/ACM International Conference on Computer-Aided Design (ICCAD) 2023-10-28

Grasping a diverse range of novel objects from dense clutter poses great challenge to robots because the occlusion among these objects. In this work, we propose Pyramid-Monozone Synergistic Policy (PMSGP) that enables cleverly avoid most occlusions during grasping. Specifically, initially construct Pyramid Se quencing (PSP) sequence each object in scene into pyramid structure. By isolating layer-by-layer, grasp candidates will focus on single layer grasp. Then, devise Monozone Sampling (MSP)...

10.48550/arxiv.2409.06959 preprint EN arXiv (Cornell University) 2024-09-10

The option framework has shown great promise by automatically extracting temporally-extended sub-tasks from a long-horizon task. Methods have been proposed for concurrently learning low-level intra-option policies and high-level selection policy. However, existing methods typically suffer two major challenges: ineffective exploration unstable updates. In this paper, we present novel stable off-policy approach that builds on the maximum entropy model to address these challenges. Our...

10.48550/arxiv.2006.14363 preprint EN other-oa arXiv (Cornell University) 2020-01-01

A new approach to digital image signatures is proposed. The proposed has shown be resistant several kinds of processing and the JPEG lossy compression. Moreover, can extracted from watermarked without resorting original image.

10.1109/icce.1999.785189 article EN 2003-01-20

This paper studies the prevention of premature failures LED backlights used in mobile devices that are subject to different use conditions. is a vitally important topic for consumer device manufacturers as life expectancy two identical from same production line may vary substantially under operating environments and These differences not addressed by traditional reliability assessment methods documented many electronics handbooks. The outlines prognostics approach condition-based monitoring...

10.1109/jdt.2012.2198044 article EN Journal of Display Technology 2012-06-05

Abstract In order to effectively improve the detection accuracy of remote sensing images in airport areas, basing on representative deep network Faster R-CNN as object method, a deeper basic ResNet and feature fusion component FPN are used extract more robust distinguishing features, add new fully connected layer end combine softmax classifier 4 logistic regression classifiers for according inter-class correlation object. Experiments show that improvement original brings 7.7% mAP 76.6% mAP....

10.1088/1742-6596/1601/3/032010 article EN Journal of Physics Conference Series 2020-07-01

Population aging is a growing issue for many metropolitan cities. With the proven effectiveness of assistive technology elderly care, smart home healthcare system that utilizes would facilitate independent living senior citizens with cognitive impairment alone. The main objective proposed to provide patients point-of-care solution minimal user intervention while reducing demand public services.

10.1109/gcce.2012.6379653 article EN 2012-10-01

This study focuses on the over-fitting problem in training process of deep convolutional neural network model and poor robustness when is applied an occlusion environment. We propose a unique data augmentation method, In-and-Out. First, information variance enhanced through dynamic local operation while maintaining overall geometric structure image; compared with global our method effectively alleviates overfitting significantly improves generalization ability model. Then removal operation,...

10.1117/1.jei.31.1.013023 article EN Journal of Electronic Imaging 2022-02-09

In contrast to the human vision that mainly depends on shape for recognizing objects, deep image recognition models are widely known be biased toward texture. Recently, Meta research team has released first foundation model segmentation, termed segment anything (SAM), which attracted significant attention. this work, we understand SAM from perspective of texture \textit{v.s.} shape. Different label-oriented tasks, is trained predict a mask covering object based promt. With said, it seems...

10.48550/arxiv.2311.11465 preprint EN cc-by-nc-sa arXiv (Cornell University) 2023-01-01

We revisit the relationship between attention mechanisms and large kernel ConvNets in visual transformers propose a new spatial named Large Kernel Convolutional Attention (LKCA). It simplifies operation by replacing it with single convolution. LKCA combines advantages of convolutional neural networks transformers, possessing receptive field, locality, parameter sharing. explained superiority from both convolution perspectives, providing equivalent code implementations for each view....

10.48550/arxiv.2401.05738 preprint EN cc-by arXiv (Cornell University) 2024-01-01

The detection accuracy and speed of grasp models on benchmarks are the focal points concern in robotic grasping community. Especially a collaborative robot setting, safety model is an essential aspect that cannot be overlooked. In this paper, we explore how to enhance autonomous vision-guided grasping. Specifically, propose simple yet practical Safety-optimized Strategy, which consists two parts. first part involves depth prioritization, optimizing sequence from top bottom based order...

10.36227/techrxiv.170792424.40456169/v1 preprint EN cc-by-nc-sa 2024-02-14

Integrating the artificial intelligence vision system into robots has significantly enhanced adaptability of grasping, but are vulnerable to potential backdoor threats. Currently, majority attacks focused on image classification and limited unimodal information single-object digital scenarios. In this work, we make first endeavor realize attack multimodal vision-guided robot grasping within high-clutter Specifically, propose a novel method named Shortcut-enhanced Multimodal Backdoor Attack...

10.36227/techrxiv.170792505.56224502/v1 preprint EN cc-by-nc-sa 2024-02-14

This paper presents an efficient deep reinforcement learning (DRL) framework for online 3D bin packing (3D-BPP). The 3D-BPP is NP-hard problem significant in logistics, warehousing, and transportation, involving the optimal arrangement of objects inside a bin. Traditional heuristic algorithms often fail to address dynamic physical constraints real-time scenarios. We introduce novel DRL that integrates reliable physics algorithm object rearrangement stable placement. Our experiment show...

10.48550/arxiv.2408.09694 preprint EN arXiv (Cornell University) 2024-08-19
Coming Soon ...