NFDI4DS | UHH-SEMS - Publication Details

Multi-view Convolutional Neural Networks for 3D Shape Recognition

OPENALEX - Publications

Hang Su Subhransu Maji Evangelos Kalogerakis Erik Learned-Miller

A longstanding question in computer vision concerns the representation of 3D shapes for recognition: should be represented with descriptors operating on their native formats, such as voxel grid or polygon mesh, can they effectively view-based descriptors? We address this context learning to recognize from a collection rendered views 2D images. first present standard CNN architecture trained shapes' independently each other, and show that shape recognized even single view at an accuracy far...

10.1109/iccv.2015.114 preprint EN 2015-12-01

SPLATNet: Sparse Lattice Networks for Point Cloud Processing

OPENALEX - Publications

Hang Su Varun Jampani Deqing Sun Subhransu Maji Evangelos Kalogerakis and 2 more

We present a network architecture for processing point clouds that directly operates on collection of points represented as sparse set samples in high-dimensional lattice. Naively applying convolutions this lattice scales poorly, both terms memory and computational cost, the size increases. Instead, our uses bilateral convolutional layers building blocks. These maintain efficiency by using indexing structures to apply only occupied parts lattice, allow flexible specifications structure...

10.1109/cvpr.2018.00268 preprint EN 2018-06-01

DINO: DETR with Improved DeNoising Anchor Boxes for End-to-End Object Detection

OPENALEX - Publications

Hao Zhang Feng Li Shilong Liu Lei Zhang Hang Su and 3 more

We present DINO (\textbf{D}ETR with \textbf{I}mproved de\textbf{N}oising anch\textbf{O}r boxes), a state-of-the-art end-to-end object detector. % in this paper. improves over previous DETR-like models performance and efficiency by using contrastive way for denoising training, mixed query selection method anchor initialization, look forward twice scheme box prediction. achieves $49.4$AP $12$ epochs $51.3$AP $24$ on COCO ResNet-50 backbone multi-scale features, yielding significant improvement...

10.48550/arxiv.2203.03605 preprint EN other-oa arXiv (Cornell University) 2022-01-01

KL-divergence regularized deep neural network adaptation for improved large vocabulary speech recognition

OPENALEX - Publications

Dong Yu Kaisheng Yao Hang Su Gang Li Frank Seide

We propose a novel regularized adaptation technique for context dependent deep neural network hidden Markov models (CD-DNN-HMMs). The CD-DNN-HMM has large output layer and many layers, each with thousands of neurons. huge number parameters in the makes challenging task, esp. when set is small. developed this paper adapts model conservatively by forcing senone distribution estimated from adapted to be close that unadapted model. This constraint realized adding Kullback-Leibler divergence...

10.1109/icassp.2013.6639201 article EN IEEE International Conference on Acoustics Speech and Signal Processing 2013-05-01

A probabilistic approach to spatiotemporal theme pattern mining on weblogs

OPENALEX - Publications

Qiaozhu Mei Chao Liu Hang Su ChengXiang Zhai

Mining subtopics from weblogs and analyzing their spatiotemporal patterns have applications in multiple domains. In this paper, we define the novel problem of mining theme propose a probabilistic approach to model subtopic themes simultaneously. The proposed discovers by (1) extracting common weblogs; (2) generating life cycles for each given location; (3) snapshots time period. Evolution can be discovered comparative analysis snapshots. Experiments on three different data sets show that...

10.1145/1135777.1135857 article EN 2006-05-23

Multi-view Convolutional Neural Networks for 3D Shape Recognition

OPENALEX - Publications

Hang Su Subhransu Maji Evangelos Kalogerakis Erik Learned-Miller

A longstanding question in computer vision concerns the representation of 3D shapes for recognition: should be represented with descriptors operating on their native formats, such as voxel grid or polygon mesh, can they effectively view-based descriptors? We address this context learning to recognize from a collection rendered views 2D images. first present standard CNN architecture trained shapes' independently each other, and show that shape recognized even single view at an accuracy far...

10.48550/arxiv.1505.00880 preprint EN other-oa arXiv (Cornell University) 2015-01-01

Pixel-Adaptive Convolutional Neural Networks

OPENALEX - Publications

Hang Su Varun Jampani Deqing Sun Orazio Gallo Erik Learned-Miller and 1 more

Convolutions are the fundamental building blocks of CNNs. The fact that their weights spatially shared is one main reasons for widespread use, but it also a major limitation, as makes convolutions content-agnostic. We propose pixel-adaptive convolution (PAC) operation, simple yet effective modification standard convolutions, in which filter multiplied with varying kernel depends on learnable, local pixel features. PAC generalization several popular filtering techniques and thus can be used...

10.1109/cvpr.2019.01142 article EN 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) 2019-06-01

DAB-DETR: Dynamic Anchor Boxes are Better Queries for DETR

OPENALEX - Publications

Shilong Liu Feng Li Hao Zhang Xiao Yang Xianbiao Qi and 3 more

We present in this paper a novel query formulation using dynamic anchor boxes for DETR (DEtection TRansformer) and offer deeper understanding of the role queries DETR. This new directly uses box coordinates as Transformer decoders dynamically updates them layer-by-layer. Using not only helps explicit positional priors to improve query-to-feature similarity eliminate slow training convergence issue DETR, but also allows us modulate attention map width height information. Such design makes it...

10.48550/arxiv.2201.12329 preprint EN other-oa arXiv (Cornell University) 2022-01-01

Improved Human–Robot Collaborative Control of Redundant Robot for Teleoperated Minimally Invasive Surgery

OPENALEX - Publications

Hang Su Chenguang Yang Giancarlo Ferrigno Elena De Momi

An improved human-robot collaborative control scheme is proposed in a teleoperated minimally invasive surgery scenario, based on hierarchical operational space formulation of seven-degree-of-freedom redundant robot. Redundancy exploited to guarantee remote center motion (RCM) constraint and provide compliant behavior for the medical staff. Based implemented framework, an RCM safe are applied nullspace achieve surgical tasks with interaction. Due physical interactions, safety accuracy may be...

10.1109/lra.2019.2897145 article EN IEEE Robotics and Automation Letters 2019-02-05

Constrained Multilegged Robot System Modeling and Fuzzy Control With Uncertain Kinematics and Dynamics Incorporating Foot Force Optimization

OPENALEX - Publications

Zhijun Li Shengtao Xiao Shuzhi Sam Ge Hang Su

This paper studies the optimal distribution of feet forces and control multilegged robots with uncertainties in both kinematics dynamics. First, a constrained dynamics for environment model are established by considering kinematic dynamic uncertainties. Under an external wrench robots, foot moments supporting legs can be formulated as quadratic programming problems subject to linear nonlinear constraints. The neurodynamics recurrent neural network is developed force optimization. For...

10.1109/tsmc.2015.2422267 article EN IEEE Transactions on Systems Man and Cybernetics Systems 2015-05-05

Improved recurrent neural network-based manipulator control with remote center of motion constraints: Experimental results

OPENALEX - Publications

Hang Su Yingbai Hu Hamid Reza Karimi Alois Knoll Giancarlo Ferrigno and 1 more

10.1016/j.neunet.2020.07.033 article EN Neural Networks 2020-08-11

Fuzzy-Torque Approximation-Enhanced Sliding Mode Control for Lateral Stability of Mobile Robot

OPENALEX - Publications

Jiehao Li Junzheng Wang Peng Hui Yingbai Hu Hang Su

Accurate path tracking and stability are the main challenges of lateral motion control in mobile robots, especially under situation with complex road conditions. The interaction force between robots external environment may cause interference, which should be considered to guarantee its performance dynamic uncertain environments. In this article, a flexible scheme is for developed wheel-legged robot, consists cubature Kalman algorithm evaluate centroid slip angle yaw rate. Furthermore, fuzzy...

10.1109/tsmc.2021.3050616 article EN IEEE Transactions on Systems Man and Cybernetics Systems 2021-01-28

Grounding DINO: Marrying DINO with Grounded Pre-Training for Open-Set Object Detection

OPENALEX - Publications

Shilong Liu Zhaoyang Zeng Tianhe Ren Feng Li Hao Zhang and 6 more

In this paper, we present an open-set object detector, called Grounding DINO, by marrying Transformer-based detector DINO with grounded pre-training, which can detect arbitrary objects human inputs such as category names or referring expressions. The key solution of detection is introducing language to a closed-set for concept generalization. To effectively fuse and vision modalities, conceptually divide into three phases propose tight fusion solution, includes feature enhancer,...

10.48550/arxiv.2303.05499 preprint EN other-oa arXiv (Cornell University) 2023-01-01

Toward Teaching by Demonstration for Robot-Assisted Minimally Invasive Surgery

OPENALEX - Publications

Hang Su Andrea Mariani Salih Ertug Ovur Arianna Menciassi Giancarlo Ferrigno and 1 more

Learning manipulation skills from open surgery provides more flexible access to the organ targets in abdomen cavity and this could make surgical robot working a highly intelligent friendly manner. Teaching by demonstration (TbD) is capable of transferring human humanoid robots employing active learning multiple demonstrated tasks. This work aims transfer motion demonstrations manipulators robot-assisted minimally invasive (RA-MIS) using TbD. However, kinematic constraint should be respected...

10.1109/tase.2020.3045655 article EN IEEE Transactions on Automation Science and Engineering 2021-01-07

Deep Neural Network Approach in Robot Tool Dynamics Identification for Bilateral Teleoperation

OPENALEX - Publications

Hang Su Wen Qi Chenguang Yang Juan Sandoval Giancarlo Ferrigno and 1 more

For bilateral teleoperation, the haptic feedback demands availability of accurate force information transmitted from remote site. Nevertheless, due to limitation size, sensor is usually attached outside patient's abdominal cavity for surgical operation. Hence, it measures not only interaction forces on tip but also tool dynamics. In this letter, a model-free based deep convolutional neural network (DCNN) structure proposed dynamics identification, which features fast computation and noise...

10.1109/lra.2020.2974445 article EN IEEE Robotics and Automation Letters 2020-02-18

A Smartphone-Based Adaptive Recognition and Real-Time Monitoring System for Human Activities

OPENALEX - Publications

Wen Qi Hang Su Andréa Aliverti

Human activity recognition (HAR) using smartphones provides significant healthcare guidance for telemedicine and long-term treatment. Machine learning deep (DL) techniques are widely utilized the scientific study of statistical models human behaviors. However, performance existing HAR platforms is limited by complex physical activity. In this article, we proposed an adaptive real-time monitoring system activities (Ada-HAR), which expected to identify more motions in dynamic situations. The...

10.1109/thms.2020.2984181 article EN IEEE Transactions on Human-Machine Systems 2020-04-24

Biometrics recognition using deep learning: a survey

OPENALEX - Publications

Shervin Minaee AmirAli Abdolrashidi Hang Su Mohammed Bennamoun David Zhang

10.1007/s10462-022-10237-x article EN Artificial Intelligence Review 2023-01-13

Recent Advancements in Agriculture Robots: Benefits and Challenges

OPENALEX - Publications

Chao Cheng Jun Fu Hang Su Luquan Ren

In the development of digital agriculture, agricultural robots play a unique role and confer numerous advantages in farming production. From invention first industrial 1950s, have begun to capture attention both research industry. Thanks recent advancements computer science, sensing, control approaches, experienced rapid evolution, relying on various cutting-edge technologies for different application scenarios. Indeed, significant refinements been achieved by integrating perception,...

10.3390/machines11010048 article EN cc-by Machines 2023-01-01

Human-in-the-Loop Control of Soft Exosuits Using Impedance Learning on Different Terrains

OPENALEX - Publications

Zhijun Li Xiang Li Qinjian Li Hang Su Zhen Kan and 1 more

Many previous works of soft wearable exoskeletons (exosuit) target at improving the human locomotion assistance, without considering impedance adaption to interact with unpredictable dynamics and external environment, preferably outside laboratory environments. This article proposes a novel hierarchical human-in-the-loop paradigm that aims produce suitable assistance powers for cable-driven lower limb exosuits aid ankle joint in pushing off ground. It includes two primary loop layers:...

10.1109/tro.2022.3160052 article EN IEEE Transactions on Robotics 2022-04-22

A Cybertwin Based Multimodal Network for ECG Patterns Monitoring Using Deep Learning

OPENALEX - Publications

Wen Qi Hang Su

In next-generation network architecture, the Cybertwin drove sixth generation of cellular networks sixth-generation (6G) to play an active role in many applications, such as healthcare and computer vision. Although previous (5G) provides concept edge cloud core cloud, internal communication mechanism has not been explained with a specific application. This article introduces possible based multimodal (beyond 5G) for electrocardiogram (ECG) patterns monitoring during daily activity. paradigm...

10.1109/tii.2022.3159583 article EN IEEE Transactions on Industrial Informatics 2022-03-16

ProlificDreamer: High-Fidelity and Diverse Text-to-3D Generation with Variational Score Distillation

OPENALEX - Publications

Zhengyi Wang Cheng Lu Yikai Wang Fan Bao Chongxuan Li and 2 more

Score distillation sampling (SDS) has shown great promise in text-to-3D generation by distilling pretrained large-scale text-to-image diffusion models, but suffers from over-saturation, over-smoothing, and low-diversity problems. In this work, we propose to model the 3D parameter as a random variable instead of constant SDS present variational score (VSD), principled particle-based framework explain address aforementioned issues generation. We show that is special case VSD leads poor samples...

10.48550/arxiv.2305.16213 preprint EN other-oa arXiv (Cornell University) 2023-01-01

Fuzzy Approximation-Based Task-Space Control of Robot Manipulators With Remote Center of Motion Constraint

OPENALEX - Publications

Hang Su Wen Qi Jiahao Chen Dandan Zhang

The presence of unknown physical interaction between the patients’ body and surgical tool in laparoscopic surgery requires a secure end-effector positioning while assuring reliable constraint motion. In this work, task-space control approach based on fuzzy approximation is proposed for teleoperated scenario utilizing serial redundant robot manipulator (7 degrees freedom), motions which are constrained with respect to point known as remote center motion (RCM). dynamical uncertainties due...

10.1109/tfuzz.2022.3157075 article EN IEEE Transactions on Fuzzy Systems 2022-03-07

Pneumatic Soft Robots: Challenges and Benefits

OPENALEX - Publications

Hang Su Xu Hou Xin Zhang Wen Qi Shuting Cai and 2 more

In the field of robotics, soft robots have been showing great potential in areas medical care, education, service, rescue, exploration, detection, and wearable devices due to their inherently high flexibility, good compliance, excellent adaptability, natural safe interactivity. Pneumatic occupy an essential position among because features such as lightweight, efficiency, non-pollution, environmental adaptability. Thanks its mentioned benefits, increasing research interests attracted...

10.3390/act11030092 article EN cc-by Actuators 2022-03-16

An adaptive reinforcement learning-based multimodal data fusion framework for human–robot confrontation gaming

OPENALEX - Publications

Wen Qi Haoyu Fan Hamid Reza Karimi Hang Su

Playing games between humans and robots have become a widespread human-robot confrontation (HRC) application. Although many approaches were proposed to enhance the tracking accuracy by combining different information, problems of intelligence degree robot anti-interference ability motion capture system still need be solved. In this paper, we present an adaptive reinforcement learning (RL) based multimodal data fusion (AdaRL-MDF) framework teaching hand play Rock-Paper-Scissors (RPS) game...

10.1016/j.neunet.2023.04.043 article EN cc-by Neural Networks 2023-05-06