NFDI4DS | UHH-SEMS - Publication Details

GAMMA challenge: Glaucoma grAding from Multi-Modality imAges

OPENALEX - Publications

Junde Wu Huihui Fang Fei Li Huazhu Fu Fengbin Lin and 24 more

10.1016/j.media.2023.102938 article EN Medical Image Analysis 2023-09-18

Intention Understanding in Human–Robot Interaction Based on Visual-NLP Semantics

OPENALEX - Publications

Zhihao Li Yishan Mu Zhenglong Sun Sifan Song Jionglong Su and 1 more

With the rapid development of robotic and AI technology in recent years, human–robot interaction has made great advancement, making practical social impact. Verbal commands are one most direct frequently used means for interaction. Currently, such can enable robots to execute pre-defined tasks based on simple explicit language instructions, e.g., certain keywords must be detected. However, that is not natural way human communicate. In this paper, we propose a novel task-based framework robot...

10.3389/fnbot.2020.610139 article EN cc-by Frontiers in Neurorobotics 2021-02-02

FECTS: A Facial Emotion Cognition and Training System for Chinese Children with Autism Spectrum Disorder

OPENALEX - Publications

Guobin Wan Fuhao Deng Zijian Jiang Sifan Song Di Hu and 7 more

Traditional training methods such as card teaching, assistive technologies (e.g., augmented reality/virtual reality games and smartphone apps), DVDs, human-computer interactions, human-robot interactions are widely applied in autistic rehabilitation recent years. In this article, we propose a novel framework for human-computer/robot interaction introduce preliminary intervention study improving the emotion recognition of Chinese children with an autism spectrum disorder. The core is Facial...

10.1155/2022/9213526 article EN cc-by Computational Intelligence and Neuroscience 2022-04-27

Implicit Image-to-Image Schrödinger Bridge for image restoration

OPENALEX - Publications

Yuang Wang Siyeop Yoon Pengrong Jin Matthew Tivnan Sifan Song and 6 more

10.1016/j.patcog.2025.111627 article EN Pattern Recognition 2025-04-01

Distortion-Disentangled Contrastive Learning

OPENALEX - Publications

Jinfeng Wang Sifan Song Jionglong Su S. Kevin Zhou

Self-supervised learning is well known for its remarkable performance in representation and various downstream computer vision tasks. Recently, Positive-pair-Only Contrastive Learning (POCL) has achieved reliable without the need to construct positive-negative training sets. It reduces memory requirements by lessening dependency on batch size. The POCL method typically uses a single objective function extract distortion invariant (DIR) which describes proximity of positive-pair...

10.1109/wacv57701.2024.00015 article EN 2022 IEEE/CVF Winter Conference on Applications of Computer Vision (WACV) 2024-01-03

Chromosome Classification with Convolutional Neural Network Based Deep Learning

OPENALEX - Publications

Wenbo Zhang Sifan Song Tianming Bai Yanxin Zhao Fei Ma and 2 more

Karyotyping plays a crucial role in genetic disorder diagnosis. Currently requires considerable manual efforts, domain expertise and experience, is very time consuming. Automating the karyotyping process has been an important popular task. This study focuses on classification of chromosomes into 23 types, step towards fully automatic karyotyping. proposes convolutional neural network (CNN) based deep learning to automatically classify chromosomes. The proposed method was trained tested...

10.1109/cisp-bmei.2018.8633228 article EN 2021 14th International Congress on Image and Signal Processing, BioMedical Engineering and Informatics (CISP-BMEI) 2018-10-01

A Framework of Hierarchical Deep Q-Network for Portfolio Management

OPENALEX - Publications

Yuan Gao Ziming Gao Yi Hu Sifan Song Zhengyong Jiang and 1 more

10.5220/0010233201320140 article EN cc-by-nc-nd Proceedings of the 14th International Conference on Agents and Artificial Intelligence 2021-01-01

A New Convolutional Neural Network Architecture for Automatic Segmentation of Overlapping Human Chromosomes

OPENALEX - Publications

Sifan Song Tianming Bai Yanxin Zhao Wenbo Zhang Chunxiao Yang and 3 more

10.1007/s11063-021-10629-0 article EN Neural Processing Letters 2021-09-06

From deterministic to stochastic: an interpretable stochastic model-free reinforcement learning framework for portfolio optimization

OPENALEX - Publications

Zitao Song Yining Wang Pin Qian Sifan Song Frans Coenen and 2 more

10.1007/s10489-022-04217-5 article EN Applied Intelligence 2022-11-11

ProMISe: Promptable Medical Image Segmentation using SAM

OPENALEX - Publications

Jinfeng Wang Sifan Song Xinkun Wang Yiyi Wang Yiyi Miao and 2 more

With the proposal of Segment Anything Model (SAM), fine-tuning SAM for medical image segmentation (MIS) has become popular. However, due to large size model and significant domain gap between natural images, fine-tuning-based strategies are costly with potential risk instability, feature damage catastrophic forgetting. Furthermore, some methods transferring a domain-specific MIS through disable model's prompting capability, severely limiting its utilization scenarios. In this paper, we...

10.48550/arxiv.2403.04164 preprint EN arXiv (Cornell University) 2024-03-06

DualStreamFoveaNet: A Dual Stream Fusion Architecture With Anatomical Awareness for Robust Fovea Localization

OPENALEX - Publications

Sifan Song Jinfeng Wang Zilong Wang Hongxing Wang Jionglong Su and 2 more

Accurate fovea localization is essential for analyzing retinal diseases to prevent irreversible vision loss. While current deep learning-based methods outperform traditional ones, they still face challenges such as the lack of local anatomical landmarks around fovea, inability robustly handle diseased images, and variations in image conditions. In this paper, we propose a novel transformer-based architecture called DualStreamFoveaNet (DSFN) multi-cue fusion. This explicitly incorporates...

10.1109/jbhi.2024.3445112 article EN IEEE Journal of Biomedical and Health Informatics 2024-08-16

Retrieval Instead of Fine-tuning: A Retrieval-based Parameter Ensemble for Zero-shot Learning

OPENALEX - Publications

Pengfei Jin Peng Shu Sekeun Kim Qing Xiao Sifan Song and 4 more

Foundation models have become a cornerstone in deep learning, with techniques like Low-Rank Adaptation (LoRA) offering efficient fine-tuning of large models. Similarly, methods such as Retrieval-Augmented Generation (RAG), which leverage vectorized databases, further improved model performance by grounding outputs external information. While these approaches demonstrated notable success, they often require extensive training or labeled data, can limit their adaptability resource-constrained...

10.48550/arxiv.2410.09908 preprint EN arXiv (Cornell University) 2024-10-13

RC-Net: Regression Correction for End-To-End Chromosome Instance Segmentation

OPENALEX - Publications

Hui Liu Guangjie Wang Sifan Song Daiyun Huang Lin Zhang

Precise segmentation of chromosome in the real image achieved by a microscope is significant for karyotype analysis. The usually pixel-level classification task, which considers different instances as classes. Many instance methods predict Intersection over Union (IoU) through head branch to correct confidence. Their effectiveness based on correlation between tasks. However, none these consider input and output Herein, we propose network regression correction. First, adopt two branches...

10.3389/fgene.2022.895099 article EN cc-by Frontiers in Genetics 2022-05-18

A Robust Framework of Chromosome Straightening With Vit-Patch Gan

OPENALEX - Publications

Sifan Song Jinfeng Wang Fengrui Cheng Qirui Cao Yi-Han Zuo and 7 more

Chromosomes carry the genetic information of humans. They exhibit non-rigid and non-articulated nature with varying degrees curvature. Chromosome straightening is an important step for subsequent karyotype construction, pathological diagnosis cytogenetic map development. However, robust chromosome remains challenging, due to unavailability training images, distorted details shapes after straightening, as well poor generalization capability. In this paper, we propose a novel architecture,...

10.1109/isbi53787.2023.10230388 article EN 2022 IEEE 19th International Symposium on Biomedical Imaging (ISBI) 2023-04-18

Image Captioning in Chinese and Its Application for Children with Autism Spectrum Disorder

OPENALEX - Publications

Bin Zhang Lixin Zhou Sifan Song Chen Li-fu Zijian Jiang and 1 more

This research looks into the applications of image captioning in Chinese. In order to improve abilities children with Autism Spectrum Disorder (ASD) spontaneous language and turn-taking, during interacting them, rehabilitation robots are used track their attention describe attractive objects scenes. method has three advantages. First, may attract ASD by describing that interest them. Second, ability which is beneficial enhancing superior expressions social communication or human beings....

10.1145/3383972.3384072 article EN 2020-02-15

Bilateral-ViT For Robust Fovea Localization

OPENALEX - Publications

Sifan Song Kang Dang Qinji Yu Zilong Wang Frans Coenen and 2 more

The fovea is an important anatomical landmark of the retina. Detecting location essential for analysis many retinal diseases. However, robust localization remains a challenging problem, as region often appears fuzzy, and retina diseases may further obscure its appearance. This paper proposes novel Vision Transformer (ViT) approach that integrates information both inside outside to achieve localization. Our proposed network, named Bilateral-Vision-Transformer (Bilateral-ViT), consists two...

10.1109/isbi52829.2022.9761523 article EN 2022 IEEE 19th International Symposium on Biomedical Imaging (ISBI) 2022-03-28

A Novel Application of Image-to-Image Translation: Chromosome Straightening Framework by Learning from a Single Image

OPENALEX - Publications

Sifan Song Daiyun Huang Yalun Hu Chunxiao Yang Jia Meng and 4 more

In medical imaging, chromosome straightening plays a significant role in the pathological study of chromosomes and development cytogenetic maps. Whereas different approaches exist for task, typically geometric algorithms are used whose outputs characterized by jagged edges or fragments with discontinued banding patterns. To address flaws algorithms, we propose novel framework based on image-to-image translation to learn pertinent mapping dependence synthesizing straightened uninterrupted...

10.1109/cisp-bmei53629.2021.9624383 article EN 2021 14th International Congress on Image and Signal Processing, BioMedical Engineering and Informatics (CISP-BMEI) 2021-10-23

An Event-Triggered Low-Cost Tactile Perception System for Social Robot’s Whole Body Interaction

OPENALEX - Publications

Shengzhao Lin Jionglong Su Sifan Song Jiaming Zhang

The social interaction is one of the necessary skills for robots to better integrate into human society. However, current interact mainly through audio and visual means with little reliance on haptic interaction. There still exist many obstacles touch: 1) complex manufacturing process tactile sensor array main obstacle lowering cost production; 2) mode diverse. are no robot standards data sets interactive behavior in public domain. In view this, our research looks following aspects...

10.1109/access.2021.3053117 article EN cc-by IEEE Access 2021-01-01

Contrastive Learning Via Equivariant Representation

OPENALEX - Publications

Sifan Song Jinfeng Wang Qiaochu Zhao Xiang Li Dufan Wu and 4 more

Invariant-based Contrastive Learning (ICL) methods have achieved impressive performance across various domains. However, the absence of latent space representation for distortion (augmentation)-related information in makes ICL sub-optimal regarding training efficiency and robustness downstream tasks. Recent studies suggest that introducing equivariance into (CL) can improve overall performance. In this paper, we rethink roles augmentation strategies improving CL efficacy. We propose a novel...

10.48550/arxiv.2406.00262 preprint EN arXiv (Cornell University) 2024-05-31

Implicit Image-to-Image Schrödinger Bridge for Image Restoration

OPENALEX - Publications

Yuang Wang Siyeop Yoon Pengfei Jin Matthew Tivnan Sifan Song and 6 more

10.2139/ssrn.5008071 preprint EN 2024-01-01

Learning Bionic Motions by Imitating Animals

OPENALEX - Publications

Da Zhao Sifan Song Jionglong Su Zijian Jiang Jiaming Zhang

Motion control algorithms for quadruped robots undergo rapid development in recent years. Interactive have demonstrated they may positively enhance the effect of psychotherapy treatment patients with cognitive impairment, which requires them to more interactive capabilities than traditional robots. In this study, we focus on enabling imitate real animal motions extracted from videos, by design robotic motion controllers can be simplified and bionic degree enhanced. The capture data, however,...

10.1109/icma49215.2020.9233839 article EN 2022 IEEE International Conference on Mechatronics and Automation (ICMA) 2020-10-13

Say What You Are Looking At: An Attention-Based Interactive System for Autistic Children

OPENALEX - Publications

Furong Deng Yu Zhou Sifan Song Zijian Jiang Chen Li-fu and 3 more

Gaze-following is an effective way for intention understanding in human–robot interaction, which aims to follow the gaze of humans estimate what object being observed. Most existing methods require people and objects appear same image. Due limitation view camera, these are not applicable practice. To address this problem, we propose a method following that utilizes geometric map better estimation. With help map, competitive cross-frame On basis method, novel gaze-based image caption system,...

10.3390/app11167426 article EN cc-by Applied Sciences 2021-08-12

Bilateral-ViT for Robust Fovea Localization

OPENALEX - Publications

Sifan Song Kang Dang Qinji Yu Zilong Wang Frans Coenen and 2 more

The fovea is an important anatomical landmark of the retina. Detecting location essential for analysis many retinal diseases. However, robust localization remains a challenging problem, as region often appears fuzzy, and retina diseases may further obscure its appearance. This paper proposes novel Vision Transformer (ViT) approach that integrates information both inside outside to achieve localization. Our proposed network, named Bilateral-Vision-Transformer (Bilateral-ViT), consists two...

10.48550/arxiv.2110.09860 preprint EN other-oa arXiv (Cornell University) 2021-01-01

A Robust Framework of Chromosome Straightening with ViT-Patch GAN

OPENALEX - Publications

Sifan Song Jinfeng Wang Fengrui Cheng Qirui Cao Yi-Han Zuo and 7 more

Chromosomes carry the genetic information of humans. They exhibit non-rigid and non-articulated nature with varying degrees curvature. Chromosome straightening is an important step for subsequent karyotype construction, pathological diagnosis cytogenetic map development. However, robust chromosome remains challenging, due to unavailability training images, distorted details shapes after straightening, as well poor generalization capability. In this paper, we propose a novel architecture,...

10.48550/arxiv.2203.02901 preprint EN other-oa arXiv (Cornell University) 2022-01-01