Yitong Li

ORCID: 0009-0009-3874-6055
Research Areas
  • Multimodal Machine Learning Applications
  • Video Analysis and Summarization
  • Robotics and Sensor-Based Localization
  • Human Pose and Action Recognition
  • Advanced Image and Video Retrieval Techniques
  • Remote-Sensing Image Classification
  • Generative Adversarial Networks and Image Synthesis
  • Advanced Vision and Imaging
  • Software Engineering Research
  • Medical Imaging Techniques and Applications
  • Brain Tumor Detection and Classification
  • Bone and Joint Diseases
  • Topic Modeling
  • Advanced MRI Techniques and Applications
  • Optical measurement and interference techniques
  • Radiomics and Machine Learning in Medical Imaging
  • Infrastructure Maintenance and Monitoring
  • Software Testing and Debugging Techniques
  • Machine Learning in Healthcare
  • BIM and Construction Integration
  • Spine and Intervertebral Disc Pathology
  • Business Process Modeling and Analysis
  • Data Quality and Management
  • MRI in cancer diagnosis
  • Human Motion and Animation

Huazhong University of Science and Technology
2019-2025

Tongji Hospital
2019-2025

Chinese Academy of Medical Sciences & Peking Union Medical College
2025

State Key Laboratory of Information Engineering in Surveying, Mapping and Remote Sensing
2025

Wuhan University
2025

Technical University of Munich
2022-2025

Binzhou University
2025

Binzhou Medical University
2025

Beijing University of Technology
2024

Dalian Maritime University
2024

In this work, we propose a new task called Story Visualization. Given a multi-sentence paragraph, the story is visualized by generating a sequence of images, one for each sentence. In contrast to video generation, story visualization focuses less on continuity in the generated images (frames), but more on global consistency across dynamic scenes and characters -- a challenge that has not been addressed by any single-image or video generation methods. Therefore, we propose a story-to-image-sequence generation model, StoryGAN, based on the sequential...

10.1109/cvpr.2019.00649 article EN 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) 2019-06-01
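
The sequential conditional GAN idea described above can be pictured as a recurrent context that is updated once per sentence and drives a per-step image generator. Below is a minimal PyTorch-style sketch of that structure only; the module names, dimensions, and the toy linear image head are illustrative assumptions, not the paper's implementation.

```python
import torch
import torch.nn as nn

class SequentialStoryGenerator(nn.Module):
    """Toy sketch: one image per sentence, conditioned on a running story
    context (names and dimensions are assumptions, not the paper's code)."""
    def __init__(self, sent_dim=128, ctx_dim=128, noise_dim=64, img_size=64):
        super().__init__()
        self.noise_dim = noise_dim
        self.img_size = img_size
        self.context_rnn = nn.GRUCell(sent_dim + noise_dim, ctx_dim)
        # stand-in for the convolutional image generator in the real model
        self.image_head = nn.Sequential(
            nn.Linear(ctx_dim, 3 * img_size * img_size), nn.Tanh()
        )

    def forward(self, sentence_embs):                       # (B, T, sent_dim)
        b, t, _ = sentence_embs.shape
        h = sentence_embs.new_zeros(b, self.context_rnn.hidden_size)
        frames = []
        for i in range(t):                                  # one step per sentence
            z = torch.randn(b, self.noise_dim)              # per-step noise for diversity
            h = self.context_rnn(torch.cat([sentence_embs[:, i], z], dim=1), h)
            frames.append(self.image_head(h).view(b, 3, self.img_size, self.img_size))
        return torch.stack(frames, dim=1)                   # (B, T, 3, H, W)

# usage: a 2-story batch with 5 sentence embeddings each
images = SequentialStoryGenerator()(torch.randn(2, 5, 128))
```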

Event-specific concepts are the semantic concepts specifically designed for events of interest, which can be used as a mid-level representation of complex events in videos. Existing methods only focus on defining event-specific concepts for a small number of pre-defined events, but cannot handle novel unseen events. This motivates us to build a large-scale concept library that covers as many real-world events and their concepts as possible. Specifically, we choose WikiHow, an online forum containing how-to articles on human daily life. We perform...

10.1145/2733373.2806221 article EN 2015-10-13
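
As a rough illustration of harvesting event-specific concepts from how-to articles, the toy sketch below strips the "How to" prefix from article titles to form candidate event phrases and groups simple word-level concepts under each one. The titles are hypothetical, and the real pipeline is far richer (parsing, filtering, and a coarse-to-fine hierarchy).

```python
import re
from collections import defaultdict

# Toy how-to titles standing in for scraped WikiHow articles (hypothetical data).
titles = [
    "How to Change a Flat Tire",
    "How to Bake a Birthday Cake",
    "How to Change a Bicycle Tire",
]

def title_to_event(title):
    """Strip the 'How to' prefix to get a candidate event phrase."""
    return re.sub(r"^how to\s+", "", title.lower()).strip()

# Group candidate concepts (here: individual words) under each event phrase.
library = defaultdict(set)
for t in titles:
    event = title_to_event(t)
    for word in event.split():
        library[event].add(word)

for event, concepts in library.items():
    print(event, "->", sorted(concepts))
```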

Generating videos from text has proven to be a significant challenge for existing generative models. We tackle this problem by training a conditional generative model to extract both static and dynamic information from the text. This is manifested in a hybrid framework, employing a Variational Autoencoder (VAE) and a Generative Adversarial Network (GAN). The static features, called "gist," are used to sketch the text-conditioned background color and object layout structure. Dynamic features are considered by transforming the input text into an image filter....

10.1609/aaai.v32i1.12233 article EN Proceedings of the AAAI Conference on Artificial Intelligence 2018-04-27
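
A schematic reading of the hybrid framework: a text-conditioned VAE proposes a static "gist" (background and layout), and a dynamic branch turns the text into per-frame transformations applied on top of it. The sketch below only mirrors that data flow; the dimensions, layers, and frame-modulation trick are assumptions, not the published architecture.

```python
import torch
import torch.nn as nn

class GistVAE(nn.Module):
    """Text-conditioned VAE that proposes a static 'gist' (background/layout);
    dimensions and layers are illustrative assumptions."""
    def __init__(self, text_dim=128, z_dim=32, img_size=32):
        super().__init__()
        self.img_size = img_size
        self.to_stats = nn.Linear(text_dim, 2 * z_dim)                 # -> (mu, logvar)
        self.decode = nn.Sequential(
            nn.Linear(z_dim + text_dim, 3 * img_size * img_size), nn.Tanh()
        )

    def forward(self, text):
        mu, logvar = self.to_stats(text).chunk(2, dim=1)
        z = mu + torch.randn_like(mu) * (0.5 * logvar).exp()           # reparameterisation
        gist = self.decode(torch.cat([z, text], dim=1))
        return gist.view(-1, 3, self.img_size, self.img_size), mu, logvar

class DynamicBranch(nn.Module):
    """Turns the text into simple per-frame modulations of the gist, a crude
    stand-in for the paper's text-to-filter transformation."""
    def __init__(self, text_dim=128, num_frames=8):
        super().__init__()
        self.num_frames = num_frames
        self.frame_codes = nn.Linear(text_dim, num_frames * 3)

    def forward(self, gist, text):
        scales = self.frame_codes(text).view(-1, self.num_frames, 3, 1, 1)
        return gist.unsqueeze(1) * (1 + scales)                        # (B, T, 3, H, W)

text = torch.randn(4, 128)
gist, mu, logvar = GistVAE()(text)
video = DynamicBranch()(gist, text)                                    # toy "video"
```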

Input constraints are useful for many software development tasks. For example, the input constraints of a function enable the generation of valid inputs, i.e., inputs that follow these constraints, to test the function deeper. API functions of deep learning (DL) libraries have DL-specific input constraints, which are described informally in free-form documentation. Existing constraint extraction techniques are ineffective at extracting these constraints. To fill this gap, we design and implement a new technique, DocTer, to analyze documentation and extract input constraints for DL API functions....

10.1145/3533767.3534220 preprint EN 2022-07-15
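
To make the constraint-extraction idea concrete, the sketch below pulls a few DL-relevant constraints (dimensionality, dtype, value range) from a snippet of free-form parameter documentation using simple regular expressions, then generates one conforming input per parameter. DocTer itself relies on rules over parsed documentation rather than ad-hoc regexes, so treat this as a simplified stand-in; the sample doc string is hypothetical.

```python
import re
import numpy as np

# Hypothetical free-form parameter documentation for some DL API function.
DOC = "x: A 2-D tensor of type float32. axis: An int in the range [0, 1]."

def extract_constraints(doc):
    """Pick out dimensionality, dtype, and range constraints per parameter."""
    cons = {}
    for name, desc in re.findall(r"(\w+):\s*([^.]+\.)", doc):
        c = {}
        if m := re.search(r"(\d+)-D", desc):
            c["ndim"] = int(m.group(1))
        if "float32" in desc:
            c["dtype"] = "float32"
        if m := re.search(r"range \[(-?\d+), (-?\d+)\]", desc):
            c["range"] = (int(m.group(1)), int(m.group(2)))
        cons[name] = c
    return cons

def generate_input(c, rng=np.random.default_rng(0)):
    """Produce one value that satisfies an extracted constraint dict."""
    if "ndim" in c:
        return rng.standard_normal((3,) * c["ndim"]).astype(c.get("dtype", "float64"))
    if "range" in c:
        lo, hi = c["range"]
        return int(rng.integers(lo, hi + 1))
    return None

constraints = extract_constraints(DOC)
valid_inputs = {name: generate_input(c) for name, c in constraints.items()}
print(constraints)
```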

10.1109/wacv61041.2025.00021 article EN 2025 IEEE/CVF Winter Conference on Applications of Computer Vision (WACV) 2025-02-26

Most existing text-to-image synthesis tasks are static single-turn generation, based on pre-defined textual descriptions of images. To explore more practical and interactive real-life applications, we introduce a new task - Interactive Image Editing, where users can guide an agent to edit images via multi-turn commands on-the-fly. In each session, the agent takes a natural language description from the user as input and modifies the image generated in the previous turn to a new design, following the description. The main...

10.1145/3394171.3413551 article EN Proceedings of the 30th ACM International Conference on Multimedia 2020-10-12
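
The multi-turn protocol can be summarized as a loop in which each user instruction is applied to the canvas produced in the previous turn. The skeleton below shows only that session loop with a toy text-gated residual edit; the agent, its dimensions, and the instruction embeddings are placeholders, not the paper's model.

```python
import torch
import torch.nn as nn

class EditingAgent(nn.Module):
    """Skeleton of a multi-turn editor: takes the previous canvas and an
    instruction embedding, returns an updated canvas (toy stand-in only)."""
    def __init__(self, text_dim=128):
        super().__init__()
        self.fuse = nn.Conv2d(3, 3, kernel_size=3, padding=1)
        self.text_gate = nn.Linear(text_dim, 3)

    def forward(self, canvas, instruction_emb):
        gate = torch.sigmoid(self.text_gate(instruction_emb)).view(-1, 3, 1, 1)
        return canvas + gate * self.fuse(canvas)     # residual, text-gated edit

agent = EditingAgent()
canvas = torch.zeros(1, 3, 64, 64)                   # blank first canvas
for instruction_emb in torch.randn(3, 1, 128):       # three user turns
    canvas = agent(canvas, instruction_emb)          # each turn edits the last result
```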

Object pose estimation is crucial for robotic applications and augmented reality. Beyond instance-level 6D object pose estimation methods, estimating category-level pose and shape has become a promising trend. As such, a new research field needs to be supported by well-designed datasets. To provide a benchmark with high-quality ground truth annotations to the community, we introduce a multimodal dataset of photometrically challenging objects termed PhoCaL. PhoCaL comprises 60 high-quality 3D models of household objects over 8...

10.1109/cvpr52688.2022.02054 article EN 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) 2022-06-01
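
Benchmarks of this kind are typically scored with pose-error metrics. For orientation, the snippet below computes the standard geodesic rotation error and Euclidean translation error between a ground-truth and a predicted 6D pose; it is a generic illustration, not PhoCaL's evaluation toolkit.

```python
import numpy as np

def pose_error(R_gt, t_gt, R_pred, t_pred):
    """Geodesic rotation error (degrees) and Euclidean translation error
    between two rigid poses."""
    cos = (np.trace(R_pred @ R_gt.T) - 1.0) / 2.0
    rot_err_deg = np.degrees(np.arccos(np.clip(cos, -1.0, 1.0)))
    trans_err = np.linalg.norm(t_pred - t_gt)
    return rot_err_deg, trans_err

# toy example: a 10-degree rotation about z and a 5 mm translation offset
theta = np.radians(10)
R = np.array([[np.cos(theta), -np.sin(theta), 0],
              [np.sin(theta),  np.cos(theta), 0],
              [0, 0, 1]])
print(pose_error(np.eye(3), np.zeros(3), R, np.array([0.005, 0, 0])))
```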

10.1016/j.aei.2024.102427 article EN publisher-specific-oa Advanced Engineering Informatics 2024-02-27

Despite recent advances in medical image generation, existing methods struggle to produce anatomically plausible 3D structures. In synthetic brain magnetic resonance images (MRIs), characteristic fissures are often missing, and reconstructed cortical surfaces appear scattered rather than densely convoluted. To address this issue, we introduce Cor2Vox, the first diffusion model-based method that translates continuous shape priors to MRIs. To achieve this, we leverage a Brownian bridge process which...

10.48550/arxiv.2502.12742 preprint EN arXiv (Cornell University) 2025-02-18
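
The Brownian bridge mentioned above interpolates between two endpoints with noise that vanishes at both ends. The sketch below samples the forward marginal of such a bridge between a shape-prior volume and a target volume; the linear mean and the 2·m·(1−m) variance schedule follow common bridge-diffusion formulations and are assumptions, not necessarily Cor2Vox's exact schedule.

```python
import numpy as np

def brownian_bridge_sample(x0, y, t, T=1000, rng=np.random.default_rng(0)):
    """Forward marginal of a discrete Brownian bridge between a target volume
    x0 and a shape-prior volume y (schematic variance schedule)."""
    m = t / T
    mean = (1.0 - m) * x0 + m * y
    var = 2.0 * m * (1.0 - m)            # peaks mid-bridge, zero at both ends
    return mean + np.sqrt(var) * rng.standard_normal(x0.shape)

x0 = np.zeros((8, 8, 8))                 # toy target volume
y = np.ones((8, 8, 8))                   # toy shape-prior volume
xt = brownian_bridge_sample(x0, y, t=500)
```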

Learning-based methods to solve dense 3D vision problems typically train on sensor data. The respectively used principle of measuring distances provides advantages and drawbacks. These are not compared nor discussed in the literature due to a lack of multi-modal datasets. Texture-less regions are problematic for structure from motion and stereo, reflective material poses issues for active sensing, and translucent objects are intricate to measure with existing hardware. Training on inaccurate or corrupt data induces model...

10.1109/cvpr52729.2023.00082 article EN 2023 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) 2023-06-01
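
Comparing depth from different sensing principles usually comes down to a handful of error metrics evaluated on valid pixels. The snippet below computes MAE, RMSE, and the δ<1.25 accuracy on toy data; it is a generic evaluation sketch, not the paper's benchmark code.

```python
import numpy as np

def depth_metrics(pred, gt, valid=None):
    """Common depth-error metrics on valid pixels: MAE, RMSE, delta<1.25."""
    if valid is None:
        valid = gt > 0                    # treat zero depth as missing
    p, g = pred[valid], gt[valid]
    mae = np.mean(np.abs(p - g))
    rmse = np.sqrt(np.mean((p - g) ** 2))
    delta = np.mean(np.maximum(p / g, g / p) < 1.25)
    return mae, rmse, delta

gt = np.random.uniform(0.5, 5.0, (240, 320))      # toy ground-truth depth (m)
pred = gt + np.random.normal(0, 0.05, gt.shape)   # toy sensor/model output
print(depth_metrics(pred, gt))
```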

The advantages of microstrip patch antennas include small size, adaptable surface, ease of fabrication, and compatibility with integrated circuit technology. Numerous experiments have been done over the past few decades to enhance the performance of this antenna, and both military and commercial sectors have found many uses for it. This paper introduces a microstrip patch antenna with an operating frequency of 28 GHz for 5G mobile communication. The research designed and simulated a rectangular antenna of 3.494 mm × 5.3 mm × 0.003 mm. The proposed antenna resonates at 28...

10.1088/1742-6596/2580/1/012063 article EN Journal of Physics Conference Series 2023-09-01
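
Patch dimensions like those reported above can be sanity-checked against the standard transmission-line design equations for a rectangular microstrip patch. The sketch below evaluates those textbook formulas at 28 GHz; the substrate permittivity and height are assumed values, not necessarily those used in the paper.

```python
import math

def patch_dimensions(f_r, eps_r, h):
    """Textbook transmission-line model for a rectangular microstrip patch:
    returns patch width W and length L in metres."""
    c = 3e8
    W = c / (2 * f_r) * math.sqrt(2 / (eps_r + 1))
    eps_eff = (eps_r + 1) / 2 + (eps_r - 1) / 2 * (1 + 12 * h / W) ** -0.5
    dL = 0.412 * h * ((eps_eff + 0.3) * (W / h + 0.264)) / \
         ((eps_eff - 0.258) * (W / h + 0.8))
    L = c / (2 * f_r * math.sqrt(eps_eff)) - 2 * dL
    return W, L

# 28 GHz patch on a thin low-permittivity substrate (eps_r and h are assumed)
W, L = patch_dimensions(28e9, 2.2, 0.254e-3)
print(f"W = {W*1e3:.3f} mm, L = {L*1e3:.3f} mm")
```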

Event-specific concepts are the semantic concepts designed for events of interest, which can be used as a mid-level representation of complex events in videos. Existing methods only focus on defining event-specific concepts for a small number of predefined events, but cannot handle novel unseen events. This motivates us to build a large-scale concept library that covers as many real-world events and their concepts as possible. Specifically, we choose WikiHow, an online forum containing how-to articles on human daily life. We perform a coarse-to-fine event...

10.48550/arxiv.1506.02328 preprint EN other-oa arXiv (Cornell University) 2015-01-01