NFDI4DS | UHH-SEMS - Publication Details

Chang Liu

ORCID: 0009-0003-1751-6206

Publications

Citations

Views

---

Saved

---

About

Contact & Profiles

A5114889359

Research Areas

Generative Adversarial Networks and Image Synthesis
Advanced Vision and Imaging
Computer Graphics and Visualization Techniques
Metallurgy and Material Forming
Advanced Neural Network Applications
Image and Video Stabilization
Handwritten Text Recognition Techniques
Human Motion and Animation
Nanofabrication and Lithography Techniques
3D Shape Modeling and Analysis
Video Surveillance and Tracking Methods
Model Reduction and Neural Networks
Industrial Vision Systems and Defect Detection
Advanced Image and Video Retrieval Techniques
Gaussian Processes and Bayesian Inference
Video Analysis and Summarization

University of Science and Technology of China
2023-2024

Towards Interactive Image Inpainting via Robust Sketch Refinement

OPENALEX - Publications

Chang Liu Shunxin Xu Jialun Peng Kaidong Zhang Dong Liu

One tough problem of image inpainting is to restore complex structures in the corrupted regions. It motivates interactive which leverages additional hints, e.g., sketches, assist process. A sketch simple and intuitive for end users provide, but meanwhile has free forms with much randomness. Such randomness may confuse models, incur severe artifacts completed images. To better facilitate guidance, we propose a two-stage system, termed SketchRefiner. The first stage our approach serves as data...

10.1109/tmm.2024.3402620 article EN IEEE Transactions on Multimedia 2024-01-01

LSVOS Challenge Report: Large-scale Complex and Long Video Object Segmentation

OPENALEX - Publications

Henghui Ding Lingyi Hong Chang Liu Ning Xu Linjie Yang and 28 more

Despite the promising performance of current video segmentation models on existing benchmarks, these still struggle with complex scenes. In this paper, we introduce 6th Large-scale Video Object Segmentation (LSVOS) challenge in conjunction ECCV 2024 workshop. This year's includes two tasks: (VOS) and Referring (RVOS). year, replace classic YouTube-VOS YouTube-RVOS benchmark latest datasets MOSE, LVOS, MeViS to assess VOS under more challenging environments. attracted 129 registered teams...

10.48550/arxiv.2409.05847 preprint EN arXiv (Cornell University) 2024-09-09

StableV2V: Stablizing Shape Consistency in Video-to-Video Editing

OPENALEX - Publications

Chang Liu Rui Li Kaidong Zhang Yu‐Ju Lan Dong Liu

Recent advancements of generative AI have significantly promoted content creation and editing, where prevailing studies further extend this exciting progress to video editing. In doing so, these mainly transfer the inherent motion patterns from source videos edited ones, results with inferior consistency user prompts are often observed, due lack particular alignments between delivered motions contents. To address limitation, we present a shape-consistent editing method, namely StableV2V, in...

10.48550/arxiv.2411.11045 preprint EN arXiv (Cornell University) 2024-11-17

NeRF Inpainting with Geometric Diffusion Prior and Balanced Score Distillation

OPENALEX - Publications

M Zhang Xin Luo Yu‐Ju Lan Chang Liu Rui Li and 3 more

Recent advances in NeRF inpainting have leveraged pretrained diffusion models to enhance performance. However, these methods often yield suboptimal results due their ineffective utilization of 2D priors. The limitations manifest two critical aspects: the inadequate capture geometric information by and guidance provided existing Score Distillation Sampling (SDS) methods. To address problems, we introduce GB-NeRF, a novel framework that enhances through improved Our approach incorporates key...

10.48550/arxiv.2411.15551 preprint EN arXiv (Cornell University) 2024-11-23

UnZipLoRA: Separating Content and Style from a Single Image

OPENALEX - Publications

Chang Liu Viraj Shah Aiyu Cui Svetlana Lazebnik

This paper introduces UnZipLoRA, a method for decomposing an image into its constituent subject and style, represented as two distinct LoRAs (Low-Rank Adaptations). Unlike existing personalization techniques that focus on either or style in isolation, require separate training sets each, UnZipLoRA disentangles these elements from single by both the simultaneously. ensures resulting are compatible, i.e., they can be seamlessly combined using direct addition. enables independent manipulation...

10.48550/arxiv.2412.04465 preprint EN arXiv (Cornell University) 2024-12-05

AI-Empowered Persuasive Video Generation: A Survey

OPENALEX - Publications

Chang Liu Han Yu

Promotional videos are rapidly becoming a popular medium for persuading people to change their behaviours in many settings (e.g., online shopping, social enterprise initiatives). Today, such often produced by professionals, which is time-, labour- and cost-intensive undertaking. In order produce contents support large applications e-commerce), the field of artificial intelligence (AI)-empowered persuasive video generation (AIPVG) has gained traction recent years. This interdisciplinary...

10.48550/arxiv.2112.09401 preprint EN cc-by arXiv (Cornell University) 2021-01-01

Towards Interactive Image Inpainting via Sketch Refinement

OPENALEX - Publications

Chang Liu Shunxin Xu Jialun Peng Kaidong Zhang Dong Liu

One tough problem of image inpainting is to restore complex structures in the corrupted regions. It motivates interactive which leverages additional hints, e.g., sketches, assist process. Sketch simple and intuitive end users, but meanwhile has free forms with much randomness. Such randomness may confuse models, incur severe artifacts completed images. To address this problem, we propose a two-stage method termed SketchRefiner. In first stage, using cross-correlation loss function robustly...

10.48550/arxiv.2306.00407 preprint EN other-oa arXiv (Cornell University) 2023-01-01

Distilling Representations from GAN Generator via Squeeze and Span

OPENALEX - Publications

Yu Yang Xiaotian Cheng Chang Liu Hakan Bilen Xiangyang Ji

In recent years, generative adversarial networks (GANs) have been an actively studied topic and shown to successfully produce high-quality realistic images in various domains. The controllable synthesis ability of GAN generators suggests that they maintain informative, disentangled, explainable image representations, but leveraging transferring their representations downstream tasks is largely unexplored. this paper, we propose distill knowledge from by squeezing spanning representations. We...

10.48550/arxiv.2211.03000 preprint EN cc-by-nc-nd arXiv (Cornell University) 2022-01-01

Coming Soon ...