- Advanced Image Processing Techniques
- Advanced Vision and Imaging
- Image and Signal Denoising Methods
- Image Enhancement Techniques
- Generative Adversarial Networks and Image Synthesis
- Advanced Data Compression Techniques
- Video Coding and Compression Technologies
- Image Processing Techniques and Applications
- Advanced Neural Network Applications
- Domain Adaptation and Few-Shot Learning
- Adversarial Robustness in Machine Learning
- Optical Measurement and Interference Techniques
- Video Analysis and Summarization
- Computer Graphics and Visualization Techniques
- Color Science and Applications
- Robotics and Sensor-Based Localization
- Face Recognition and Analysis
- Digital Holography and Microscopy
- Advanced Image and Video Retrieval Techniques
- Image and Video Quality Assessment
- 3D Shape Modeling and Analysis
- Face and Expression Recognition
- 3D Surveying and Cultural Heritage
- Advanced Numerical Methods in Computational Mathematics
- Digital Media Forensic Detection
Walt Disney (United States)
2014-2025
Walt Disney (Switzerland)
2018-2024
Board of the Swiss Federal Institutes of Technology
2018
ETH Zurich
2018
Saarland University
2012-2016
This paper reviews the 2nd NTIRE challenge on single image super-resolution (restoration of rich details in a low resolution image) with a focus on the proposed solutions and results. The challenge had 4 tracks. Track 1 employed the standard bicubic downscaling setup, while Tracks 2, 3 and 4 used realistic unknown downgrading operators simulating the camera acquisition pipeline. The operators were learnable through provided pairs of low and high resolution train images. The tracks had 145, 114, 101, and 113 registered participants, resp., and 31 teams competed in the final testing...
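For reference, the Track 1 bicubic degradation can be reproduced in a few lines. The sketch below is only an illustrative pairing script, assuming a x4 scale factor and a placeholder file name, not code from the challenge.

```python
# Minimal sketch of the bicubic downscaling setup (x4 scale assumed;
# the file name is a placeholder, not challenge data).
from PIL import Image

def make_lr_hr_pair(path, scale=4):
    hr = Image.open(path).convert("RGB")
    # Crop so the HR size is divisible by the scale factor.
    w, h = hr.size
    hr = hr.crop((0, 0, w - w % scale, h - h % scale))
    lr = hr.resize((hr.width // scale, hr.height // scale), Image.BICUBIC)
    return lr, hr

lr, hr = make_lr_hr_pair("example_hr.png")  # hypothetical input image
```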
Most recent semantic segmentation methods train deep convolutional neural networks with fully annotated masks requiring pixel-accuracy for good quality training. Common weakly-supervised approaches generate full masks from partial input (e.g. scribbles or seeds) using standard interactive segmentation methods as preprocessing. But, errors in such masks result in poorer training since standard loss functions (e.g. cross-entropy) do not distinguish seeds from potentially mislabeled other pixels. Inspired by the general ideas in semi-supervised...
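A common building block for training directly on such partial input is a cross-entropy evaluated only on the seed pixels. The following is a minimal PyTorch sketch of that idea, not the exact loss proposed in the paper.

```python
import torch
import torch.nn.functional as F

def partial_cross_entropy(logits, seeds, ignore_index=255):
    """Cross-entropy evaluated only on seed pixels.

    logits: (N, C, H, W) network outputs.
    seeds:  (N, H, W) integer labels, with `ignore_index` marking
            unlabeled pixels that do not contribute to the loss.
    """
    return F.cross_entropy(logits, seeds, ignore_index=ignore_index)

# Example: 2 classes, one labeled seed pixel per image, rest ignored.
logits = torch.randn(1, 2, 4, 4, requires_grad=True)
seeds = torch.full((1, 4, 4), 255, dtype=torch.long)
seeds[0, 1, 1] = 1
loss = partial_cross_entropy(logits, seeds)
loss.backward()
```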
Recent deep learning approaches to single image super-resolution have achieved impressive results in terms of traditional error measures and perceptual quality. However, in each case it remains challenging to achieve high quality results for large upsampling factors. To this end, we propose a method (ProSR) that is progressive both in architecture and training: the network upsamples an image in intermediate steps, while the learning process is organized from easy to hard, as is done in curriculum learning. To obtain more photorealistic results,...
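A minimal sketch of the progressive idea follows; the layer widths and pixel-shuffle blocks are illustrative placeholders rather than the actual ProSR architecture. The network emits an intermediate x2 output and a final x4 output, so training can proceed from the easier x2 stage to the harder x4 stage.

```python
import torch
import torch.nn as nn

class UpStage(nn.Module):
    """One x2 upsampling stage (illustrative, not the ProSR block)."""
    def __init__(self, channels=64):
        super().__init__()
        self.body = nn.Sequential(
            nn.Conv2d(channels, channels, 3, padding=1), nn.ReLU(inplace=True),
            nn.Conv2d(channels, channels * 4, 3, padding=1), nn.PixelShuffle(2),
        )
        self.to_rgb = nn.Conv2d(channels, 3, 3, padding=1)

    def forward(self, x):
        x = self.body(x)
        return x, self.to_rgb(x)

class ProgressiveSR(nn.Module):
    def __init__(self, channels=64):
        super().__init__()
        self.head = nn.Conv2d(3, channels, 3, padding=1)
        self.stage_x2 = UpStage(channels)
        self.stage_x4 = UpStage(channels)

    def forward(self, lr):
        feat = self.head(lr)
        feat, sr_x2 = self.stage_x2(feat)   # intermediate x2 prediction
        _, sr_x4 = self.stage_x4(feat)      # final x4 prediction
        return sr_x2, sr_x4                 # curriculum: supervise x2 first

sr_x2, sr_x4 = ProgressiveSR()(torch.randn(1, 3, 32, 32))
```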
Most approaches for video frame interpolation require accurate dense correspondences to synthesize an in-between frame. Therefore, they do not perform well in challenging scenarios with e.g. lighting changes or motion blur. Recent deep learning approaches that rely on kernels to represent motion can only alleviate these problems to some extent. In those cases, methods that use a per-pixel phase-based motion representation have been shown to work well. However, they are only applicable to a limited amount of motion. We propose a new approach,...
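The phase-based intuition can be illustrated on a toy 1D signal: a small shift corresponds to a linear change of Fourier phase per frequency, so an in-between sample can be obtained by interpolating phase. The numpy snippet below is only a simplification of the idea (phase-based interpolation methods operate on multi-scale, oriented decompositions of images, not a global FFT).

```python
import numpy as np

# Frame 0 and a slightly shifted frame 1 of a toy 1D "video".
x = np.linspace(0, 2 * np.pi, 128, endpoint=False)
f0, f1 = np.sin(x), np.sin(x - 0.3)

F0, F1 = np.fft.rfft(f0), np.fft.rfft(f1)
mag = 0.5 * (np.abs(F0) + np.abs(F1))                      # average magnitude
phase = np.angle(F0) + 0.5 * np.angle(F1 * np.conj(F0))    # halfway phase
mid = np.fft.irfft(mag * np.exp(1j * phase), n=128)

# Close to the true in-between frame sin(x - 0.15); only valid for small motion.
print(np.abs(mid - np.sin(x - 0.15)).max())
```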
While there are many deep learning based approaches for single image compression, the field of end-to-end learned video coding has remained much less explored. Therefore, in this work we present an inter-frame compression approach for neural video coding that can seamlessly build up on different existing neural codecs. Our solution performs temporal prediction by optical flow based motion compensation in pixel space. The key insight is that, to increase both decoding efficiency and reconstruction quality, the encoding required...
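The pixel-space temporal prediction step can be sketched as follows: backward-warp the previously decoded frame with an estimated optical flow field, so that only a residual remains to be encoded. The flow estimator and the residual codec are placeholders here; only the warping itself is shown.

```python
import torch
import torch.nn.functional as F

def warp(frame, flow):
    """Backward-warp `frame` (N,3,H,W) with a flow field (N,2,H,W) in pixels."""
    n, _, h, w = frame.shape
    ys, xs = torch.meshgrid(torch.arange(h), torch.arange(w), indexing="ij")
    grid = torch.stack((xs, ys), dim=0).float().to(frame)    # (2,H,W) pixel grid
    coords = grid.unsqueeze(0) + flow                         # sampling positions
    # Normalize coordinates to [-1, 1] as expected by grid_sample.
    coords_x = 2.0 * coords[:, 0] / (w - 1) - 1.0
    coords_y = 2.0 * coords[:, 1] / (h - 1) - 1.0
    grid_norm = torch.stack((coords_x, coords_y), dim=-1)     # (N,H,W,2)
    return F.grid_sample(frame, grid_norm, align_corners=True)

# Hypothetical usage inside an inter-frame codec:
# prediction = warp(previous_decoded, estimated_flow)
# residual   = current_frame - prediction   # only this is entropy coded
```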
Existing deep learning approaches to single image super-resolution have achieved impressive results but mostly assume a setting with fixed pairs of high resolution and low resolution images. However, to robustly address realistic upscaling scenarios where the relation between the two images is unknown, blind super-resolution is required. To this end, we propose a solution that relies on three components: First, we use a degradation aware SR network to synthesize the HR image given a low resolution image and the corresponding blur kernel. Second, we train a kernel discriminator to analyze...
In this paper, we propose an algorithm for fully automatic neural face swapping in images and videos. To the best of our knowledge, this is the first method capable of rendering photo-realistic and temporally coherent results at megapixel resolution. To this end, we introduce a progressively trained multi-way comb network and a light- and contrast-preserving blending method. We also show that while progressive training enables generation of high-resolution images, extending the architecture and training data beyond two people allows us to...
Point sets generated by image-based 3D reconstruction techniques are often much noisier than those obtained using active techniques like laser scanning. Therefore, they pose greater challenges to the subsequent surface reconstruction (meshing) stage. We present a simple and effective method for removing noise and outliers from such point sets. Our algorithm uses the input images and corresponding depth maps to remove pixels which are geometrically or photometrically inconsistent with the colored surface implied by the input. This allows standard surface reconstruction methods...
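The geometric side of such a consistency test can be sketched as follows: a 3D point is kept only if it reprojects into enough other views at a depth that agrees with that view's depth map. The pinhole camera model, tolerance, and vote threshold below are simplified placeholders, not the paper's parameters.

```python
import numpy as np

def consistent(point, cameras, depth_maps, tol=0.01, min_views=2):
    """Geometric consistency check for one 3D point (simplified sketch).

    cameras:    list of (K, R, t) with 3x3 intrinsics K, rotation R, translation t.
    depth_maps: list of HxW depth arrays aligned with the cameras.
    """
    votes = 0
    for (K, R, t), depth in zip(cameras, depth_maps):
        cam = R @ point + t                      # point in camera coordinates
        if cam[2] <= 0:
            continue                             # behind the camera
        uvw = K @ cam
        u, v = int(round(uvw[0] / uvw[2])), int(round(uvw[1] / uvw[2]))
        h, w = depth.shape
        if 0 <= v < h and 0 <= u < w:
            if abs(depth[v, u] - cam[2]) < tol * cam[2]:
                votes += 1                       # this view agrees with the point
    return votes >= min_views
```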
Aligning video is a fundamental task in computer graphics and vision, required for a wide range of applications. We present an interactive method for computing optimal nonlinear temporal alignments of an arbitrary number of videos. We first derive a robust approximation of alignment quality between pairs of clips, computed as a weighted histogram of feature matches. We then find optimal temporal mappings (constituting frame correspondences) using a graph-based approach that allows for very efficient evaluation with artist constraints. This enables...
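The mapping step can be illustrated with a standard dynamic-programming path through a pairwise cost matrix. The cost entries below stand in for the weighted histogram of feature matches described above, and the generic DTW-style recursion is only an illustration, not the paper's graph formulation with artist constraints.

```python
import numpy as np

def align(cost):
    """Monotonic frame-to-frame alignment through a cost matrix (DTW-style sketch).

    cost[i, j] is the dissimilarity between frame i of clip A and frame j of clip B.
    Returns the list of frame correspondences along the cheapest monotonic path.
    """
    n, m = cost.shape
    acc = np.full((n, m), np.inf)
    acc[0, 0] = cost[0, 0]
    for i in range(n):
        for j in range(m):
            if i == j == 0:
                continue
            prev = min(
                acc[i - 1, j] if i > 0 else np.inf,
                acc[i, j - 1] if j > 0 else np.inf,
                acc[i - 1, j - 1] if i > 0 and j > 0 else np.inf,
            )
            acc[i, j] = cost[i, j] + prev
    # Backtrack to recover the frame correspondences.
    path, i, j = [(n - 1, m - 1)], n - 1, m - 1
    while (i, j) != (0, 0):
        candidates = [(i - 1, j), (i, j - 1), (i - 1, j - 1)]
        i, j = min((p for p in candidates if p[0] >= 0 and p[1] >= 0),
                   key=lambda p: acc[p])
        path.append((i, j))
    return path[::-1]
```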
Face recognition models embed a face image into a low-dimensional identity vector containing abstract encodings of identity-specific facial features that allow individuals to be distinguished from one another. We tackle the challenging task of inverting the latent space of pre-trained face recognition models without full model access (i.e. in a black-box setting). A variety of methods have been proposed in the literature for this task, but they have serious shortcomings such as a lack of realistic outputs and strong requirements on the data set...
Despite recent advances in Novel View Synthesis (NVS), generating high-fidelity views from single or sparse observations remains a significant challenge. Existing splatting-based approaches often produce distorted geometry due to splatting errors. While diffusion-based methods leverage rich 3D priors to achieve improved geometry, they often suffer from texture hallucination. In this paper, we introduce SplatDiff, a pixel-splatting-guided video diffusion model designed to synthesize high-fidelity novel views from a single image. Specifically,...
Generative neural image compression supports data representation at extremely low bitrate, synthesizing details at the client and consistently producing highly realistic images. By leveraging the similarities between quantization error and additive noise, diffusion-based generative codecs can be built using a latent diffusion model to "denoise" the artifacts introduced by quantization. However, we identify three critical gaps in previous approaches following this paradigm (namely, the noise level, noise type,...
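The quantization-error-as-noise analogy mentioned above is easy to state concretely: uniform scalar quantization of a latent introduces an error that is bounded by half the step size and behaves approximately like additive uniform noise, which is what the latent diffusion model is asked to remove. A toy numpy illustration (the step size is arbitrary):

```python
import numpy as np

rng = np.random.default_rng(0)
latent = rng.normal(size=10_000)

step = 0.5                                   # arbitrary quantization step
quantized = np.round(latent / step) * step   # uniform scalar quantization
error = quantized - latent                   # behaves like additive noise

print(error.min(), error.max())              # bounded by +/- step / 2
print(error.std(), step / np.sqrt(12))       # close to the uniform-noise std
```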
In this article, we describe a complete pipeline for the capture and display of real-world Virtual Reality video content, based on the concept of omnistereoscopic panoramas. We address important practical and theoretical issues that have remained undiscussed in previous works. On the capture side, we show how high-quality omnistereo video can be generated from a sparse set of cameras (16 in our prototype array) instead of the hundreds of input views previously required. Despite the low number of views, our approach allows for high quality, real-time virtual...
Encoding videos as neural networks is a recently proposed approach that allows new forms of video processing. However, traditional compression techniques still outperform such neural video representation (NVR) methods for the task of video compression. This performance gap can be explained by the fact that current NVR methods: i) use architectures that do not efficiently obtain a compact representation of temporal and spatial information; and ii) minimize rate and distortion disjointly (first overfitting a network on a video and then using heuristic post-training quantization or...
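Shortcoming ii) can be made concrete with a toy example: overfit a small coordinate network to a clip, then apply heuristic post-training weight quantization as a separate step. Everything below (architecture, bit width, dummy clip) is illustrative and not taken from any specific NVR method.

```python
import torch
import torch.nn as nn

# Toy neural video representation: an MLP that maps (t, y, x) -> RGB,
# overfitted to a single clip.
nvr = nn.Sequential(nn.Linear(3, 64), nn.ReLU(), nn.Linear(64, 64),
                    nn.ReLU(), nn.Linear(64, 3))

video = torch.rand(8, 3, 16, 16)                      # (T, C, H, W) dummy clip
t, y, x = torch.meshgrid(torch.linspace(0, 1, 8),
                         torch.linspace(0, 1, 16),
                         torch.linspace(0, 1, 16), indexing="ij")
coords = torch.stack((t, y, x), dim=-1).reshape(-1, 3)
targets = video.permute(0, 2, 3, 1).reshape(-1, 3)

opt = torch.optim.Adam(nvr.parameters(), lr=1e-3)
for _ in range(200):                                  # first: overfit the network
    opt.zero_grad()
    loss = ((nvr(coords) - targets) ** 2).mean()
    loss.backward()
    opt.step()

# then: heuristic post-training quantization of the weights (8-bit, per-tensor),
# applied disjointly from the rate-distortion objective.
for p in nvr.parameters():
    scale = p.abs().max() / 127
    p.data = torch.round(p.data / scale) * scale
```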
Deep learning based image compression has recently witnessed exciting progress and in some cases even managed to surpass transform coding approaches that have been established and refined over many decades. However, state-of-the-art solutions for deep image compression typically employ autoencoders which map the input to a lower dimensional latent space and thus irreversibly discard information already before quantization. Due to that, they inherently limit the range of quality levels that can be covered. In contrast, traditional...
Although existing neural video compression (NVC) methods have achieved significant success, most of them focus on improving either temporal or spatial information separately. They generally use simple operations such as concatenation or subtraction to utilize this information, and thereby only partially exploit spatio-temporal redundancies. This work aims to effectively and jointly leverage robust temporal and spatial information by proposing a new 3D-based transformer module: the Spatio-Temporal Cross-Covariance Transformer (ST-XCT). The...
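Cross-covariance attention computes the attention map across feature channels rather than tokens, so its cost grows linearly with the number of tokens, which keeps large spatio-temporal token sets affordable. The sketch below shows that single operation for one head with no normalization tricks; it is a generic illustration, not the full ST-XCT module.

```python
import torch
import torch.nn.functional as F

def cross_covariance_attention(q, k, v):
    """Channel-wise (cross-covariance) attention, single head.

    q, k, v: (N, tokens, channels). The attention map is C x C, so the cost
    is linear in the number of tokens.
    """
    q = F.normalize(q, dim=1)                              # normalize over tokens
    k = F.normalize(k, dim=1)
    attn = torch.softmax(q.transpose(1, 2) @ k, dim=-1)    # (N, C, C)
    return v @ attn.transpose(1, 2)                        # (N, tokens, C)

out = cross_covariance_attention(torch.randn(2, 196, 64),
                                 torch.randn(2, 196, 64),
                                 torch.randn(2, 196, 64))
```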
The usage of deep generative models for image compression has led to impressive performance gains over classical codecs, while neural video compression is still in its infancy. Here, we propose an end-to-end, deep generative modeling approach to compress temporal sequences with a focus on video. Our approach builds upon variational autoencoder (VAE) models for sequential data and combines them with recent work on neural image compression. The approach jointly learns to transform the original sequence into a lower-dimensional representation as well as to discretize and entropy code this...
Video frame interpolation has seen important progress in recent years, thanks to developments in several directions. Some works leverage better optical flow methods with improved splatting strategies or additional cues from depth, while others have investigated alternative approaches through direct predictions with transformers. Still, the problem remains unsolved in more challenging conditions such as complex lighting or large motion. In this work, we are bridging the gap towards video production with a novel...