Björn Ommer

ORCID: 0000-0003-0766-120X
Research Areas
  • Generative Adversarial Networks and Image Synthesis
  • Advanced Image and Video Retrieval Techniques
  • Human Pose and Action Recognition
  • Domain Adaptation and Few-Shot Learning
  • Multimodal Machine Learning Applications
  • Image Retrieval and Classification Techniques
  • Advanced Vision and Imaging
  • Advanced Neural Network Applications
  • Computer Graphics and Visualization Techniques
  • Advanced Image Processing Techniques
  • Video Surveillance and Tracking Methods
  • Face recognition and analysis
  • Image Processing and 3D Reconstruction
  • 3D Surveying and Cultural Heritage
  • Anomaly Detection Techniques and Applications
  • Robotics and Sensor-Based Localization
  • Explainable Artificial Intelligence (XAI)
  • Human Motion and Animation
  • Video Analysis and Summarization
  • Medical Imaging Techniques and Applications
  • Aesthetic Perception and Analysis
  • Image Enhancement Techniques
  • Hand Gesture Recognition Systems
  • Image and Signal Denoising Methods
  • Remote-Sensing Image Classification

Ludwig-Maximilians-Universität München
2021-2025

LMU Klinikum
2023-2025

Heidelberg University
2012-2022

Heidelberg University
2012-2021

University of California, Berkeley
2009

ETH Zurich
2005-2007

University of Bonn
2003

By decomposing the image formation process into a sequential application of denoising autoencoders, diffusion models (DMs) achieve state-of-the-art synthesis results on image data and beyond. Additionally, their formulation allows for a guiding mechanism to control the image generation process without retraining. However, since these models typically operate directly in pixel space, optimization of powerful DMs often consumes hundreds of GPU days and inference is expensive due to sequential evaluations. To enable DM training on limited computational...

10.1109/cvpr52688.2022.01042 article EN 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) 2022-06-01
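The core trick described above, running the denoising process in a compact latent space rather than in pixel space, can be sketched in a few lines. This is an illustration only, not the paper's code: the "autoencoder" is a stand-in random linear map, and the noise schedule is the generic DDPM-style linear schedule.

```python
import numpy as np

# Toy sketch of the latent diffusion idea (illustration only): run the forward
# noising process in a small latent space produced by an "autoencoder",
# instead of in full pixel space. Encoder here is a stand-in linear map.

rng = np.random.default_rng(0)

IMG_DIM, LATENT_DIM, T = 64 * 64, 16, 1000

W = rng.standard_normal((LATENT_DIM, IMG_DIM)) / np.sqrt(IMG_DIM)
encode = lambda x: W @ x                  # pixels -> compact latent

# Generic DDPM-style linear noise schedule; alpha_bar[t] is the fraction of
# signal remaining after t noising steps.
betas = np.linspace(1e-4, 0.02, T)
alpha_bar = np.cumprod(1.0 - betas)

def q_sample(z0, t, eps):
    """Forward diffusion: sample z_t ~ q(z_t | z_0) in closed form."""
    return np.sqrt(alpha_bar[t]) * z0 + np.sqrt(1.0 - alpha_bar[t]) * eps

x = rng.standard_normal(IMG_DIM)          # a fake flattened "image"
z0 = encode(x)                            # diffusion operates on 16 dims, not 4096
eps = rng.standard_normal(LATENT_DIM)
z_mid = q_sample(z0, T // 2, eps)
# A denoiser eps_theta(z_t, t) would be trained to predict eps from z_t;
# guidance can then steer the reverse process without retraining.
```

Because every denoising step touches only `LATENT_DIM` values instead of `IMG_DIM`, both training and sequential inference become far cheaper, which is the paper's central argument.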

Designed to learn long-range interactions on sequential data, transformers continue to show state-of-the-art results on a wide variety of tasks. In contrast to CNNs, they contain no inductive bias that prioritizes local interactions. This makes them expressive, but also computationally infeasible for long sequences, such as high-resolution images. We demonstrate how combining the effectiveness of the inductive bias of CNNs with the expressivity of transformers enables them to model and thereby synthesize high-resolution images. We show how to (i) use CNNs to learn a context-rich vocabulary of image...

10.1109/cvpr46437.2021.01268 article EN 2021 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) 2021-06-01
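The "context-rich vocabulary" mentioned above rests on vector quantization: a CNN feature map is snapped to the nearest entries of a learned codebook, yielding a discrete token sequence a transformer can model autoregressively. A minimal sketch of that quantization step, with invented names and a random codebook in place of a trained one:

```python
import numpy as np

# Illustrative sketch (not the paper's code) of the vector-quantization step
# that turns a CNN feature map into discrete tokens from a codebook -- the
# "vocabulary" a transformer then models autoregressively.

rng = np.random.default_rng(1)

K, D = 512, 8                          # codebook size, code dimension
codebook = rng.standard_normal((K, D))

def quantize(features):
    """Snap each D-dim feature vector to its nearest codebook entry.

    features: (N, D) -> (token indices (N,), quantized vectors (N, D))
    """
    d2 = ((features[:, None, :] - codebook[None, :, :]) ** 2).sum(-1)
    idx = d2.argmin(axis=1)
    return idx, codebook[idx]

feats = rng.standard_normal((16 * 16, D))   # flattened 16x16 feature map
tokens, quantized = quantize(feats)
# `tokens` is the 256-symbol sequence the transformer learns to predict,
# which is vastly shorter than the raw pixel sequence of the image.
```

The point of the combination is division of labor: the convolutional encoder and codebook capture local structure, so the quadratic-cost transformer only has to model a short sequence of high-level tokens.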

The brain exhibits limited capacity for spontaneous restoration of lost motor functions after stroke. Rehabilitation is the prevailing clinical approach to augment functional recovery, but its scientific basis is poorly understood. Here, we show nearly full recovery of skilled forelimb functions in rats with large strokes when a growth-promoting immunotherapy against a neurite growth-inhibitory protein was applied to boost the sprouting of new fibers, before stabilizing the newly formed circuits by intensive training. In...

10.1126/science.1253050 article EN Science 2014-06-12

Artists rarely paint in a single style throughout their career. More often they change styles or develop variations of it. In addition, artworks in different styles, and even within one style, depict real content differently: while Picasso's Blue Period displays a vase in blueish tones but as a whole, his Cubist works deconstruct the object. To produce artistically convincing stylizations, style transfer models must be able to reflect these changes and variations. Recently many works have aimed to improve the style transfer task, but neglected to address...

10.1109/iccv.2019.00452 article EN 2019 IEEE/CVF International Conference on Computer Vision (ICCV) 2019-10-01

Large intra-class variation is the result of changes in multiple object characteristics. Images, however, only show a superposition of the different variable factors such as appearance or shape. Therefore, learning to disentangle and represent these characteristics poses a great challenge, especially in the unsupervised case. Moreover, large object articulation calls for a flexible part-based model. We present an approach for disentangling appearance and shape by learning parts consistently over all instances of a category. Our model representation...

10.1109/cvpr.2019.01121 article EN 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) 2019-06-01

Capturing photographs with wrong exposures remains a major source of errors in camera-based imaging. Exposure problems are categorized as either: (i) overexposed, where the camera exposure was too long, resulting in bright and washed-out image regions, or (ii) underexposed, where the exposure was too short, resulting in dark regions. Both under- and overexposure greatly reduce the contrast and visual appeal of an image. Prior work mainly focuses on underexposed images or general image enhancement. In contrast, our proposed method targets both over-...

10.1109/cvpr46437.2021.00904 article EN 2021 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) 2021-06-01

By decomposing the image formation process into a sequential application of denoising autoencoders, diffusion models (DMs) achieve state-of-the-art synthesis results on image data and beyond. Additionally, their formulation allows for a guiding mechanism to control the image generation process without retraining. However, since these models typically operate directly in pixel space, optimization of powerful DMs often consumes hundreds of GPU days and inference is expensive due to sequential evaluations. To enable DM training on limited computational...

10.48550/arxiv.2112.10752 preprint EN other-oa arXiv (Cornell University) 2021-01-01

Nerve injury leads to chronic pain and exaggerated sensitivity to gentle touch (allodynia) as well as a loss of sensation in the areas in which injured and non-injured nerves come together 1–3. The mechanisms that disambiguate these mixed and paradoxical symptoms are unknown. Here we longitudinally and non-invasively imaged genetically labelled populations of fibres that sense noxious stimuli (nociceptors) and gentle touch (low-threshold afferents) peripherally in the skin for longer than 10 months after nerve injury, while...

10.1038/s41586-022-04777-z article EN cc-by Nature 2022-05-25

The field of visual computing is rapidly advancing due to the emergence of generative artificial intelligence (AI), which unlocks unprecedented capabilities for the generation, editing, and reconstruction of images, videos, and 3D scenes. In these domains, diffusion models are the generative AI architecture of choice. Within the last year alone, the literature on diffusion-based tools and applications has seen exponential growth, with relevant papers published across the computer graphics, computer vision, and AI communities and new works appearing...

10.1111/cgf.15063 article EN Computer Graphics Forum 2024-04-30

Detecting abnormalities in video is a challenging problem since the class of all irregular objects and behaviors is infinite and thus no (or by far not enough) abnormal training samples are available. Consequently, the standard setting is to find abnormalities without actually knowing what they are, because we have not been shown abnormal examples during training. However, although the training data does not define what an abnormality looks like, the main paradigm in this field is to directly search for individual abnormal local patches or image regions independent of one another. To...

10.1109/iccv.2011.6126525 article EN International Conference on Computer Vision 2011-11-01

Learning the embedding space, where semantically similar objects are located close together and dissimilar objects far apart, is a cornerstone of many computer vision applications. Existing approaches usually learn a single metric in the embedding space for all available data points, which may have a very complex non-uniform distribution with different notions of similarity between objects, e.g. appearance, shape, color or semantic meaning. Approaches for learning a single distance metric often struggle to encode all the different types of relationships and do not...

10.1109/cvpr.2019.00056 article EN 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) 2019-06-01
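One natural reading of the divide-and-conquer idea above is: partition the embedding space into clusters and train a separate sub-learner per cluster, each capturing its own notion of similarity. The sketch below uses generic stand-ins throughout (plain k-means, a textbook triplet loss); none of it is the paper's actual code.

```python
import numpy as np

# Hedged sketch of a "divide and conquer" strategy for metric learning:
# partition the embedding space with k-means, then let one sub-learner per
# cluster minimize a triplet loss only on its own data. All generic stand-ins.

rng = np.random.default_rng(2)

def kmeans(x, k, iters=20):
    """Plain k-means; returns a cluster assignment per point."""
    centers = x[rng.choice(len(x), size=k, replace=False)]
    for _ in range(iters):
        assign = ((x[:, None, :] - centers[None, :, :]) ** 2).sum(-1).argmin(1)
        centers = np.stack([x[assign == j].mean(0) if (assign == j).any()
                            else centers[j] for j in range(k)])
    return assign

def triplet_loss(anchor, pos, neg, margin=0.2):
    """Pull anchor towards pos and away from neg by at least `margin`."""
    return max(0.0, np.linalg.norm(anchor - pos)
                    - np.linalg.norm(anchor - neg) + margin)

emb = rng.standard_normal((100, 32))     # embeddings of 100 images
assign = kmeans(emb, k=4)                # 4 simpler, more uniform sub-problems
# each sub-learner now sees only {emb[i] : assign[i] == j}
```

The motivation matches the abstract: a single global metric has to reconcile conflicting notions of similarity, whereas each sub-problem covers a more uniform region of the data distribution.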

Estimating dense visual correspondences between objects with intra-class variation, deformations and background clutter remains a challenging problem. Thanks to the breakthrough of CNNs, there are new powerful features available. Despite their easy accessibility and great success, existing semantic flow methods could not significantly benefit from these without extensive additional training. We introduce a novel method for semantic matching with pre-trained CNN features which is based on convolutional feature pyramids...

10.1109/cvpr.2017.628 article EN 2017-07-01

Metric learning seeks to embed images of objects such that class-defined relations are captured by the embedding space. However, variability in images is not just due to the different depicted object classes, but also depends on other latent characteristics such as viewpoint or illumination. In addition to these structured properties, random noise further obstructs the visual relations of interest. The common approach to metric learning is to enforce a representation that is invariant under all factors but the ones of interest. In contrast, we propose to explicitly learn the shared and...

10.1109/iccv.2019.00809 article EN 2019 IEEE/CVF International Conference on Computer Vision (ICCV) 2019-10-01

Style transfer has recently received a lot of attention, since it allows to study fundamental challenges in image understanding and synthesis. Recent work has significantly improved the representation of color and texture and the computational speed and image resolution. The explicit transformation of image content has, however, been mostly neglected: while artistic style affects formal characteristics of an image, such as color, shape or texture, it also deforms, adds or removes content details. This paper explicitly focuses on a content-and...

10.1109/cvpr.2019.01027 article EN 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) 2019-06-01

Current neuromodulatory strategies to enhance motor recovery after stroke often target large brain areas non-specifically and without sufficient understanding of their interaction with internal repair mechanisms. Here we developed a novel therapeutic approach by specifically activating corticospinal circuitry using optogenetics after large strokes in rats. Similar to a neuronal growth-promoting immunotherapy, optogenetic stimulation together with intense, scheduled rehabilitation leads to the restoration of lost...

10.1038/s41467-017-01090-6 article EN cc-by Nature Communications 2017-10-24

Neural networks have greatly boosted performance in computer vision by learning powerful representations of input data. The drawback of end-to-end training for maximal overall performance are black-box models whose hidden representations are lacking interpretability: since distributed coding is optimal for latent layers to improve their robustness, attributing meaning to parts of a feature vector or to individual neurons is hindered. We formulate interpretation as a translation of hidden representations onto semantic concepts that are comprehensible to the user. The mapping...

10.1109/cvpr42600.2020.00924 article EN 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) 2020-06-01

Exemplar learning is a powerful paradigm for discovering visual similarities in an unsupervised manner. In this context, however, the recent breakthrough in deep learning could not yet unfold its full potential. With only a single positive sample, a great imbalance between this one positive and many negatives, and unreliable relationships between most samples, training of Convolutional Neural Networks is impaired. Given weak estimates of local distance, we propose a single optimization problem to extract batches of samples with mutually consistent...

10.48550/arxiv.1608.08792 preprint EN other-oa arXiv (Cornell University) 2016-01-01

There have been many successful implementations of neural style transfer in recent years. In most of these works, the stylization process is confined to the pixel domain. However, we argue that this representation is unnatural because paintings usually consist of brushstrokes rather than pixels. We propose a method to stylize images by optimizing parameterized brushstrokes instead of pixels and further introduce a simple differentiable rendering mechanism. Our approach significantly improves visual quality and enables additional...

10.1109/cvpr46437.2021.01202 article EN 2021 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) 2021-06-01
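The key property of the brushstroke formulation above is that each stroke is just a handful of parameters, and a smooth renderer maps them onto a canvas so gradients can flow back to those parameters. A toy version of that idea, drawing Gaussian blobs rather than the paper's actual stroke model (all names here are invented):

```python
import numpy as np

# Minimal sketch of optimizing parameterized strokes instead of pixels: each
# "stroke" is (cx, cy, sigma, intensity), and a smooth renderer composites
# them onto a canvas. Toy Gaussian blobs stand in for real brushstrokes.

H = W = 32
ys, xs = np.mgrid[0:H, 0:W].astype(float)   # pixel coordinate grids

def render(strokes):
    """strokes: rows of (cx, cy, sigma, intensity) -> canvas of shape (H, W)."""
    canvas = np.zeros((H, W))
    for cx, cy, sigma, a in strokes:
        canvas += a * np.exp(-((xs - cx) ** 2 + (ys - cy) ** 2)
                             / (2.0 * sigma ** 2))
    return np.clip(canvas, 0.0, 1.0)

# Because `render` is smooth in the stroke parameters, they could be tuned by
# gradient descent against a style/content objective (autodiff in practice).
img = render(np.array([[16.0, 16.0, 3.0, 1.0],
                       [8.0, 24.0, 2.0, 0.5]]))
```

Optimizing in this parameter space constrains the output to stroke-like structure by construction, which is the intuition behind moving the stylization out of the raw pixel domain.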

Designed to learn long-range interactions on sequential data, transformers continue to show state-of-the-art results on a wide variety of tasks. In contrast to CNNs, they contain no inductive bias that prioritizes local interactions. This makes them expressive, but also computationally infeasible for long sequences, such as high-resolution images. We demonstrate how combining the effectiveness of the inductive bias of CNNs with the expressivity of transformers enables them to model and thereby synthesize high-resolution images. We show how to (i) use CNNs to learn a context-rich vocabulary of image...

10.48550/arxiv.2012.09841 preprint EN other-oa arXiv (Cornell University) 2020-01-01

Autoregressive models and their sequential factorization of the data likelihood have recently demonstrated great potential for image representation and synthesis. Nevertheless, they incorporate image context in a linear 1D order by attending only to previously synthesized image patches above or to the left. Not only is this unidirectional, sequential bias of attention unnatural for images, as it disregards large parts of a scene until synthesis is almost complete. It also processes the entire image on a single scale, thus ignoring more global contextual...

10.48550/arxiv.2108.08827 preprint EN other-oa arXiv (Cornell University) 2021-01-01