NFDI4DS | UHH-SEMS - Publication Details

Image Generators with Conditionally-Independent Pixel Synthesis

OPENALEX - Publications

Ivan Anokhin Kirill Demochkin Taras Khakhulin Gleb Sterkin Victor Lempitsky and 1 more

Existing image generator networks rely heavily on spatial convolutions and, optionally, self-attention blocks in order to gradually synthesize images a coarse-to-fine manner. Here, we present new architecture for generators, where the color value at each pixel is computed independently given of random latent vector and coordinate that pixel. No or similar operations propagate information across pixels are involved during synthesis. We analyze modeling capabilities such generators when...

10.1109/cvpr46437.2021.01405 article EN 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) 2021-06-01

HumanRF: High-Fidelity Neural Radiance Fields for Humans in Motion

OPENALEX - Publications

Mustafa Işık Martin Rünz Markos Georgopoulos Taras Khakhulin J. Starck and 2 more

Representing human performance at high-fidelity is an essential building block in diverse applications, such as film production, computer games or videoconferencing. To close the gap to production-level quality, we introduce HumanRF, a 4D dynamic neural scene representation that captures full-body appearance motion from multi-view video input, and enables playback novel, unseen viewpoints. Our novel acts encoding fine details high compression rates by factorizing space-time into temporal...

10.1145/3592415 article EN ACM Transactions on Graphics 2023-07-26

DeepPavlov: Open-Source Library for Dialogue Systems

OPENALEX - Publications

Mikhail Burtsev А. В. Селиверстов Rafael Airapetyan М. В. Архипов Dilyara Baymurzina and 15 more

Mikhail Burtsev, Alexander Seliverstov, Rafael Airapetyan, Arkhipov, Dilyara Baymurzina, Nickolay Bushkov, Olga Gureenkova, Taras Khakhulin, Yuri Kuratov, Denis Kuznetsov, Alexey Litinsky, Varvara Logacheva, Lymar, Valentin Malykh, Maxim Petrov, Vadim Polulyakh, Leonid Pugachev, Sorokin, Maria Vikhreva, Marat Zaynutdinov. Proceedings of ACL 2018, System Demonstrations. 2018.

10.18653/v1/p18-4021 article EN cc-by 2018-01-01

MegaPortraits: One-shot Megapixel Neural Head Avatars

OPENALEX - Publications

Nikita Drobyshev Jenya Chelishev Taras Khakhulin Aleksei Ivakhnenko Victor Lempitsky and 1 more

In this work, we advance the neural head avatar technology to megapixel resolution while focusing on particularly challenging task of cross-driving synthesis, i.e., when appearance driving image is substantially different from animated source image. We propose a set new architectures and training methods that can leverage both medium-resolution video data high-resolution achieve desired levels rendered quality generalization novel views motion. demonstrate suggested produce convincing...

10.1145/3503161.3547838 article EN Proceedings of the 30th ACM International Conference on Multimedia 2022-10-10

High-Resolution Daytime Translation Without Domain Labels

OPENALEX - Publications

Ivan Anokhin Pavel Solovev Denis Korzhenkov Alexey Kharlamov Taras Khakhulin and 4 more

Modeling daytime changes in high resolution photographs, e.g., re-rendering the same scene under different illuminations typical for day, night, or dawn, is a challenging image manipulation task. We present high-resolution translation (HiDT) model this HiDT combines generative image-to-image and new upsampling scheme that allows to apply at resolution. The demonstrates competitive results terms of both commonly used GAN metrics human evaluation. Importantly, good performance comes as result...

10.1109/cvpr42600.2020.00751 article EN 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) 2020-06-01

Self-improving Multiplane-to-layer Images for Novel View Synthesis

OPENALEX - Publications

Pavel Solovev Taras Khakhulin Denis Korzhenkov

We present a new method for lightweight novel-view synthesis that generalizes to an arbitrary forward-facing scene. Recent approaches are computationally expensive, require per-scene optimization, or produce memory-expensive representation. start by representing the scene with set of fronto-parallel semitransparent planes and afterwards convert them deformable layers in end-to-end manner. Additionally, we employ feed-forward refinement procedure corrects estimated representation aggregating...

10.1109/wacv56688.2023.00429 article EN 2022 IEEE/CVF Winter Conference on Applications of Computer Vision (WACV) 2023-01-01

Stereo Magnification with Multi-Layer Images

OPENALEX - Publications

Taras Khakhulin Denis Korzhenkov Pavel Solovev G. Sterkin Andrei-Timotei Ardelean and 1 more

Representing scenes with multiple semitransparent colored layers has been a popular and successful choice for real-time novel view synthesis. Existing approaches infer colors transparency values over regularly spaced of planar or spherical shape. In this work, we introduce new synthesis approach based on scene-adapted geometry. Our infers such representations from stereo pairs in two stages. The first stage produces the geometry small number data-adaptive given pair views. second color these...

10.1109/cvpr52688.2022.00849 article EN 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) 2022-06-01

Robust Word Vectors: Context-Informed Embeddings for Noisy Texts

OPENALEX - Publications

Valentin Malykh Varvara Logacheva Taras Khakhulin

We suggest a new language-independent architecture of robust word vectors (RoVe). It is designed to alleviate the issue typos, which are common in almost any user-generated content, and hinder automatic text processing. Our model morphologically motivated, allows it deal with unseen forms rich languages. present results on number Natural Language Processing (NLP) tasks languages for variety related architectures show that proposed typo-proof.

10.18653/v1/w18-6108 article EN cc-by 2018-01-01

Simple heuristics for efficient parallel tensor contraction and quantum circuit simulation

OPENALEX - Publications

Roman Schutski Taras Khakhulin Ivan Oseledets Dmitry Kolmakov

Tensor networks are the main building blocks in a wide variety of computational sciences, ranging from many-body theory and quantum computing to probability machine learning. Here we propose parallel algorithm for contraction tensor using probabilistic graphical models. Our approach is based on heuristic solution $\mu$-treewidth deletion problem graph theory. We apply resulting simulation random circuits discuss extensions general network contractions.

10.1103/physreva.102.062614 article EN Physical review. A/Physical review, A 2020-12-28

Robust Word Vectors: Context-Informed Embeddings for Noisy Texts

OPENALEX - Publications

Valentin Malykh Taras Khakhulin Varvara Logacheva

10.1007/s10958-023-06523-w article EN Journal of Mathematical Sciences 2023-06-22

Noise Robustness in Aspect Extraction Task

OPENALEX - Publications

Valentin Malykh Taras Khakhulin

Aspect extraction from user reviews is one of the sources to make dialog systems, which are on rise now. A typical a conversation system has no time check spelling or grammar in his her utterances. Due that utterances contain typos and errors, so noise robustness should be considered as significant feature an aspect model. We analyze noise-robustness state-of-the-art Attention-Based Extraction technique propose extensions for this model, lead more robust behaviour presence typos....

10.1109/ic-aiai.2018.8674450 article EN 2018-10-01

Graph Convolutional Policy for Solving Tree Decomposition via Reinforcement Learning Heuristics

OPENALEX - Publications

Taras Khakhulin Roman Schutski Ivan Oseledets

We propose a Reinforcement Learning based approach to approximately solve the Tree Decomposition (TD) problem. TD is combinatorial problem, which central analysis of graph minor structure and computational complexity, as well in algorithms probabilistic inference, register allocation, other practical tasks. Recently, it has been shown that problems can be successively solved by learned heuristics. However, majority existing works do not address question generalization learning-based...

10.48550/arxiv.1910.08371 preprint EN other-oa arXiv (Cornell University) 2019-01-01

High-Resolution Daytime Translation Without Domain Labels

OPENALEX - Publications

Ivan Anokhin Pavel Solovev Denis Korzhenkov T. Alexopoulos Taras Khakhulin and 4 more

Modeling daytime changes in high resolution photographs, e.g., re-rendering the same scene under different illuminations typical for day, night, or dawn, is a challenging image manipulation task. We present high-resolution translation (HiDT) model this HiDT combines generative image-to-image and new upsampling scheme that allows to apply at resolution. The demonstrates competitive results terms of both commonly used GAN metrics human evaluation. Importantly, good performance comes as result...

10.48550/arxiv.2003.08791 preprint EN other-oa arXiv (Cornell University) 2020-01-01

Image Generators with Conditionally-Independent Pixel Synthesis

OPENALEX - Publications

Ivan Anokhin Kirill Demochkin Taras Khakhulin Gleb Sterkin Victor Lempitsky and 1 more

Existing image generator networks rely heavily on spatial convolutions and, optionally, self-attention blocks in order to gradually synthesize images a coarse-to-fine manner. Here, we present new architecture for generators, where the color value at each pixel is computed independently given of random latent vector and coordinate that pixel. No or similar operations propagate information across pixels are involved during synthesis. We analyze modeling capabilities such generators when...

10.48550/arxiv.2011.13775 preprint EN cc-by arXiv (Cornell University) 2020-01-01

Stereo Magnification with Multi-Layer Images

OPENALEX - Publications

Taras Khakhulin Denis Korzhenkov Pavel Solovev Gleb Sterkin Timotei Ardelean and 1 more

Representing scenes with multiple semi-transparent colored layers has been a popular and successful choice for real-time novel view synthesis. Existing approaches infer colors transparency values over regularly-spaced of planar or spherical shape. In this work, we introduce new synthesis approach based on scene-adapted geometry. Our infers such representations from stereo pairs in two stages. The first stage the geometry small number data-adaptive given pair views. second color these...

10.48550/arxiv.2201.05023 preprint EN cc-by arXiv (Cornell University) 2022-01-01

Realistic One-shot Mesh-based Head Avatars

OPENALEX - Publications

Taras Khakhulin Vanessa Sklyarova Victor Lempitsky Egor Zakharov

We present a system for realistic one-shot mesh-based human head avatars creation, ROME short. Using single photograph, our model estimates person-specific mesh and the associated neural texture, which encodes both local photometric geometric details. The resulting are rigged can be rendered using network, is trained alongside texture estimators on dataset of in-the-wild videos. In experiments, we observe that performs competitively in terms geometry recovery quality renders, especially...

10.48550/arxiv.2206.08343 preprint EN cc-by-sa arXiv (Cornell University) 2022-01-01

Self-improving Multiplane-to-layer Images for Novel View Synthesis

OPENALEX - Publications

Pavel Solovev Taras Khakhulin Denis Korzhenkov

We present a new method for lightweight novel-view synthesis that generalizes to an arbitrary forward-facing scene. Recent approaches are computationally expensive, require per-scene optimization, or produce memory-expensive representation. start by representing the scene with set of fronto-parallel semitransparent planes and afterward convert them deformable layers in end-to-end manner. Additionally, we employ feed-forward refinement procedure corrects estimated representation aggregating...

10.48550/arxiv.2210.01602 preprint EN cc-by-sa arXiv (Cornell University) 2022-01-01

MegaPortraits: One-shot Megapixel Neural Head Avatars

OPENALEX - Publications

Nikita Drobyshev Jenya Chelishev Taras Khakhulin Aleksei Ivakhnenko Victor Lempitsky and 1 more

In this work, we advance the neural head avatar technology to megapixel resolution while focusing on particularly challenging task of cross-driving synthesis, i.e., when appearance driving image is substantially different from animated source image. We propose a set new architectures and training methods that can leverage both medium-resolution video data high-resolution achieve desired levels rendered quality generalization novel views motion. demonstrate suggested produce convincing...

10.48550/arxiv.2207.07621 preprint EN cc-by-nc-sa arXiv (Cornell University) 2022-01-01