NFDI4DS | UHH-SEMS - Publication Details

David Svitov

ORCID: 0009-0009-9116-0416

Publications

Citations

Views

---

Saved

---

About

Contact & Profiles

A5073577238

Research Areas

Advanced Neural Network Applications
Human Pose and Action Recognition
3D Shape Modeling and Analysis
Augmented Reality Applications
Human Motion and Animation
Face recognition and analysis
Advanced Vision and Imaging
CCD and CMOS Imaging Sensors
Industrial Vision Systems and Defect Detection
Generative Adversarial Networks and Image Synthesis
COVID-19 diagnosis using AI
AI in cancer detection
Video Surveillance and Tracking Methods
Computer Graphics and Visualization Techniques
Advanced Image Fusion Techniques
Digital Media Forensic Detection
Image and Signal Denoising Methods
Anomaly Detection Techniques and Applications
Radiomics and Machine Learning in Medical Imaging
Automated Road and Building Extraction
Virtual Reality Applications and Impacts
Face and Expression Recognition
Biometric Identification and Security
Neural Networks and Applications
Advanced Image Processing Techniques

Samsung (United States)
2023-2024

Institute of Automation and Electrometry
2017-2022

Siberian Branch of the Russian Academy of Sciences
2022

Seoul National University
2021

Russian Academy of Sciences
2017

NTIRE 2021 Challenge on Image Deblurring

OPENALEX - Publications

Seungjun Nah Sanghyun Son Suyoung Lee Radu Timofte Kyoung Mu Lee and 93 more

Motion blur is a common photography artifact in dynamic environments that typically comes jointly with the other types of degradation. This paper reviews NTIRE 2021 Challenge on Image Deblurring. In this challenge report, we describe specifics and evaluation results from 2 competition tracks proposed solutions. While both aim to recover high-quality clean image blurry image, different artifacts are involved. track 1, images low resolution while compressed JPEG format. each competition, there...

10.1109/cvprw53098.2021.00025 article EN 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops (CVPRW) 2021-06-01

Low-Power Computer Vision: Status, Challenges, and Opportunities

OPENALEX - Publications

Sergei Alyamkin Matthew Ardi Alexander C. Berg Achille Brighton Bo Chen and 39 more

Computer vision has achieved impressive progress in recent years. Meanwhile, mobile phones have become the primary computing platforms for millions of people. In addition to phones, many autonomous systems rely on visual data making decisions, and some these limited energy (such as unmanned aerial vehicles also called drones robots). These batteries, efficiency is critical. This paper serves following two main purposes. First, examine state art low-power solutions detect objects images....

10.1109/jetcas.2019.2911899 article EN IEEE Journal on Emerging and Selected Topics in Circuits and Systems 2019-05-23

DINAR: Diffusion Inpainting of Neural Textures for One-Shot Human Avatars

OPENALEX - Publications

David Svitov Dmitrii Gudkov Renat Bashirov Victor Lempitsky

We present DINAR, an approach for creating realistic rigged fullbody avatars from single RGB images. Similarly to previous works, our method uses neural textures combined with the SMPL-X body model achieve photorealistic quality of while keeping them easy animate and fast infer. To restore texture, we use a latent diffusion show how such can be trained in texture space. The allows us realistically reconstruct large unseen regions as back person given frontal view. models pipeline are using...

10.1109/iccv51070.2023.00650 article EN 2021 IEEE/CVF International Conference on Computer Vision (ICCV) 2023-10-01

MoRF: Mobile Realistic Fullbody Avatars from a Monocular Video

OPENALEX - Publications

Renat Bashirov Alexey Larionov Evgeniya Ustinova Mikhail S. Sidorenko David Svitov and 2 more

We present a system to create Mobile Realistic Fullbody (MoRF) avatars. MoRF avatars are rendered in real-time on mobile devices, learned from monocular videos, and have high realism. use SMPL-X as proxy geometry render it with DNR (neural texture image-2-image network). improve prior work, by overfitting perframe warping fields the neural space, allowing better align training signal between different frames. also refine mesh fitting procedure overall avatar quality. In comparisons other...

10.1109/wacv57701.2024.00351 article EN 2022 IEEE/CVF Winter Conference on Applications of Computer Vision (WACV) 2024-01-03

HAHA: Highly Articulated Gaussian Human Avatars with Textured Mesh Prior

OPENALEX - Publications

David Svitov Pietro Morerio Lourdes Agapito Alessio Del Bue

We present HAHA - a novel approach for animatable human avatar generation from monocular input videos. The proposed method relies on learning the trade-off between use of Gaussian splatting and textured mesh efficient high fidelity rendering. demonstrate its efficiency to animate render full-body avatars controlled via SMPL-X parametric model. Our model learns apply only in areas where it is necessary, like hair out-of-mesh clothing. This results minimal number Gaussians being used represent...

10.48550/arxiv.2404.01053 preprint EN arXiv (Cornell University) 2024-04-01

Low-Power Computer Vision: Status, Challenges, Opportunities

OPENALEX - Publications

Sergei Alyamkin Matthew Ardi Alexander C. Berg Achille Brighton Bo Chen and 39 more

Computer vision has achieved impressive progress in recent years. Meanwhile, mobile phones have become the primary computing platforms for millions of people. In addition to phones, many autonomous systems rely on visual data making decisions and some these limited energy (such as unmanned aerial vehicles also called drones robots). These batteries efficiency is critical. This article serves two main purposes: (1) Examine state-of-the-art low-power solutions detect objects images. Since...

10.48550/arxiv.1904.07714 preprint EN other-oa arXiv (Cornell University) 2019-01-01

MarginDistillation: distillation for margin-based softmax

OPENALEX - Publications

David Svitov Sergey Alyamkin

The usage of convolutional neural networks (CNNs) in conjunction with a margin-based softmax approach demonstrates state-of-the-art performance for the face recognition problem. Recently, lightweight network models trained have been introduced identification task edge devices. In this paper, we propose novel distillation method architectures that outperforms other known methods on LFW, AgeDB-30 and Megaface datasets. idea proposed is to use class centers from teacher student network. Then...

10.48550/arxiv.2003.02586 preprint EN other-oa arXiv (Cornell University) 2020-01-01

Detection of suspicious objects on the basis of analysis of human X-ray images

OPENALEX - Publications

David Svitov V. A. Kulikov V. P. Kosykh

10.3103/s875669901702008x article EN Optoelectronics Instrumentation and Data Processing 2017-03-01

BillBoard Splatting (BBSplat): Learnable Textured Primitives for Novel View Synthesis

OPENALEX - Publications

David Svitov Pietro Morerio Lourdes Agapito Alessio Del Bue

We present billboard Splatting (BBSplat) - a novel approach for 3D scene representation based on textured geometric primitives. BBSplat represents the as set of optimizable planar primitives with learnable RGB textures and alpha-maps to control their shape. can be used in any Gaussian pipeline drop-in replacements Gaussians. Our method's qualitative quantitative improvements over 2D Gaussians are most noticeable when fewer used, achieves 1200 FPS. regularization term encourages have sparser...

10.48550/arxiv.2411.08508 preprint EN arXiv (Cornell University) 2024-11-13

Distilling Face Recognition Models Trained Using Margin-Based Softmax Function

OPENALEX - Publications

David Svitov Sergey Alyamkin

10.1134/s00051179220100046 article EN Automation and Remote Control 2022-10-01

2018 Low-Power Image Recognition Challenge

OPENALEX - Publications

Sergei Alyamkin Matthew Ardi Achille Brighton Alexander C. Berg Yiran Chen and 36 more

The Low-Power Image Recognition Challenge (LPIRC, https://rebootingcomputing.ieee.org/lpirc) is an annual competition started in 2015. identifies the best technologies that can classify and detect objects images efficiently (short execution time low energy consumption) accurately (high precision). Over four years, winners' scores have improved more than 24 times. As computer vision widely used many battery-powered systems (such as drones mobile phones), need for low-power will become...

10.48550/arxiv.1810.01732 preprint EN other-oa arXiv (Cornell University) 2018-01-01

DINAR: Diffusion Inpainting of Neural Textures for One-Shot Human Avatars

OPENALEX - Publications

David Svitov Dmitrii Gudkov Renat Bashirov Victor Lemptisky

We present DINAR, an approach for creating realistic rigged fullbody avatars from single RGB images. Similarly to previous works, our method uses neural textures combined with the SMPL-X body model achieve photo-realistic quality of while keeping them easy animate and fast infer. To restore texture, we use a latent diffusion show how such can be trained in texture space. The allows us realistically reconstruct large unseen regions as back person given frontal view. models pipeline are using...

10.48550/arxiv.2303.09375 preprint EN cc-by-nc-nd arXiv (Cornell University) 2023-01-01

MoRF: Mobile Realistic Fullbody Avatars from a Monocular Video

OPENALEX - Publications

Alexey Larionov Evgeniya Ustinova Mikhail S. Sidorenko David Svitov Ilya Zakharkin and 2 more

We present a system to create Mobile Realistic Fullbody (MoRF) avatars. MoRF avatars are rendered in real-time on mobile devices, learned from monocular videos, and have high realism. use SMPL-X as proxy geometry render it with DNR (neural texture image-2-image network). improve prior work, by overfitting per-frame warping fields the neural space, allowing better align training signal between different frames. also refine mesh fitting procedure overall avatar quality. In comparisons other...

10.48550/arxiv.2303.10275 preprint EN cc-by-nc-nd arXiv (Cornell University) 2023-01-01

AmphibianDetector: adaptive computation for moving objects detection

OPENALEX - Publications

David Svitov Sergey Alyamkin

Convolutional neural networks (CNN) allow achieving the highest accuracy for task of object detection in images. Major challenges further development detectors are false-positive detections and high demand processing power. In this paper, we propose an approach to which makes it possible reduce number by only moving objects required power algorithm inference. The proposed is a modification CNN already trained task. This method can be used improve existing system applying minor changes...

10.48550/arxiv.2011.07513 preprint EN other-oa arXiv (Cornell University) 2020-01-01

Optimizing the Neural Network Detector of Moving Objects

OPENALEX - Publications

David Svitov Sergey Alyamkin

10.3103/s875669902101012x article EN Optoelectronics Instrumentation and Data Processing 2021-01-01

Coming Soon ...