Florian Barthel

ORCID: 0009-0004-7264-1672
Publications
Citations
Views
---
Saved
---
About
Contact & Profiles
Research Areas
  • Computer Graphics and Visualization Techniques
  • Advanced Vision and Imaging
  • Digital Media Forensic Detection
  • Generative Adversarial Networks and Image Synthesis
  • Speech and Audio Processing
  • Advanced Image and Video Retrieval Techniques
  • Video Surveillance and Tracking Methods
  • Multimodal Machine Learning Applications
  • Epigenetics and DNA Methylation
  • Industrial Vision Systems and Defect Detection
  • Image Processing and 3D Reconstruction
  • Advanced ceramic materials synthesis
  • Advanced Data Compression Techniques
  • Face recognition and analysis
  • Data Management and Algorithms
  • 3D Shape Modeling and Analysis
  • Medical Image Segmentation Techniques
  • Additive Manufacturing and 3D Printing Technologies

Fraunhofer Institute for Telecommunications, Heinrich Hertz Institute
2023-2025

Humboldt-Universität zu Berlin
2024-2025

University of Augsburg
2022-2023

10.5220/0013308500003912 article EN Proceedings of the 17th International Joint Conference on Computer Vision, Imaging and Computer Graphics Theory and Applications 2025-01-01

Abstract 3D Gaussian Splatting (3DGS) has emerged as a cutting‐edge technique for real‐time radiance field rendering, offering state‐of‐the‐art performance in terms of both quality and speed. 3DGS models scene collection three‐dimensional Gaussians, with additional attributes optimized to conform the scene's geometric visual properties. Despite its advantages rendering speed image fidelity, is limited by significant storage memory demands. These high demands make impractical mobile devices...

10.1111/cgf.70078 article EN cc-by Computer Graphics Forum 2025-04-17

Visually sorted grid layouts provide an efficient method for organizing high-dimensional vectors in two-dimensional space by aligning spatial proximity with similarity relationships. This approach facilitates the effective sorting of diverse elements ranging from data points to images, and enables simultaneous visualization a significant number elements. However, on grids is challenge due its high complexity. Even small 8-by-8 64 elements, possible arrangements exceeds 1.3 * 10^89 - more...

10.1145/3652583.3657585 article EN cc-by-nc-sa 2024-05-30

10.1109/cvprw63382.2024.00794 article EN 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops (CVPRW) 2024-06-17

Current scene graph datasets suffer from strong long-tail distributions of their predicate classes. Due to a very low number some classes in the test sets, no reliable metrics can be retrieved for rarest We construct new panoptic dataset and set that are designed as benchmark predictive performance especially on rare To dataset, we propose model-assisted annotation pipeline efficiently finds hidden large images like needles haystack.Contrary prior datasets, Haystack contains explicit...

10.1109/iccvw60793.2023.00013 article EN 2023-10-02

3D Gaussian Splatting has recently emerged as a highly promising technique for modeling of static scenes. In contrast to Neural Radiance Fields, it utilizes efficient rasterization allowing very fast rendering at high-quality. However, the storage size is significantly higher, which hinders practical deployment, e.g. on resource constrained devices. this paper, we introduce compact scene representation organizing parameters (3DGS) into 2D grid with local homogeneity, ensuring drastic...

10.48550/arxiv.2312.13299 preprint EN other-oa arXiv (Cornell University) 2023-01-01

10.5220/0012371000003660 article EN cc-by-nc-nd Proceedings of the 17th International Joint Conference on Computer Vision, Imaging and Computer Graphics Theory and Applications 2024-01-01

NeRF-based 3D-aware Generative Adversarial Networks (GANs) like EG3D or GIRAFFE have shown very high rendering quality under large representational variety. However, with Neural Radiance Fields poses challenges for 3D applications: First, the significant computational demands of NeRF preclude its use on low-power devices, such as mobiles and VR/AR headsets. Second, implicit representations based neural networks are difficult to incorporate into explicit scenes, VR environments video games....

10.48550/arxiv.2404.10625 preprint EN arXiv (Cornell University) 2024-04-16

We present a work-in-progress survey on 3D Gaussian Splatting compression methods, focusing their statistical performance across various benchmarks. This aims to facilitate comparability by summarizing key statistics of different approaches in tabulated format. The datasets evaluated include TanksAndTemples, MipNeRF360, DeepBlending, and SyntheticNeRF. For each method, we report the Peak Signal-to-Noise Ratio (PSNR), Structural Similarity Index (SSIM), Learned Perceptual Image Patch (LPIPS),...

10.48550/arxiv.2407.09510 preprint EN arXiv (Cornell University) 2024-06-17

Abstract In this work, we propose a method that enforces explicit control over various attributes during the image generation process in generative adversarial net. We semi-supervised learning procedure allows us to use quantized approximation of object orientation for continuous rotations. As result, among many other attributes, our proposed scenes are rendered according specifications.

10.1007/s42979-022-01462-w article EN cc-by SN Computer Science 2022-11-10

Current scene graph datasets suffer from strong long-tail distributions of their predicate classes. Due to a very low number some classes in the test sets, no reliable metrics can be retrieved for rarest We construct new panoptic dataset and set that are designed as benchmark predictive performance especially on rare To dataset, we propose model-assisted annotation pipeline efficiently finds hidden large images like needles haystack. Contrary prior datasets, Haystack contains explicit...

10.48550/arxiv.2309.02286 preprint EN other-oa arXiv (Cornell University) 2023-01-01

Current 3D GAN inversion methods for human heads typically use only one single frontal image to reconstruct the whole head model. This leaves out meaningful information when multi-view data or dynamic videos are available. Our method builds on existing state-of-the-art techniques allow consistent and simultaneous of multiple views same subject. We employ a multi-latent extension handle inconsistencies present in face re-synthesize representations from sequence. As our uses additional about...

10.48550/arxiv.2312.05330 preprint EN cc-by-nc-sa arXiv (Cornell University) 2023-01-01
Coming Soon ...