NFDI4DS | UHH-SEMS - Publication Details

MonteFloor: Extending MCTS for Reconstructing Accurate Large-Scale Floor Plans

OPENALEX - Publications

Sinisa Stekovic Mahdi Rad Friedrich Fraundorfer Vincent Lepetit

We propose a novel method for reconstructing floor plans from noisy 3D point clouds. Our main contribution is principled approach that relies on the Monte Carlo Tree Search (MCTS) algorithm to maximize suitable objective function efficiently despite complexity of problem. Like previous work, we first project input cloud top view create density map and extract room proposals it. selects optimizes polygonal shapes these jointly fit outputs an accurate vectorized even large complex scenes. To...

10.1109/iccv48922.2021.01573 article EN 2021 IEEE/CVF International Conference on Computer Vision (ICCV) 2021-10-01

Monte Carlo Scene Search for 3D Scene Understanding

OPENALEX - Publications

Shreyas Hampali Sinisa Stekovic Sayan Sarkar Chetan S. Kumar Friedrich Fraundorfer and 1 more

We explore how a general AI algorithm can be used for 3D scene understanding to reduce the need training data. More exactly, we propose modification of Monte Carlo Tree Search (MCTS) retrieve objects and room layouts from noisy RGB-D scans. While MCTS was developed as game-playing algorithm, show it also complex perception problems. Our adapted has few easy-to-tune hyperparameters optimise losses. use posterior probability layout hypotheses given This results in an analysis-by-synthesis...

10.1109/cvpr46437.2021.01359 preprint EN 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) 2021-06-01

Automatically Annotating Indoor Images with CAD Models via RGB-D Scans

OPENALEX - Publications

Stefan Ainetter Sinisa Stekovic Friedrich Fraundorfer Vincent Lepetit

We present an automatic method for annotating images of indoor scenes with the CAD models objects by relying on RGB-D scans. Through a visual evaluation 3D experts, we show that our retrieves annotations are at least as accurate manual annotations, and can thus be used ground truth without burden manually data. do this using analysis-by-synthesis approach, which compares renderings captured scene. introduce 'cloning procedure' identifies have same geometry, to annotate these models. This...

10.1109/wacv56688.2023.00317 article EN 2022 IEEE/CVF Winter Conference on Applications of Computer Vision (WACV) 2023-01-01

HOC-Search: Efficient CAD Model and Pose Retrieval From RGB-D Scans

OPENALEX - Publications

Stefan Ainetter Sinisa Stekovic Friedrich Fraundorfer Vincent Lepetit

We present an automated and efficient approach for retrieving high-quality CAD models of objects their poses in a scene captured by moving RGB-D camera. first investigate various objective functions to measure similarity between candidate object model the available data, best function appears be "render-and-compare" method comparing depth mask rendering. thus introduce fast-search that approximates exhaustive search based on this simultaneously category, model, pose given approximate 3D...

10.1109/3dv62453.2024.00066 article EN 2021 International Conference on 3D Vision (3DV) 2024-03-18

Casting Geometric Constraints in Semantic Segmentation as Semi-Supervised Learning

OPENALEX - Publications

Sinisa Stekovic Friedrich Fraundorfer Vincent Lepetit

We propose a simple yet effective method to learn segment new indoor scenes from video frames: State-of- the-art methods trained on one dataset, even as large the SUNRGB-D can perform poorly when applied images that are not part of because dataset bias, common phenomenon in computer vision. To make semantic segmentation more useful practice, exploit geometric constraints. Our main contribution is show these constraints be cast conveniently semi-supervised terms, which enforce fact same class...

10.1109/wacv45572.2020.9093571 article EN 2020-03-01

MCTS with Refinement for Proposals Selection Games in Scene Understanding

OPENALEX - Publications

Sinisa Stekovic Mahdi Rad Alireza Moradi Friedrich Fraundorfer Vincent Lepetit

We propose a novel method applicable in many scene understanding problems that adapts the Monte Carlo Tree Search (MCTS) algorithm, originally designed to learn play games of high-state complexity. From generated pool proposals, our jointly selects and optimizes proposals minimize objective term. In first application for floor plan reconstruction from point clouds, refines room modelled as 2D polygons, by optimizing on an function combining fitness predicted deep network regularizing terms...

10.1109/tpami.2022.3203729 article EN IEEE Transactions on Pattern Analysis and Machine Intelligence 2022-09-26

PyTorchGeoNodes: Enabling Differentiable Shape Programs for 3D Shape Reconstruction

OPENALEX - Publications

Sinisa Stekovic Stefan Ainetter M. D'urso Friedrich Fraundorfer Vincent Lepetit

We propose PyTorchGeoNodes, a differentiable module for reconstructing 3D objects from images using interpretable shape programs. In comparison to traditional CAD model retrieval methods, the use of programs reconstruction allows reasoning about semantic properties reconstructed objects, editing, low memory footprint, etc. However, utilization scene understanding has been largely neglected in past works. As our main contribution, we enable gradient-based optimization by introducing that...

10.48550/arxiv.2404.10620 preprint EN arXiv (Cornell University) 2024-04-16

IGITUGraz/L2L: v1.0.0-beta

OPENALEX - Publications

Anand Subramoney Sandra Diaz-Pier A. Ravishankar Rao Franz Scherr Darjan Salaj and 5 more

10.5281/zenodo.2590760 article SL 2019-03-11

S4-Net: Geometry-Consistent Semi-Supervised Semantic Segmentation

OPENALEX - Publications

Sinisa Stekovic Friedrich Fraundorfer Vincent Lepetit

We show that it is possible to learn semantic segmentation from very limited amounts of manual annotations, by enforcing geometric 3D constraints between multiple views. More exactly, image locations corresponding the same physical point should all have label. introducing such during learning effective, even when no label available for a point, and can be done simply employing techniques 'general' semi-supervised context segmentation. To demonstrate this idea, we use RGB-D sequences rigid...

10.48550/arxiv.1812.10717 preprint EN other-oa arXiv (Cornell University) 2018-01-01

HOC-Search: Efficient CAD Model and Pose Retrieval from RGB-D Scans

OPENALEX - Publications

Stefan Ainetter Sinisa Stekovic Friedrich Fraundorfer Vincent Lepetit

We present an automated and efficient approach for retrieving high-quality CAD models of objects their poses in a scene captured by moving RGB-D camera. first investigate various objective functions to measure similarity between candidate object model the available data, best function appears be "render-and-compare" method comparing depth mask rendering. thus introduce fast-search that approximates exhaustive search based on this simultaneously category, model, pose given approximate 3D...

10.48550/arxiv.2309.06107 preprint EN other-oa arXiv (Cornell University) 2023-01-01

Casting Geometric Constraints in Semantic Segmentation as Semi-Supervised Learning

OPENALEX - Publications

Sinisa Stekovic Friedrich Fraundorfer Vincent Lepetit

We propose a simple yet effective method to learn segment new indoor scenes from video frames: State-of-the-art methods trained on one dataset, even as large the SUNRGB-D can perform poorly when applied images that are not part of because dataset bias, common phenomenon in computer vision. To make semantic segmentation more useful practice, exploit geometric constraints. Our main contribution is show these constraints be cast conveniently semi-supervised terms, which enforce fact same class...

10.48550/arxiv.1904.12534 preprint EN other-oa arXiv (Cornell University) 2019-01-01

General 3D Room Layout from a Single View by Render-and-Compare

OPENALEX - Publications

Sinisa Stekovic Shreyas Hampali Rad Mahdi Sarkar Sayan Deb Friedrich Fraundorfer and 1 more

We present a novel method to reconstruct the 3D layout of room (walls, floors, ceilings) from single perspective view in challenging conditions, by contrast with previous single-view methods restricted cuboid-shaped layouts. This input can consist color image only, but considering depth map results more accurate reconstruction. Our approach is formalized as solving constrained discrete optimization problem find set polygons that constitute layout. In order deal occlusions between components...

10.48550/arxiv.2001.02149 preprint EN other-oa arXiv (Cornell University) 2020-01-01

MonteBoxFinder: Detecting and Filtering Primitives to Fit a Noisy Point Cloud

OPENALEX - Publications

Michaël Ramamonjisoa Sinisa Stekovic Vincent Lepetit

We present MonteBoxFinder, a method that, given noisy input point cloud, fits cuboids to the scene. Our primary contribution is discrete optimization algorithm from dense set of initially detected cuboids, able efficiently filter good boxes ones. Inspired by recent applications MCTS scene understanding problems, we develop stochastic that is, design, more efficient for our task. Indeed, quality fit cuboid arrangement invariant order in which are added into several search baselines problem...

10.48550/arxiv.2207.14268 preprint EN other-oa arXiv (Cornell University) 2022-01-01

Automatically Annotating Indoor Images with CAD Models via RGB-D Scans

OPENALEX - Publications

Stefan Ainetter Sinisa Stekovic Friedrich Fraundorfer Vincent Lepetit

We present an automatic method for annotating images of indoor scenes with the CAD models objects by relying on RGB-D scans. Through a visual evaluation 3D experts, we show that our retrieves annotations are at least as accurate manual annotations, and can thus be used ground truth without burden manually data. do this using analysis-by-synthesis approach, which compares renderings captured scene. introduce 'cloning procedure' identifies have same geometry, to annotate these models. This...

10.48550/arxiv.2212.11796 preprint EN other-oa arXiv (Cornell University) 2022-01-01

MCTS with Refinement for Proposals Selection Games in Scene Understanding

OPENALEX - Publications

Sinisa Stekovic Mahdi Rad Alireza Moradi Friedrich Fraundorfer Vincent Lepetit

We propose a novel method applicable in many scene understanding problems that adapts the Monte Carlo Tree Search (MCTS) algorithm, originally designed to learn play games of high-state complexity. From generated pool proposals, our jointly selects and optimizes proposals minimize objective term. In first application for floor plan reconstruction from point clouds, refines room modelled as 2D polygons, by optimizing on an function combining fitness predicted deep network regularizing terms...

10.48550/arxiv.2207.03204 preprint EN other-oa arXiv (Cornell University) 2022-01-01

MonteFloor: Extending MCTS for Reconstructing Accurate Large-Scale Floor Plans

OPENALEX - Publications

Sinisa Stekovic Mahdi Rad Friedrich Fraundorfer Vincent Lepetit

We propose a novel method for reconstructing floor plans from noisy 3D point clouds. Our main contribution is principled approach that relies on the Monte Carlo Tree Search (MCTS) algorithm to maximize suitable objective function efficiently despite complexity of problem. Like previous work, we first project input cloud top view create density map and extract room proposals it. selects optimizes polygonal shapes these jointly fit outputs an accurate vectorized even large complex scenes. To...

10.48550/arxiv.2103.11161 preprint EN other-oa arXiv (Cornell University) 2021-01-01