- Advanced Vision and Imaging
- Advanced Image and Video Retrieval Techniques
- Robotics and Sensor-Based Localization
- Optical Measurement and Interference Techniques
- Advanced Image Processing Techniques
- Image Processing Techniques and Applications
- Image Enhancement Techniques
- Image Retrieval and Classification Techniques
- Advanced Neural Network Applications
- Visual Attention and Saliency Detection
- Video Analysis and Summarization
- 3D Surveying and Cultural Heritage
- Medical Image Segmentation Techniques
- Explainable Artificial Intelligence (XAI)
- Speech and Audio Processing
- Music and Audio Processing
- Multimodal Machine Learning Applications
- Computer Graphics and Visualization Techniques
- 3D Shape Modeling and Analysis
- Interactive and Immersive Displays
- Indoor and Outdoor Localization Technologies
- Video Coding and Compression Technologies
- Data Visualization and Analytics
- Domain Adaptation and Few-Shot Learning
- Cell Image Analysis Techniques
Google (United States)
2018-2023
Perceptive Engineering (United Kingdom)
2017
Microsoft Research (United Kingdom)
2016
Microsoft (United States)
2016
Cornell University
2009-2014
We present an end-to-end system for augmented and virtual reality telepresence, called Holoportation. Our system demonstrates high-quality, real-time 3D reconstructions of an entire space, including people, furniture and objects, using a set of new depth cameras. These models can also be transmitted in real-time to remote users. This allows users wearing virtual or augmented reality displays to see, hear and interact with remote participants in 3D, almost as if they were present in the same physical space. From an audio-visual perspective, communicating and interacting with remote users edges...
This paper presents an algorithm for Interactive Co-segmentation of a foreground object from a group of related images. While previous approaches focus on unsupervised co-segmentation, we use successful ideas from the interactive object-cutout literature. We develop an algorithm that allows users to decide what the foreground is, and then guide the output of the co-segmentation towards it via scribbles. Interestingly, keeping the user in the loop leads to simpler and highly parallelizable energy functions, allowing us to work with significantly more images...
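The abstract leans on scribble-guided energy terms without spelling them out; the NumPy sketch below is a toy stand-in showing how user scribbles might seed per-pixel foreground costs from colour histograms. The function name, histogram binning, and cost form are all assumptions for illustration, not the paper's formulation.

```python
import numpy as np

def scribble_unaries(image, fg_mask, bg_mask, bins=16):
    """Per-pixel foreground cost from colour histograms fitted to user scribbles.

    image: HxWx3 uint8 array; fg_mask / bg_mask: HxW booleans marking scribbled
    pixels. Returns -log P(fg) + log P(bg), so negative values favour foreground.
    A toy stand-in, not the energy used in the paper.
    """
    quant = (image // (256 // bins)).astype(int).reshape(-1, 3)
    idx = quant[:, 0] * bins * bins + quant[:, 1] * bins + quant[:, 2]

    def hist(mask):
        # Laplace-smoothed colour histogram over the scribbled pixels.
        h = np.bincount(idx[mask.reshape(-1)], minlength=bins ** 3) + 1.0
        return h / h.sum()

    p_fg, p_bg = hist(fg_mask), hist(bg_mask)
    return (-np.log(p_fg[idx]) + np.log(p_bg[idx])).reshape(image.shape[:2])
```

In a full pipeline these unary costs would be combined with a pairwise smoothness term and minimized per image, which is what makes the scribble-seeded formulation easy to parallelize across the group.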
We contribute a new pipeline for live multi-view performance capture, generating temporally coherent high-quality reconstructions in real-time. Our algorithm supports both incremental reconstruction, improving the surface estimation over time, as well as parameterizing the nonrigid scene motion. Our approach is highly robust to large frame-to-frame motion and topology changes, allowing us to reconstruct extremely challenging scenes. We demonstrate advantages over related real-time techniques that either deform an...
This paper presents HITNet, a novel neural network architecture for real-time stereo matching. Contrary to many recent approaches that operate on a full cost volume and rely on 3D convolutions, our approach does not explicitly build a volume and instead relies on a fast multi-resolution initialization step, differentiable 2D geometric propagation and warping mechanisms to infer disparity hypotheses. To achieve a high level of accuracy, our network not only geometrically reasons about disparities but also infers slanted plane hypotheses...
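Since the abstract mentions slanted plane hypotheses and geometric warping without detail, here is a minimal NumPy sketch of what a slanted-plane disparity hypothesis and a simple 1D warp could look like. The tile size, centering convention, and function names are illustrative assumptions, not HITNet's actual implementation.

```python
import numpy as np

def tile_disparity(d0, dx, dy, tile_size=4):
    """Evaluate a slanted-plane disparity hypothesis over one tile.

    The plane is parameterized by the disparity d0 at the tile center and
    per-pixel slopes (dx, dy), so d(x, y) = d0 + dx * x + dy * y with (x, y)
    measured from the tile center.
    """
    half = (tile_size - 1) / 2.0
    ys, xs = np.mgrid[0:tile_size, 0:tile_size]
    return d0 + dx * (xs - half) + dy * (ys - half)

def warp_row(right_row, disparities):
    """Warp one image row toward the left view by sampling at x - d(x) with
    linear interpolation (zero padding outside the image)."""
    xs = np.arange(right_row.shape[0]) - disparities
    x0 = np.floor(xs).astype(int)
    frac = xs - x0

    def sample(idx):
        valid = (idx >= 0) & (idx < right_row.shape[0])
        out = np.zeros_like(right_row, dtype=float)
        out[valid] = right_row[idx[valid]]
        return out

    return (1 - frac) * sample(x0) + frac * sample(x0 + 1)
```

The point of the slanted-plane parameterization is that warping and upsampling can respect surfaces that are not fronto-parallel, rather than assuming one constant disparity per tile.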
We present "The Relightables", a volumetric capture system for photorealistic and high quality relightable full-body performance capture. While significant progress has been made on systems, focusing 3D geometric reconstruction with resolution textures, much less work done to recover photometric properties needed relighting. Results from such systems lack high-frequency details the subject's shading is prebaked into texture. In contrast, large body of addressed acquisition image-based...
We present Motion2Fusion, a state-of-the-art 360 performance capture system that enables *real-time* reconstruction of arbitrary non-rigid scenes. We provide three major contributions over prior work: 1) a new fusion pipeline allowing for far more faithful high frequency geometric details, avoiding the over-smoothing and visual artifacts observed previously. 2) high speed coupled with a machine learning technique for 3D correspondence field estimation, reducing tracking errors that are attributed to fast motions...
Motivated by augmented and virtual reality applications such as telepresence, there has been a recent focus on real-time performance capture of humans under motion. However, given the real-time constraint, these systems often suffer from artifacts in geometry and texture such as holes and noise in the final rendering, poor lighting, and low-resolution textures. We take a novel approach to augment such systems with a deep architecture that takes a rendering from an arbitrary viewpoint and jointly performs completion, super resolution, and denoising of the imagery...
Mobile devices with passive depth sensing capabilities are ubiquitous, and recently active depth sensors have become available on some tablets and AR/VR devices. Although real-time depth data is accessible, its rich value to mainstream AR applications has been sorely under-explored. Adoption of depth-based UX has been impeded by the complexity of performing even simple operations with raw depth data, such as detecting intersections or constructing meshes. In this paper, we introduce DepthLab, a software library that encapsulates...
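To make the "simple operations on raw depth data" concrete, the snippet below sketches a depth-based hit test: back-projecting a tapped pixel through pinhole intrinsics to get a 3D point. The helper names and signatures are hypothetical and are not the DepthLab API.

```python
import numpy as np

def unproject(u, v, depth_m, fx, fy, cx, cy):
    """Back-project pixel (u, v) with metric depth into a camera-space 3D point
    using a pinhole intrinsics model. Hypothetical helper, not DepthLab's API."""
    x = (u - cx) * depth_m / fx
    y = (v - cy) * depth_m / fy
    return np.array([x, y, depth_m])

def depth_hit_test(depth_map, u, v, fx, fy, cx, cy):
    """Return the 3D point 'hit' by a screen tap, or None if depth is missing."""
    d = float(depth_map[int(v), int(u)])
    if d <= 0.0:  # zero or negative values treated as invalid depth
        return None
    return unproject(u, v, d, fx, fy, cx, cy)
```

Even an operation this small requires handling missing depth and knowing the camera intrinsics, which is the kind of boilerplate a library can hide from AR developers.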
Augmented reality (AR) for smartphones has matured from a technology for earlier adopters, available only on select high-end phones, to one that is truly available to the general public. One of the key breakthroughs has been in low-compute methods for six degree of freedom (6DoF) tracking on phones using existing hardware (camera and inertial sensors). 6DoF tracking is the cornerstone of smartphone AR, allowing virtual content to be precisely locked on top of the real world. However, to really give users the impression of believable AR, one requires mobile depth. Without...
Structured light sensors are popular due to their robustness to untextured scenes and multipath. These systems triangulate depth by solving a correspondence problem between each camera and projector pixel. This is often framed as a local stereo matching task, correlating patches of pixels in the observed and reference image. However, this is computationally intensive, leading to reduced accuracy and framerate. We contribute an algorithm for solving this correspondence problem efficiently, without compromising accuracy. For the first time, we cast...
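As a reference point for the correspondence problem described above, a brute-force patch-correlation matcher, i.e. the kind of costly baseline this line of work aims to avoid, might look like the NumPy sketch below. Patch size, disparity range, and ZNCC scoring are arbitrary choices here, not the paper's method.

```python
import numpy as np

def zncc(a, b, eps=1e-6):
    """Zero-normalized cross-correlation between two equally sized patches."""
    a = a - a.mean()
    b = b - b.mean()
    return float((a * b).sum() / (np.linalg.norm(a) * np.linalg.norm(b) + eps))

def match_pixel(observed, reference, y, x, max_disp=64, patch=7):
    """Brute-force correspondence search for one pixel: slide a patch from the
    observed image along the same row of the reference pattern and keep the
    offset with the highest ZNCC score. Assumes (y, x) lies at least patch//2
    pixels away from the image border."""
    r = patch // 2
    tmpl = observed[y - r:y + r + 1, x - r:x + r + 1]
    best_d, best_score = 0, -np.inf
    for d in range(max_disp):
        xr = x - d
        if xr - r < 0:
            break
        cand = reference[y - r:y + r + 1, xr - r:xr + r + 1]
        score = zncc(tmpl, cand)
        if score > best_score:
            best_d, best_score = d, score
    return best_d, best_score
```

Running this search per pixel over the disparity range is what makes naive local matching expensive, hence the interest in learned or constant-time alternatives.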
The state of the art in articulated hand tracking has been greatly advanced by hybrid methods that fit a generative model to depth data, leveraging both temporally and discriminatively predicted starting poses. In this paradigm, the generative model is used to define an energy function, and a local iterative optimization is performed from these starting poses in order to find a "good minimum" (i.e. a minimum close to the true pose). Performing the optimization quickly is key to exploring more starting poses, performing more iterations and, crucially, exploiting high frame rates that ensure...
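The hybrid recipe above (an energy defined by a generative model, minimized locally from several predicted starting poses) can be sketched generically. The code below uses SciPy's L-BFGS-B on a toy energy purely to illustrate the multi-start loop; it is not the paper's hand model, energy, or optimizer.

```python
import numpy as np
from scipy.optimize import minimize

def fit_from_starts(energy, starting_poses):
    """Multi-start local optimization: run a local optimizer from each predicted
    starting pose and keep the best minimum found. In a real tracker, 'energy'
    would compare a rendered hand model against the observed depth frame."""
    best = None
    for pose0 in starting_poses:
        res = minimize(energy, pose0, method="L-BFGS-B")
        if best is None or res.fun < best.fun:
            best = res
    return best

# Toy quadratic energy with a known minimum, just to exercise the loop.
target = np.array([0.3, -1.2, 0.7])
energy = lambda p: float(np.sum((p - target) ** 2))
starts = [np.zeros(3), np.ones(3), -np.ones(3)]
print(fit_from_starts(energy, starts).x)
```

The cheaper each local solve is, the more starting poses and iterations fit into a frame budget, which is exactly the speed argument made in the abstract.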
We present a novel technique to relight images of human faces by learning a model of facial reflectance from a database of 4D reflectance field data of several subjects in a variety of expressions and viewpoints. Using our learned model, a face can be relit in arbitrary illumination environments using only two original images recorded under spherical color gradient illumination. The output of our deep network indicates that the two images contain the information needed to estimate the full reflectance field, including specular reflections and high frequency details. While...
In recent years, there has been a proliferation of multimedia applications that leverage machine learning (ML) for interactive experiences. Prototyping ML-based applications is, however, still challenging, given complex workflows that are not ideal for design and experimentation. To better understand these challenges, we conducted a formative study with seven ML practitioners to gather insights about common evaluation workflows.
Efficient estimation of depth from pairs of stereo images is one of the core problems in computer vision. We efficiently solve the specialized problem of stereo matching under active illumination using a new learning-based algorithm. This type of stereo, i.e. where the scene texture is augmented by an active light projector, is proving compelling for designing depth cameras, largely due to improved robustness when compared to time of flight or traditional structured light techniques. Our algorithm uses an unsupervised greedy optimization scheme that learns...
Scene understanding includes many related subtasks, such as scene categorization, depth estimation, object detection, etc. Each of these subtasks is often notoriously hard, and state-of-the-art classifiers already exist for many of them. These classifiers operate on the same raw image and provide correlated outputs. It is desirable to have an algorithm that can capture such correlation without requiring any changes to the inner workings of any classifier. We propose Feedback Enabled Cascaded Classification Models (FE-CCM), which jointly...
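A rough picture of cascading black-box classifiers, assuming scikit-learn and synthetic stand-in data: first-layer predictions for every subtask are appended to the features of a second layer, so correlations between subtasks can be exploited without modifying the original classifiers. The feedback/retraining step that gives FE-CCM its name is omitted from this sketch.

```python
import numpy as np
from sklearn.linear_model import LogisticRegression

rng = np.random.default_rng(0)
X = rng.normal(size=(200, 10))                    # stand-in image features
tasks = {"scene": (X[:, 0] > 0).astype(int),      # stand-in subtask labels
         "object": (X[:, 1] + X[:, 2] > 0).astype(int)}

# First layer: one off-the-shelf classifier per subtask, treated as a black box.
layer1 = {t: LogisticRegression().fit(X, y) for t, y in tasks.items()}
outputs = np.column_stack([m.predict_proba(X)[:, 1] for m in layer1.values()])

# Second layer: re-predict each subtask from the original features plus the
# correlated outputs of all first-layer classifiers.
X2 = np.hstack([X, outputs])
layer2 = {t: LogisticRegression().fit(X2, y) for t, y in tasks.items()}
```

Because the coupling happens only through concatenated outputs, the individual classifiers never need to be reimplemented, which is the property the abstract highlights.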
The advent of consumer depth cameras has incited the development of a new cohort of algorithms tackling challenging computer vision problems. The primary reason is that depth provides direct geometric information that is largely invariant to texture and illumination. As such, substantial progress has been made in human and object pose estimation, 3D reconstruction, and simultaneous localization and mapping. Most of these algorithms naturally benefit from the ability to accurately track an object or scene of interest from one frame to the next. However, commercially...
This paper presents an active-learning algorithm for piecewise planar 3D reconstruction of a scene. While previous interactive algorithms require the user to provide tedious interactions to identify all planes in the scene, we build on successful ideas from automatic reconstruction and introduce the idea of active learning, thereby improving the reconstructions while considerably reducing user effort. Our algorithm first attempts to obtain a reconstruction of the scene automatically through an energy minimization framework. The proposed approach then uses intuitive cues...
We introduce a realtime compression architecture for 4D performance capture that is two orders of magnitude faster than current state-of-the-art techniques, yet achieves comparable visual quality and bitrate. We note how much of the algorithmic complexity in traditional approaches arises from the necessity to encode geometry using an explicit model (i.e. a triangle mesh). In contrast, we propose an encoder that leverages an implicit representation (namely a Signed Distance Function) to represent the observed geometry, as well as its...
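To illustrate the implicit representation the encoder builds on, the sketch below fills a small truncated signed distance grid for a toy sphere and recovers an explicit mesh with marching cubes (scikit-image assumed available). The grid resolution, truncation band, and toy shape are arbitrary and nothing here reflects the paper's encoder.

```python
import numpy as np
from skimage import measure  # marching cubes for mesh extraction

# Build a small truncated signed distance grid for a sphere: each voxel stores
# its signed distance to the surface, clipped to +/- trunc. The geometry is
# thus represented implicitly, without an explicit triangle mesh.
res, radius, trunc = 64, 0.4, 0.05
coords = np.linspace(-0.5, 0.5, res)
x, y, z = np.meshgrid(coords, coords, coords, indexing="ij")
sdf = np.sqrt(x**2 + y**2 + z**2) - radius
tsdf = np.clip(sdf, -trunc, trunc)

# An explicit mesh can still be recovered on demand, e.g. at the decoder side.
verts, faces, _, _ = measure.marching_cubes(tsdf, level=0.0)
print(verts.shape, faces.shape)
```

Working on a regular grid of distance values rather than a triangle mesh is what lets compression be framed as a volumetric signal-coding problem instead of a mesh-encoding one.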
Numerous computer vision problems such as stereo depth estimation, object-class segmentation and foreground/background segmentation can be formulated as per-pixel image labeling tasks. Given one or many images as input, the desired output of these methods is usually a spatially smooth assignment of labels. The large number of such problems has led to significant research efforts, with the state of the art moving from CRF-based approaches to deep CNNs and, more recently, hybrids of the two. Although these have significantly advanced the state of the art, the vast majority solely...
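As a reminder of the formulation behind both the CRF and CNN lines of work mentioned above, the snippet below defines a toy per-pixel labeling energy with a unary term from per-class scores and a Potts smoothness term, and contrasts it with the independent per-pixel argmax a purely feed-forward CNN would produce. All names, weights, and the random scores are illustrative only.

```python
import numpy as np

def label_energy(scores, labels, smoothness=1.0):
    """Energy of a per-pixel labeling: unary cost from per-class scores plus a
    Potts pairwise term that charges for label changes between 4-neighbours."""
    h, w, _ = scores.shape
    unary = -scores[np.arange(h)[:, None], np.arange(w)[None, :], labels].sum()
    pairwise = (labels[:, 1:] != labels[:, :-1]).sum() + \
               (labels[1:, :] != labels[:-1, :]).sum()
    return unary + smoothness * pairwise

# A CNN-only baseline effectively ignores the pairwise term:
scores = np.random.rand(4, 5, 3)        # toy per-pixel class scores (H, W, C)
labels = scores.argmax(axis=-1)         # independent per-pixel argmax
print(label_energy(scores, labels))
```

Hybrid approaches keep the learned unary scores but also minimize (or unroll) the smoothness term, which is where the CRF machinery re-enters the picture.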