NFDI4DS | UHH-SEMS - Publication Details

Carlos Esteves

ORCID: 0000-0001-9413-1201

About

Contact & Profiles

A5030999390

Research Areas

3D Shape Modeling and Analysis
Robotics and Sensor-Based Localization
Advanced Vision and Imaging
Computer Graphics and Visualization Techniques
Advanced Neural Network Applications
Medical Image Segmentation Techniques
Image Processing and 3D Reconstruction
Human Pose and Action Recognition
Advanced Image and Video Retrieval Techniques
Image and Object Detection Techniques
Medical Imaging and Analysis
Geophysics and Gravity Measurements
3D Surveying and Cultural Heritage
Computational Physics and Python Applications
Optical measurement and interference techniques
Generative Adversarial Networks and Image Synthesis
Multimodal Machine Learning Applications
Neural Networks and Applications
Remote Sensing and LiDAR Applications
Pulsars and Gravitational Waves Research
Image Retrieval and Classification Techniques
Domain Adaptation and Few-Shot Learning
Machine Learning and Data Classification
Advanced Data Compression Techniques
Industrial Vision Systems and Defect Detection

Google (United States)
2021-2024

University of Pennsylvania
2017-2023

Hospital Israelita Albert Einstein
2022

Google (Canada)
2022

Philadelphia University
2019

California University of Pennsylvania
2017

Light Field Neural Rendering

OPENALEX - Publications

Mohammed Suhail Carlos Esteves Leonid Sigal Ameesh Makadia

Classical light field rendering for novel view synthesis can accurately reproduce view-dependent effects such as reflection, refraction, and translucency, but requires a dense sampling of the scene. Methods based on geometric reconstruction need only sparse views, but cannot accurately model non-Lambertian effects. We introduce a model that combines the strengths and mitigates the limitations of these two directions. By operating on a four-dimensional representation of the light field, our model learns to represent view-dependent effects accurately. Enforcing constraints...

10.1109/cvpr52688.2022.00809 article EN 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) 2022-06-01

Learning SO(3) Equivariant Representations with Spherical CNNs

OPENALEX - Publications

Carlos Esteves Christine Allen-Blanchette Ameesh Makadia Kostas Daniilidis

10.1007/s11263-019-01220-1 article EN International Journal of Computer Vision 2019-09-06

Equivariant Multi-View Networks

OPENALEX - Publications

Carlos Esteves Yinshuang Xu Christine Allen-Blanchette Kostas Daniilidis

Several popular approaches to 3D vision tasks process multiple views of the input independently with deep neural networks pre-trained on natural images, where view permutation invariance is achieved through a single round of pooling over all views. We argue that this operation discards important information and leads to subpar global descriptors. In this paper, we propose a group convolutional approach to multiple view aggregation, where convolutions are performed over a discrete subgroup of the rotation group, enabling, thus, joint...

10.1109/iccv.2019.00165 article EN 2019 IEEE/CVF International Conference on Computer Vision (ICCV) 2019-10-01
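
The contrast drawn in the abstract above, a single pooling over views versus convolution over a discrete rotation subgroup, can be illustrated with a minimal sketch. It assumes the views are evenly spaced on a circle so that the subgroup is the cyclic group C_n, and it uses one scalar descriptor per view. The helper name `cyclic_group_conv` and all numeric values are hypothetical; this is not the authors' implementation.

```python
import numpy as np

def cyclic_group_conv(features, kernel):
    """Convolve per-view features over the cyclic group C_n (hypothetical helper).

    out[g] = sum_h features[(g - h) mod n] * kernel[h]
    """
    n = len(features)
    return np.array([
        sum(features[(g - h) % n] * kernel[h] for h in range(n))
        for g in range(n)
    ])

# One scalar descriptor per view; 6 evenly spaced views (assumed setup).
views = np.array([0.2, 1.5, -0.3, 0.8, 2.0, -1.1])
kernel = np.array([0.5, 0.25, 0.0, 0.0, 0.0, 0.25])

out = cyclic_group_conv(views, kernel)

# Rotating the object by one step cyclically shifts the order of the views.
shifted_out = cyclic_group_conv(np.roll(views, 1), kernel)

# Equivariance: the output shifts in the same way as the input.
equivariant = np.allclose(shifted_out, np.roll(out, 1))

# A single max-pool over views is invariant, but it discards the view ordering.
invariant = np.isclose(np.max(views), np.max(np.roll(views, 1)))
print(equivariant, invariant)
```

Because the group convolution output still carries one value per group element, further group convolution layers can be stacked before any final pooling, which is the kind of joint reasoning over views that a single pooling step cannot provide.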

Polar Transformer Networks

OPENALEX - Publications

Carlos Esteves Christine Allen-Blanchette Xiaowei Zhou Kostas Daniilidis

Convolutional neural networks (CNNs) are inherently equivariant to translation. Efforts to embed other forms of equivariance have concentrated solely on rotation. We expand the notion of equivariance in CNNs through the Polar Transformer Network (PTN). PTN combines ideas from the Spatial Transformer Network (STN) and canonical coordinate representations. The result is a network invariant to translation and equivariant to both rotation and scale. PTN is trained end-to-end and composed of three distinct stages: a polar origin predictor, the newly introduced polar transformer module...

10.48550/arxiv.1709.01889 preprint EN other-oa arXiv (Cornell University) 2017-01-01
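
The role of canonical coordinates in the abstract above can be made concrete with a small sketch. It assumes log-polar coordinates about a known origin, where a rotation and a uniform scaling of the plane both turn into plain shifts, which is the transformation ordinary convolutions are already equivariant to. The helper `to_log_polar` and the sample values are hypothetical and are not taken from the PTN code.

```python
import numpy as np

def to_log_polar(points, origin):
    """Map 2D points to (log-radius, angle) about `origin` (hypothetical helper)."""
    d = np.asarray(points, dtype=float) - np.asarray(origin, dtype=float)
    log_r = np.log(np.hypot(d[..., 0], d[..., 1]))
    angle = np.arctan2(d[..., 1], d[..., 0])
    return log_r, angle

origin = np.array([0.0, 0.0])
points = np.array([[1.0, 0.5], [-0.3, 2.0], [0.7, -1.2]])

theta, scale = 0.3, 2.0  # arbitrary rotation (radians) and scale factor
rot = np.array([[np.cos(theta), -np.sin(theta)],
                [np.sin(theta),  np.cos(theta)]])
transformed = scale * points @ rot.T  # rotate each point, then scale it

log_r0, ang0 = to_log_polar(points, origin)
log_r1, ang1 = to_log_polar(transformed, origin)

# Scaling becomes a shift of log(scale) along the log-radius axis.
radial_shift = log_r1 - log_r0
# Rotation becomes a shift of theta along the angular axis (wrapped to (-pi, pi]).
angular_shift = np.angle(np.exp(1j * (ang1 - ang0)))
```

Every point ends up shifted by the same amount in both coordinates, so a feature map resampled on this grid moves rigidly under rotation and scale. The sketch takes the origin as given; in the abstract, predicting it is the job of the polar origin predictor stage.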

Fast Multi-image Matching via Density-Based Clustering

OPENALEX - Publications

Roberto Tron Xiaowei Zhou Carlos Esteves Kostas Daniilidis

We consider the problem of finding consistent matches across multiple images. Current state-of-the-art solutions use constraints on cycles of matches together with convex optimization, leading to computationally intensive iterative algorithms. In this paper, we instead propose a clustering-based formulation: we first rigorously show its equivalence with traditional approaches, and then propose QuickMatch, a novel algorithm that identifies multi-image matches from a density function in feature space. Specifically, QuickMatch uses...

10.1109/iccv.2017.437 article EN 2017-10-01

An Analysis of SVD for Deep Rotation Estimation

OPENALEX - Publications

Jake Levinson Carlos Esteves Kefan Chen Noah Snavely Angjoo Kanazawa and 2 more

Symmetric orthogonalization via SVD, and closely related procedures, are well-known techniques for projecting matrices onto $O(n)$ or $SO(n)$. These tools have long been used for applications in computer vision, for example optimal 3D alignment problems solved by orthogonal Procrustes, rotation averaging, or Essential matrix decomposition. Despite its utility in different settings, SVD orthogonalization as a procedure for producing rotation matrices is typically overlooked in deep learning models, where the preferences tend toward classic...

10.48550/arxiv.2006.14616 preprint EN other-oa arXiv (Cornell University) 2020-01-01
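
The projection onto $SO(n)$ mentioned in the abstract above can be sketched in a few lines of NumPy. This is a minimal illustration of the standard SVD-based procedure, not the code used in the paper; the function name `svd_orthogonalize` and the random input are hypothetical.

```python
import numpy as np

def svd_orthogonalize(m):
    """Project a square matrix onto SO(n) via SVD (hypothetical helper).

    With m = U S V^T, the closest rotation in the Frobenius sense is
    U diag(1, ..., 1, det(U V^T)) V^T.
    """
    u, _, vt = np.linalg.svd(m)
    # Flip the last singular direction if needed so the determinant is +1.
    d = np.ones(m.shape[-1])
    d[-1] = np.sign(np.linalg.det(u @ vt))
    return u @ np.diag(d) @ vt

# Stand-in for an unconstrained 3x3 output, e.g. from a network head (assumed).
rng = np.random.default_rng(0)
raw = rng.normal(size=(3, 3))

rotation = svd_orthogonalize(raw)
```

Dropping the determinant correction yields the projection onto $O(n)$ instead, which may return a reflection.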

LU-NeRF: Scene and Pose Estimation by Synchronizing Local Unposed NeRFs

OPENALEX - Publications

Zezhou Cheng Carlos Esteves Varun Jampani Abhishek Kar Subhransu Maji and 1 more

A critical obstacle preventing NeRF models from being deployed broadly in the wild is their reliance on accurate camera poses. Consequently, there is growing interest in extending NeRF models to jointly optimize camera poses and scene representation, which offers an alternative to off-the-shelf SfM pipelines, which have well-understood failure modes. Existing approaches for unposed NeRF operate under limiting assumptions, such as a prior pose distribution or coarse pose initialization, making them less effective in a general setting. In...

10.1109/iccv51070.2023.01679 article EN 2023 IEEE/CVF International Conference on Computer Vision (ICCV) 2023-10-01

Theoretical Aspects of Group Equivariant Neural Networks

OPENALEX - Publications

Carlos Esteves

Group equivariant neural networks have been explored in the past few years and are interesting from theoretical and practical standpoints. They leverage concepts from group representation theory, non-commutative harmonic analysis and differential geometry that do not often appear in machine learning. In practice, they have been shown to reduce sample and model complexity, notably in challenging tasks where input transformations such as arbitrary rotations are present. We begin this work with an exposition of group representation theory and the machinery...

10.48550/arxiv.2004.05154 preprint EN other-oa arXiv (Cornell University) 2020-01-01

Implicit-PDF: Non-Parametric Representation of Probability Distributions on the Rotation Manifold

OPENALEX - Publications

Kieran J. Murphy Carlos Esteves Varun Jampani Srikumar Ramalingam Ameesh Makadia

Single image pose estimation is a fundamental problem in many vision and robotics tasks, and existing deep learning approaches suffer by not completely modeling and handling: i) uncertainty about the predictions, and ii) symmetric objects with multiple (sometimes infinite) correct poses. To this end, we introduce a method to estimate arbitrary, non-parametric distributions on SO(3). Our key idea is to represent the distributions implicitly, with a neural network that estimates the probability given the input image and a candidate pose. Grid sampling or...

10.48550/arxiv.2106.05965 preprint EN cc-by arXiv (Cornell University) 2021-01-01

Influenza vaccination strategy in acute coronary syndromes: the VIP-ACS trial

OPENALEX - Publications

Henrique Fonseca Remo H.M. Furtado André Zimerman Pedro A. Lemos Marcelo Franken and 29 more

To evaluate whether a strategy of double-dose influenza vaccination during hospitalization for an acute coronary syndrome (ACS) compared with standard-dose outpatient vaccination (as recommended by current guidelines) would further reduce the risk of major cardiopulmonary events. Vaccination against Influenza to Prevent cardiovascular events after Acute Coronary Syndromes (VIP-ACS) was a pragmatic, randomized, multicentre, active-comparator, open-label trial with blinded outcome adjudication comparing two...

10.1093/eurheartj/ehac472 article EN European Heart Journal 2022-08-24

ASIC: Aligning Sparse in-the-wild Image Collections

OPENALEX - Publications

Kamal Gupta Varun Jampani Carlos Esteves Abhinav Shrivastava Ameesh Makadia and 2 more

We present a method for joint alignment of sparse in-the-wild image collections of an object category. Most prior works assume either ground-truth keypoint annotations or a large dataset of images of a single object category. However, neither of the above assumptions hold true for the long-tail of the objects present in the world. We present a self-supervised technique that directly optimizes on a sparse collection of images of a particular object/object category to obtain consistent dense correspondences across the collection. We use pairwise nearest neighbors obtained from deep features...

10.1109/iccv51070.2023.00382 article EN 2023 IEEE/CVF International Conference on Computer Vision (ICCV) 2023-10-01

Spin-Weighted Spherical CNNs

OPENALEX - Publications

Carlos Esteves Ameesh Makadia Kostas Daniilidis

Learning equivariant representations is a promising way to reduce sample and model complexity and improve the generalization performance of deep neural networks. The spherical CNNs are successful examples, producing SO(3)-equivariant representations of spherical inputs. There are two main types of spherical CNNs. The first type lifts the inputs to functions on the rotation group SO(3) and applies convolutions on the group, which are computationally expensive since SO(3) has one extra dimension. The second type applies convolutions directly on the sphere, which are limited to zonal (isotropic) filters, and thus have limited expressivity....

10.48550/arxiv.2006.10731 preprint EN other-oa arXiv (Cornell University) 2020-01-01

Scaling Spherical CNNs

OPENALEX - Publications

Carlos Esteves Jean-Jacques Slotine Ameesh Makadia

Spherical CNNs generalize CNNs to functions on the sphere, by using spherical convolutions as the main linear operation. The most accurate and efficient way to compute spherical convolutions is in the spectral domain (via the convolution theorem), which is still costlier than the usual planar convolutions. For this reason, applications of spherical CNNs have so far been limited to small problems that can be approached with low model capacity. In this work, we show how spherical CNNs can be scaled for much larger problems. To achieve this, we make critical improvements including novel...

10.48550/arxiv.2306.05420 preprint EN cc-by arXiv (Cornell University) 2023-01-01

Cross-Domain 3D Equivariant Image Embeddings

OPENALEX - Publications

Carlos Esteves Avneesh Sud Zhengyi Luo Kostas Daniilidis Ameesh Makadia

Spherical convolutional networks have been introduced recently as tools to learn powerful feature representations of 3D shapes. Spherical CNNs are equivariant to 3D rotations, making them ideally suited to applications where 3D data may be observed in arbitrary orientations. In this paper we learn 2D image embeddings with a similar equivariant structure: embedding the image of a 3D object should commute with rotations of the object. We introduce a cross-domain embedding from 2D images into a spherical CNN latent space. This embedding encodes images with 3D shape properties and is equivariant to 3D rotations of the observed object. The model is supervised only by...

10.48550/arxiv.1812.02716 preprint EN other-oa arXiv (Cornell University) 2018-01-01

Labeling Panoramas with Spherical Hourglass Networks

OPENALEX - Publications

Carlos Esteves Kostas Daniilidis Ameesh Makadia

With the recent proliferation of consumer-grade 360° cameras, it is worth revisiting visual perception challenges with spherical cameras given the potential benefit of their global field of view. To this end we introduce a spherical convolutional hourglass network (SCHN) for dense labeling on the sphere. The SCHN is invariant to camera orientation (lifting the usual requirement for `upright' panoramic images), and its design is scalable for larger practical datasets. Initial experiments show promising results on a spherical semantic segmentation task.

10.48550/arxiv.1809.02123 preprint EN other-oa arXiv (Cornell University) 2018-01-01
