NFDI4DS | UHH-SEMS - Publication Details

PointFlow: 3D Point Cloud Generation With Continuous Normalizing Flows

OPENALEX - Publications

Guandao Yang Xun Huang Zekun Hao Mingyu Liu Serge Belongie and 1 more

As 3D point clouds become the representation of choice for multiple vision and graphics applications, ability to synthesize or reconstruct high-resolution, high-fidelity becomes crucial. Despite recent success deep learning models in discriminative tasks clouds, generating remains challenging. This paper proposes a principled probabilistic framework generate by modeling them as distribution distributions. Specifically, we learn two-level hierarchy distributions where first level is shapes...

10.1109/iccv.2019.00464 article EN 2021 IEEE/CVF International Conference on Computer Vision (ICCV) 2019-10-01

Learning to Evaluate Image Captioning

OPENALEX - Publications

Yin Cui Guandao Yang Andreas Veit Xun Huang Serge Belongie

Evaluation metrics for image captioning face two challenges. Firstly, commonly used such as CIDEr, METEOR, ROUGE and BLEU often do not correlate well with human judgments. Secondly, each metric has known blind spots to pathological caption constructions, rule-based lack provisions repair once identified. For example, the newly proposed SPICE correlates judgments, but fails capture syntactic structure of a sentence. To address these challenges, we propose novel learning based discriminative...

10.1109/cvpr.2018.00608 preprint EN 2018-06-01

Geometry Processing with Neural Fields

OPENALEX - Publications

Guandao Yang

No abstract available.

10.1145/3623053.3623365 article FR Siggraph Asia Doctoral Consortium 2023-11-23

GPT-4V(ision) is a Human-Aligned Evaluator for Text-to-3D Generation

OPENALEX - Publications

Tong Wu Guandao Yang Zhibing Li Kai Zhang Ziwei Liu and 3 more

10.1109/cvpr52733.2024.02098 article EN 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) 2024-06-16

Self-Calibrating Gaussian Splatting for Large Field of View Reconstruction

OPENALEX - Publications

Youming Deng Wenqi Xian Guandao Yang Leonidas Guibas Gordon Wetzstein and 2 more

In this paper, we present a self-calibrating framework that jointly optimizes camera parameters, lens distortion and 3D Gaussian representations, enabling accurate efficient scene reconstruction. particular, our technique enables high-quality reconstruction from Large field-of-view (FOV) imagery taken with wide-angle lenses, allowing the to be modeled smaller number of images. Our approach introduces novel method for modeling complex distortions using hybrid network combines invertible...

10.48550/arxiv.2502.09563 preprint EN arXiv (Cornell University) 2025-02-13

QPyTorch: A Low-Precision Arithmetic Simulation Framework

OPENALEX - Publications

Tianyi Zhang Zhiqiu Lin Guandao Yang Christopher De

Low-precision training reduces computational cost and produces efficient models. Recent research in developing new low-precision algorithms often relies on simulation to empirically evaluate the statistical effects of quantization while avoiding substantial overhead building specific hardware. To support this empirical research, we introduce QPyTorch, a arithmetic framework. Built natively PyTorch, QPyTorch provides convenient interface that minimizes efforts needed reliably convert existing...

10.1109/emc2-nips53020.2019.00010 article EN 2019-12-01

SWALP : Stochastic Weight Averaging in Low-Precision Training

OPENALEX - Publications

Guandao Yang Tianyi Zhang Polina Kirichenko Junwen Bai Andrew Gordon Wilson and 1 more

Low precision operations can provide scalability, memory savings, portability, and energy efficiency. This paper proposes SWALP, an approach to low training that averages low-precision SGD iterates with a modified learning rate schedule. SWALP is easy implement match the performance of full-precision even all numbers quantized down 8 bits, including gradient accumulators. Additionally, we show converges arbitrarily close optimal solution for quadratic objectives, noise ball asymptotically...

10.48550/arxiv.1904.11943 preprint EN other-oa arXiv (Cornell University) 2019-01-01

PointFlow: 3D Point Cloud Generation with Continuous Normalizing Flows

OPENALEX - Publications

Guandao Yang Xun Huang Zekun Hao Mingyu Liu Serge Belongie and 1 more

As 3D point clouds become the representation of choice for multiple vision and graphics applications, ability to synthesize or reconstruct high-resolution, high-fidelity becomes crucial. Despite recent success deep learning models in discriminative tasks clouds, generating remains challenging. This paper proposes a principled probabilistic framework generate by modeling them as distribution distributions. Specifically, we learn two-level hierarchy distributions where first level is shapes...

10.48550/arxiv.1906.12320 preprint EN other-oa arXiv (Cornell University) 2019-01-01

Orthogonal Adaptation for Modular Customization of Diffusion Models

OPENALEX - Publications

Ryan Po Guandao Yang Kfir Aberman Gordon Wetzstein

10.1109/cvpr52733.2024.00761 article EN 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) 2024-06-16

DiffusionPDE: Generative PDE-Solving Under Partial Observation

OPENALEX - Publications

Jiahe Huang Guandao Yang Zichen Wang Jeong Joon Park

We introduce a general framework for solving partial differential equations (PDEs) using generative diffusion models. In particular, we focus on the scenarios where do not have full knowledge of scene necessary to apply classical solvers. Most existing forward or inverse PDE approaches perform poorly when observations data underlying coefficients are incomplete, which is common assumption real-world measurements. this work, propose DiffusionPDE that can simultaneously fill in missing...

10.48550/arxiv.2406.17763 preprint EN arXiv (Cornell University) 2024-06-25

Polynomial Neural Fields for Subband Decomposition and Manipulation

OPENALEX - Publications

Guandao Yang Sagie Benaim Varun Jampani Kyle Genova Jonathan T. Barron and 3 more

Neural fields have emerged as a new paradigm for representing signals, thanks to their ability do it compactly while being easy optimize. In most applications, however, neural are treated like black boxes, which precludes many signal manipulation tasks. this paper, we propose class of called polynomial (PNFs). The key advantage PNF is that can represent composition number manipulable and interpretable components without losing the merits representation. We develop general theoretical...

10.48550/arxiv.2302.04862 preprint EN cc-by arXiv (Cornell University) 2023-01-01

Fast Reading Comprehension with ConvNets

OPENALEX - Publications

Felix Wu Ni Lao John Blitzer Guandao Yang Kilian Q. Weinberger

State-of-the-art deep reading comprehension models are dominated by recurrent neural nets. Their sequential nature is a natural fit for language, but it also precludes parallelization within an instances and often becomes the bottleneck deploying such to latency critical scenarios. This particularly problematic longer texts. Here we present convolutional architecture as alternative these architectures. Using simple dilated units in place of ones, achieve results comparable state art on two...

10.48550/arxiv.1711.04352 preprint EN other-oa arXiv (Cornell University) 2017-01-01

InfoGaussian: Structure-Aware Dynamic Gaussians through Lightweight Information Shaping

OPENALEX - Publications

Yunchao Zhang Guandao Yang Leonidas Guibas Yanchao Yang

3D Gaussians, as a low-level scene representation, typically involve thousands to millions of Gaussians. This makes it difficult control the in ways that reflect underlying dynamic structure, where number independent entities is much smaller. In particular, can be challenging animate and move objects scene, which requires coordination among many To address this issue, we develop mutual information shaping technique enforces movement resonance between correlated Gaussians motion network. Such...

10.48550/arxiv.2406.05897 preprint EN arXiv (Cornell University) 2024-06-09

Learning to Evaluate Image Captioning

OPENALEX - Publications

Yin Cui Guandao Yang Andreas Veit Xun Huang Serge Belongie

Evaluation metrics for image captioning face two challenges. Firstly, commonly used such as CIDEr, METEOR, ROUGE and BLEU often do not correlate well with human judgments. Secondly, each metric has known blind spots to pathological caption constructions, rule-based lack provisions repair once identified. For example, the newly proposed SPICE correlates judgments, but fails capture syntactic structure of a sentence. To address these challenges, we propose novel learning based discriminative...

10.48550/arxiv.1806.06422 preprint EN other-oa arXiv (Cornell University) 2018-01-01

QPyTorch: A Low-Precision Arithmetic Simulation Framework

OPENALEX - Publications

Tianyi Zhang Zhiqiu Lin Guandao Yang Christopher De

Low-precision training reduces computational cost and produces efficient models. Recent research in developing new low-precision algorithms often relies on simulation to empirically evaluate the statistical effects of quantization while avoiding substantial overhead building specific hardware. To support this empirical research, we introduce QPyTorch, a arithmetic framework. Built natively PyTorch, QPyTorch provides convenient interface that minimizes efforts needed reliably convert existing...

10.48550/arxiv.1910.04540 preprint EN other-oa arXiv (Cornell University) 2019-01-01

Learning Gradient Fields for Shape Generation

OPENALEX - Publications

Ruojin Cai Guandao Yang Hadar Averbuch‐Elor Zekun Hao Serge Belongie and 2 more

In this work, we propose a novel technique to generate shapes from point cloud data. A can be viewed as samples distribution of 3D points whose density is concentrated near the surface shape. Point generation thus amounts moving randomly sampled high-density areas. We clouds by performing stochastic gradient ascent on an unnormalized probability density, thereby toward high-likelihood regions. Our model directly predicts log field and trained with simple objective adapted score-based...

10.48550/arxiv.2008.06520 preprint EN other-oa arXiv (Cornell University) 2020-01-01

NeRF Revisited: Fixing Quadrature Instability in Volume Rendering

OPENALEX - Publications

Mikaela Angelina Uy Kiyohiro Nakayama Guandao Yang Rahul Krishna Thomas Leonidas Guibas and 1 more

Neural radiance fields (NeRF) rely on volume rendering to synthesize novel views. Volume requires evaluating an integral along each ray, which is numerically approximated with a finite sum that corresponds the exact ray under piecewise constant density. As consequence, rendered result unstable w.r.t. choice of samples phenomenon we dub quadrature instability. We propose mathematically principled solution by reformulating sample-based equation so it linear This simultaneously resolves...

10.48550/arxiv.2310.20685 preprint EN cc-by arXiv (Cornell University) 2023-01-01

Neural Caches for Monte Carlo Partial Differential Equation Solvers

OPENALEX - Publications

Zilu Li Guandao Yang Xi Deng Christopher De Bharath Hariharan and 1 more

This paper presents a method that uses neural networks as caching mechanism to reduce the variance of Monte Carlo Partial Differential Equation solvers, such Walk-on-Spheres algorithm [Sawhney and Crane 2020]. While these PDE solvers have merits being unbiased discretization-free, their high often hinders real-time applications. On other hand, can approximate solution, evaluating at inference time be very fast. However, neural-network-based solutions may suffer from convergence difficulties...

10.1145/3610548.3618141 article EN 2023-12-10