Guandao Yang

ORCID: 0000-0002-2992-5803
Publications
Citations
Views
---
Saved
---
About
Contact & Profiles
Research Areas
  • Computer Graphics and Visualization Techniques
  • Advanced Vision and Imaging
  • 3D Shape Modeling and Analysis
  • Generative Adversarial Networks and Image Synthesis
  • 3D Surveying and Cultural Heritage
  • Optical measurement and interference techniques
  • Neural Networks and Applications
  • Advanced Numerical Analysis Techniques
  • Multimodal Machine Learning Applications
  • Advanced Image and Video Retrieval Techniques
  • Model Reduction and Neural Networks
  • Advanced Neural Network Applications
  • Robotics and Sensor-Based Localization
  • Topic Modeling
  • Image Processing and 3D Reconstruction
  • Stochastic Gradient Optimization Techniques
  • Machine Learning and Algorithms
  • Data Visualization and Analytics
  • Domain Adaptation and Few-Shot Learning
  • Numerical Methods and Algorithms
  • Human Motion and Animation
  • Natural Language Processing Techniques
  • Multi-Criteria Decision Making
  • Parallel Computing and Optimization Techniques
  • Recommender Systems and Techniques

Stanford University
2023-2024

Cornell University
2018-2023

United States University
2023

China University of Petroleum, East China
2017

Shanghai University of Finance and Economics
2010

As 3D point clouds become the representation of choice for multiple vision and graphics applications, ability to synthesize or reconstruct high-resolution, high-fidelity becomes crucial. Despite recent success deep learning models in discriminative tasks clouds, generating remains challenging. This paper proposes a principled probabilistic framework generate by modeling them as distribution distributions. Specifically, we learn two-level hierarchy distributions where first level is shapes...

10.1109/iccv.2019.00464 article EN 2021 IEEE/CVF International Conference on Computer Vision (ICCV) 2019-10-01

Evaluation metrics for image captioning face two challenges. Firstly, commonly used such as CIDEr, METEOR, ROUGE and BLEU often do not correlate well with human judgments. Secondly, each metric has known blind spots to pathological caption constructions, rule-based lack provisions repair once identified. For example, the newly proposed SPICE correlates judgments, but fails capture syntactic structure of a sentence. To address these challenges, we propose novel learning based discriminative...

10.1109/cvpr.2018.00608 preprint EN 2018-06-01

No abstract available.

10.1145/3623053.3623365 article FR Siggraph Asia Doctoral Consortium 2023-11-23

10.1109/cvpr52733.2024.02098 article EN 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) 2024-06-16

In this paper, we present a self-calibrating framework that jointly optimizes camera parameters, lens distortion and 3D Gaussian representations, enabling accurate efficient scene reconstruction. particular, our technique enables high-quality reconstruction from Large field-of-view (FOV) imagery taken with wide-angle lenses, allowing the to be modeled smaller number of images. Our approach introduces novel method for modeling complex distortions using hybrid network combines invertible...

10.48550/arxiv.2502.09563 preprint EN arXiv (Cornell University) 2025-02-13

Low-precision training reduces computational cost and produces efficient models. Recent research in developing new low-precision algorithms often relies on simulation to empirically evaluate the statistical effects of quantization while avoiding substantial overhead building specific hardware. To support this empirical research, we introduce QPyTorch, a arithmetic framework. Built natively PyTorch, QPyTorch provides convenient interface that minimizes efforts needed reliably convert existing...

10.1109/emc2-nips53020.2019.00010 article EN 2019-12-01

Low precision operations can provide scalability, memory savings, portability, and energy efficiency. This paper proposes SWALP, an approach to low training that averages low-precision SGD iterates with a modified learning rate schedule. SWALP is easy implement match the performance of full-precision even all numbers quantized down 8 bits, including gradient accumulators. Additionally, we show converges arbitrarily close optimal solution for quadratic objectives, noise ball asymptotically...

10.48550/arxiv.1904.11943 preprint EN other-oa arXiv (Cornell University) 2019-01-01

As 3D point clouds become the representation of choice for multiple vision and graphics applications, ability to synthesize or reconstruct high-resolution, high-fidelity becomes crucial. Despite recent success deep learning models in discriminative tasks clouds, generating remains challenging. This paper proposes a principled probabilistic framework generate by modeling them as distribution distributions. Specifically, we learn two-level hierarchy distributions where first level is shapes...

10.48550/arxiv.1906.12320 preprint EN other-oa arXiv (Cornell University) 2019-01-01

10.1109/cvpr52733.2024.00761 article EN 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) 2024-06-16

We introduce a general framework for solving partial differential equations (PDEs) using generative diffusion models. In particular, we focus on the scenarios where do not have full knowledge of scene necessary to apply classical solvers. Most existing forward or inverse PDE approaches perform poorly when observations data underlying coefficients are incomplete, which is common assumption real-world measurements. this work, propose DiffusionPDE that can simultaneously fill in missing...

10.48550/arxiv.2406.17763 preprint EN arXiv (Cornell University) 2024-06-25

Neural fields have emerged as a new paradigm for representing signals, thanks to their ability do it compactly while being easy optimize. In most applications, however, neural are treated like black boxes, which precludes many signal manipulation tasks. this paper, we propose class of called polynomial (PNFs). The key advantage PNF is that can represent composition number manipulable and interpretable components without losing the merits representation. We develop general theoretical...

10.48550/arxiv.2302.04862 preprint EN cc-by arXiv (Cornell University) 2023-01-01

State-of-the-art deep reading comprehension models are dominated by recurrent neural nets. Their sequential nature is a natural fit for language, but it also precludes parallelization within an instances and often becomes the bottleneck deploying such to latency critical scenarios. This particularly problematic longer texts. Here we present convolutional architecture as alternative these architectures. Using simple dilated units in place of ones, achieve results comparable state art on two...

10.48550/arxiv.1711.04352 preprint EN other-oa arXiv (Cornell University) 2017-01-01

3D Gaussians, as a low-level scene representation, typically involve thousands to millions of Gaussians. This makes it difficult control the in ways that reflect underlying dynamic structure, where number independent entities is much smaller. In particular, can be challenging animate and move objects scene, which requires coordination among many To address this issue, we develop mutual information shaping technique enforces movement resonance between correlated Gaussians motion network. Such...

10.48550/arxiv.2406.05897 preprint EN arXiv (Cornell University) 2024-06-09

Evaluation metrics for image captioning face two challenges. Firstly, commonly used such as CIDEr, METEOR, ROUGE and BLEU often do not correlate well with human judgments. Secondly, each metric has known blind spots to pathological caption constructions, rule-based lack provisions repair once identified. For example, the newly proposed SPICE correlates judgments, but fails capture syntactic structure of a sentence. To address these challenges, we propose novel learning based discriminative...

10.48550/arxiv.1806.06422 preprint EN other-oa arXiv (Cornell University) 2018-01-01

Low-precision training reduces computational cost and produces efficient models. Recent research in developing new low-precision algorithms often relies on simulation to empirically evaluate the statistical effects of quantization while avoiding substantial overhead building specific hardware. To support this empirical research, we introduce QPyTorch, a arithmetic framework. Built natively PyTorch, QPyTorch provides convenient interface that minimizes efforts needed reliably convert existing...

10.48550/arxiv.1910.04540 preprint EN other-oa arXiv (Cornell University) 2019-01-01

In this work, we propose a novel technique to generate shapes from point cloud data. A can be viewed as samples distribution of 3D points whose density is concentrated near the surface shape. Point generation thus amounts moving randomly sampled high-density areas. We clouds by performing stochastic gradient ascent on an unnormalized probability density, thereby toward high-likelihood regions. Our model directly predicts log field and trained with simple objective adapted score-based...

10.48550/arxiv.2008.06520 preprint EN other-oa arXiv (Cornell University) 2020-01-01

Neural radiance fields (NeRF) rely on volume rendering to synthesize novel views. Volume requires evaluating an integral along each ray, which is numerically approximated with a finite sum that corresponds the exact ray under piecewise constant density. As consequence, rendered result unstable w.r.t. choice of samples phenomenon we dub quadrature instability. We propose mathematically principled solution by reformulating sample-based equation so it linear This simultaneously resolves...

10.48550/arxiv.2310.20685 preprint EN cc-by arXiv (Cornell University) 2023-01-01

This paper presents a method that uses neural networks as caching mechanism to reduce the variance of Monte Carlo Partial Differential Equation solvers, such Walk-on-Spheres algorithm [Sawhney and Crane 2020]. While these PDE solvers have merits being unbiased discretization-free, their high often hinders real-time applications. On other hand, can approximate solution, evaluating at inference time be very fast. However, neural-network-based solutions may suffer from convergence difficulties...

10.1145/3610548.3618141 article EN 2023-12-10
Coming Soon ...