NFDI4DS | UHH-SEMS - Publication Details

Adding Conditional Control to Text-to-Image Diffusion Models

OPENALEX - Publications

Lvmin Zhang Anyi Rao Maneesh Agrawala

We present ControlNet, a neural network architecture to add spatial conditioning controls large, pretrained text-to-image diffusion models. ControlNet locks the production-ready large models, and reuses their deep robust encoding layers with billions of images as strong backbone learn diverse set conditional controls. The is connected "zero convolutions" (zero-initialized convolution layers) that progressively grow parameters from zero ensure no harmful noise could affect finetuning. test...

10.1109/iccv51070.2023.00355 article EN 2021 IEEE/CVF International Conference on Computer Vision (ICCV) 2023-10-01

Digital photography with flash and no-flash image pairs

OPENALEX - Publications

Georg Petschnigg Richard Szeliski Maneesh Agrawala Michael Cohen Hugues Hoppe and 1 more

Digital photography has made it possible to quickly and easily take a pair of images low-light environments: one with flash capture detail without ambient illumination. We present variety applications that analyze combine the strengths such flash/no-flash image pairs. Our include denoising transfer (to merge qualities no-flash high-frequency detail), white-balancing change color tone image), continuous interactively adjust intensity), red-eye removal repair artifacts in image). demonstrate...

10.1145/1015706.1015777 article EN ACM Transactions on Graphics 2004-08-01

Interactive digital photomontage

OPENALEX - Publications

Aseem Agarwala Mira Dontcheva Maneesh Agrawala Steven M. Drucker Alex Colburn and 3 more

We describe an interactive, computer-assisted framework for combining parts of a set photographs into single composite picture, process we call "digital photomontage." Our makes use two techniques primarily: graph-cut optimization, to choose good seams within the constituent images so that they can be combined as seamlessly possible; and gradient-domain fusion, based on Poisson equations, further reduce any remaining visible artifacts in composite. Also central is suite interactive tools...

10.1145/1015706.1015718 article EN ACM Transactions on Graphics 2004-08-01

Saliency in VR: How Do People Explore Virtual Environments?

OPENALEX - Publications

Vincent Sitzmann Ana Serrano Amy Pavel Maneesh Agrawala Diego Gutiérrez and 2 more

Understanding how people explore immersive virtual environments is crucial for many applications, such as designing reality (VR) content, developing new compression algorithms, or learning computational models of saliency visual attention. Whereas a body recent work has focused on modeling in desktop viewing conditions, VR very different from these conditions that behavior governed by stereoscopic vision and the complex interaction head orientation, gaze, other kinematic constraints. To...

10.1109/tvcg.2018.2793599 article EN IEEE Transactions on Visualization and Computer Graphics 2018-01-25

State of the Art on Neural Rendering

OPENALEX - Publications

Ayush Tewari Ohad Fried Justus Thies Vincent Sitzmann Stephen Lombardi and 14 more

Abstract Efficient rendering of photo‐realistic virtual worlds is a long standing effort computer graphics. Modern graphics techniques have succeeded in synthesizing images from hand‐crafted scene representations. However, the automatic generation shape, materials, lighting, and other aspects scenes remains challenging problem that, if solved, would make more widely accessible. Concurrently, progress vision machine learning given rise to new approach image synthesis editing, namely deep...

10.1111/cgf.14022 article EN publisher-specific-oa Computer Graphics Forum 2020-05-01

Gaze-based interaction for semi-automatic photo cropping

OPENALEX - Publications

Anthony Santella Maneesh Agrawala Doug DeCarlo David Salesin Michael Cohen

We present an interactive method for cropping photographs given minimal information about important content location, provided by eye tracking. Cropping is formulated in a general optimization framework that facilitates adding new composition rules, and adapting the system to particular applications. Our uses fixation data</ identify image compute best crop any aspect ratio or size, enabling applications such as automatic snapshot recomposition, adaptive documents, thumbnailing. validate our...

10.1145/1124772.1124886 article EN 2006-04-22

Interactive video cutout

OPENALEX - Publications

Jue Wang Pravin Bhat Alex Colburn Maneesh Agrawala Michael F. Cohen

We present an interactive system for efficiently extracting foreground objects from a video. extend previous min-cut based image segmentation techniques to the domain of video with four new contributions. provide novel painting-based user interface that allows users easily indicate object across space and time. introduce hierarchical mean-shift preprocess in order minimize number nodes must operate on. Within we also define local cost functions augment global costs defined earlier work....

10.1145/1073204.1073233 article EN ACM Transactions on Graphics 2005-07-01

Graphical Histories for Visualization: Supporting Analysis, Communication, and Evaluation

OPENALEX - Publications

Jeffrey Heer Jock D. Mackinlay Christian Stolte Maneesh Agrawala

Interactive history tools, ranging from basic undo and redo to branching timelines of user actions, facilitate iterative forms interaction. In this paper, we investigate the design mechanisms for information visualization. We present a space analysis both architectural interface issues, identifying decisions associated trade-offs. Based on analysis, contribute study graphical tools Tableau, database visualization system. These record visualize interaction histories, support data...

10.1109/tvcg.2008.137 article EN IEEE Transactions on Visualization and Computer Graphics 2008-10-28

Sizing the horizon

OPENALEX - Publications

Jeffrey Heer Nicholas Kong Maneesh Agrawala

We investigate techniques for visualizing time series data and evaluate their effect in value comparison tasks. compare line charts with horizon graphs - a space-efficient visualization technique across range of chart sizes, measuring the speed accuracy subjects' estimates differences between charts. identify transition points at which reducing height results significantly differing drops estimation compared types, we find optimal positions speed-accuracy tradeoff curve viewers performed...

10.1145/1518701.1518897 article EN 2009-04-04

Text-based editing of talking-head video

OPENALEX - Publications

Ohad Fried Ayush Tewari Michael Zollhöfer Adam Finkelstein Eli Shechtman and 5 more

Editing talking-head video to change the speech content or remove filler words is challenging. We propose a novel method edit based on its transcript produce realistic output in which dialogue of speaker has been modified, while maintaining seamless audio-visual flow (i.e. no jump cuts). Our automatically annotates an input with phonemes, visemes, 3D face pose and geometry, reflectance, expression scene illumination per frame. To video, user only transcript, optimization strategy then...

10.1145/3306346.3323028 article EN ACM Transactions on Graphics 2019-07-12

ReVision

OPENALEX - Publications

Manolis Savva Nicholas Kong Arti Chhajta Li Fei-Fei Maneesh Agrawala and 1 more

Poorly designed charts are prevalent in reports, magazines, books and on the Web. Most of these only available as bitmap images; without access to underlying data it is prohibitively difficult for viewers create more effective visual representations. In response we present ReVision, a system that automatically redesigns visualizations improve graphical perception. Given image chart input, ReVision applies computer vision machine learning techniques identify type (e.g., pie chart, bar...

10.1145/2047196.2047247 article EN 2011-10-16

Interactive furniture layout using interior design guidelines

OPENALEX - Publications

Paul Merrell Eric Schkufza Zeyang Li Maneesh Agrawala Vladlen Koltun

We present an interactive furniture layout system that assists users by suggesting arrangements are based on interior design guidelines. Our incorporates the guidelines as terms in a density function and generates suggestions rapidly sampling using hardware-accelerated Monte Carlo sampler. results demonstrate suggestion generation functionality measurably increases quality of produced participants with no prior training design.

10.1145/1964921.1964982 article EN 2011-07-25

Interactive furniture layout using interior design guidelines

OPENALEX - Publications

Paul Merrell Eric Schkufza Zeyang Li Maneesh Agrawala Vladlen Koltun

We present an interactive furniture layout system that assists users by suggesting arrangements are based on interior design guidelines. Our incorporates the guidelines as terms in a density function and generates suggestions rapidly sampling using hardware-accelerated Monte Carlo sampler. results demonstrate suggestion generation functionality measurably increases quality of produced participants with no prior training design.

10.1145/2010324.1964982 article EN ACM Transactions on Graphics 2011-07-01

Adding Conditional Control to Text-to-Image Diffusion Models

OPENALEX - Publications

Lvmin Zhang Maneesh Agrawala

We present ControlNet, a neural network architecture to add spatial conditioning controls large, pretrained text-to-image diffusion models. ControlNet locks the production-ready large models, and reuses their deep robust encoding layers with billions of images as strong backbone learn diverse set conditional controls. The is connected "zero convolutions" (zero-initialized convolution layers) that progressively grow parameters from zero ensure no harmful noise could affect finetuning. test...

10.48550/arxiv.2302.05543 preprint EN other-oa arXiv (Cornell University) 2023-01-01

Rendering effective route maps

OPENALEX - Publications

Maneesh Agrawala Chris Stolte

Route maps, which depict a path from one location to another, have emerged as of the most popular applications on Web. Current computer-generated route however, are often very difficult use. In this paper we present set cartographic generalization techniques specifically designed improve usability maps. Our based both cognitive psychology research studying how maps used and an analysis generalizations commonly found in handdrawn We describe algorithmic implementations these within LineDrive,...

10.1145/383259.383286 article EN 2001-08-01

Scented Widgets: Improving Navigation Cues with Embedded Visualizations

OPENALEX - Publications

Wesley Willett Jeffrey Heer Maneesh Agrawala

This paper presents scented widgets, graphical user interface controls enhanced with embedded visualizations that facilitate navigation in information spaces. We describe design guidelines for adding visual cues to common widgets such as radio buttons, sliders, and combo boxes contribute a general software framework applying within applications minimal modifications existing source code. provide number of example controlled experiment which finds users exploring unfamiliar data make up twice...

10.1109/tvcg.2007.70589 article EN IEEE Transactions on Visualization and Computer Graphics 2007-11-01

The two-user Responsive Workbench

OPENALEX - Publications

Maneesh Agrawala Andrew C. Beers Ian E. McDowall Bernd Fröhlich Mark Bolas and 1 more

Article The two-user Responsive Workbench: support for collaboration through individual views of a shared space Share on Authors: Maneesh Agrawala Stanford University, Stanford, CA CAView Profile , Andrew C. Beers Ian McDowall Fakespace, Inc., Mountain View, Bernd Fröhlich Mark Bolas Pat Hanrahan Authors Info & Claims SIGGRAPH '97: Proceedings the 24th annual conference Computer graphics and interactive techniquesAugust 1997 Pages 327–332https://doi.org/10.1145/258734.258875Online:03 August...

10.1145/258734.258875 article EN 1997-01-01

Multiscale shape and detail enhancement from multi-light image collections

OPENALEX - Publications

Raanan Fattal Maneesh Agrawala Szymon Rusinkiewicz

We present a new image-based technique for enhancing the shape and surface details of an object. The input to our system is small set photographs taken from fixed viewpoint, but under varying lighting conditions. For each image we compute multiscale decomposition based on bilateral filter then reconstruct enhanced that combines detail information at scale across all images. Our approach does not require any about light source positions, or camera calibration, can produce good results with 3...

10.1145/1275808.1276441 article EN 2007-07-29

Multiscale shape and detail enhancement from multi-light image collections

OPENALEX - Publications

Raanan Fattal Maneesh Agrawala Szymon Rusinkiewicz

We present a new image-based technique for enhancing the shape and surface details of an object. The input to our system is small set photographs taken from fixed viewpoint, but under varying lighting conditions. For each image we compute multiscale decomposition based on bilateral filter then reconstruct enhanced that combines detail information at scale across all images. Our approach does not require any about light source positions, or camera calibration, can produce good results with 3...

10.1145/1276377.1276441 article EN ACM Transactions on Graphics 2007-07-29

Design Considerations for Collaborative Visual Analytics

OPENALEX - Publications

Jeffrey Heer Maneesh Agrawala

Information visualization leverages the human visual system to support process of sensemaking, in which information is collected, organized, and analyzed generate knowledge inform action. Though most research date assumes a single-user focus on perceptual cognitive processes, practice, sensemaking often social involving parallelization effort, discussion, consensus building. This suggests that fully interactive should also interaction. However, appropriate collaboration mechanisms for...

10.1109/vast.2007.4389011 article EN 2007-10-01

Interactive digital photomontage

OPENALEX - Publications

Aseem Agarwala Mira Dontcheva Maneesh Agrawala Steven M. Drucker Alex Colburn and 3 more

We describe an interactive, computer-assisted framework for combining parts of a set photographs into single composite picture, process we call "digital photomontage." Our makes use two techniques primarily: graph-cut optimization, to choose good seams within the constituent images so that they can be combined as seamlessly possible; and gradient-domain fusion, based on Poisson equations, further reduce any remaining visible artifacts in composite. Also central is suite interactive tools...

10.1145/1186562.1015718 article EN 2004-08-01

Digital photography with flash and no-flash image pairs

OPENALEX - Publications

Georg Petschnigg Richard Szeliski Maneesh Agrawala Michael Cohen Hugues Hoppe and 1 more

Digital photography has made it possible to quickly and easily take a pair of images low-light environments: one with flash capture detail without ambient illumination. We present variety applications that analyze combine the strengths such flash/no-flash image pairs. Our include denoising transfer (to merge qualities no-flash high-frequency detail), white-balancing change color tone image), continuous interactively adjust intensity), red-eye removal repair artifacts in image). demonstrate...

10.1145/1186562.1015777 article EN 2004-08-01

Interactive 3D architectural modeling from unordered photo collections

OPENALEX - Publications

Sudipta N. Sinha Drew Steedly Richard Szeliski Maneesh Agrawala Marc Pollefeys

We present an interactive system for generating photorealistic, textured, piecewise-planar 3D models of architectural structures and urban scenes from unordered sets photographs. To reconstruct geometry in our system, the user draws outlines overlaid on 2D The structure is then automatically computed by combining interaction with multi-view geometric information recovered performing motion analysis input utilize vanishing point constraints at multiple stages during reconstruction, which...

10.1145/1457515.1409112 article EN 2008-12-01

Designing effective step-by-step assembly instructions

OPENALEX - Publications

Maneesh Agrawala Doantam Phan Julie Heiser John Haymaker Jeff Klingner and 2 more

We present design principles for creating effective assembly instructions and a system that is based on these principles. The are drawn from cognitive psychology research which investigated people's conceptual models of methods to visually communicate information. Our inspired by earlier work in robotics planning visualization automated presentation design. Although other systems have considered independently, we believe it necessary address the two problems simultaneously order create...

10.1145/1201775.882352 article EN 2003-07-01

Software Design Patterns for Information Visualization

OPENALEX - Publications

Jeffrey Heer Maneesh Agrawala

Despite a diversity of software architectures supporting information visualization, it is often difficult to identify, evaluate, and re-apply the design solutions implemented within such frameworks. One popular effective approach for addressing difficulties capture successful in patterns, abstract descriptions interacting components that can be customized solve problems particular context. Based upon review existing frameworks our own experiences building visualization software, we present...

10.1109/tvcg.2006.178 article EN IEEE Transactions on Visualization and Computer Graphics 2006-09-01