- Visual Attention and Saliency Detection
- Visual perception and processing mechanisms
- Face Recognition and Perception
- Aesthetic Perception and Analysis
- Advanced Vision and Imaging
- Adversarial Robustness in Machine Learning
- Neural Networks and Applications
- Neural dynamics and brain function
- Bacillus and Francisella bacterial research
- Cell Image Analysis Techniques
- Gaze Tracking and Assistive Technology
- Advanced Optical Sensing Technologies
- Computer Graphics and Visualization Techniques
- Generative Adversarial Networks and Image Synthesis
- Image Enhancement Techniques
- Tactile and Sensory Interactions
- Image Retrieval and Classification Techniques
- Integrated Circuits and Semiconductor Failure Analysis
- Anomaly Detection Techniques and Applications
- Domain Adaptation and Few-Shot Learning
- Medical Image Segmentation Techniques
- Advanced Memory and Neural Computing
- Visual and Cognitive Learning Processes
- Multisensory perception and integration
- Glaucoma and retinal disorders
Harvard University
2021-2022
Massachusetts Institute of Technology
2020-2022
Harvard University Press
2019-2022
William James College
2021-2022
University of California, Santa Barbara
2015-2019
Virality of online content on social networking websites is an important but esoteric phenomenon often studied in fields like marketing, psychology and data mining. In this paper we study viral images from a computer vision perspective. We introduce three new image datasets from Reddit and define a virality score using social metadata. We train classifiers with state-of-the-art image features to predict the virality of individual images...
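The abstract does not include code; as an illustration only, here is a minimal sketch of the kind of pipeline it describes: fit a classifier on precomputed image features to predict a binarized virality label. The data loader, features, and threshold are hypothetical placeholders, not the paper's actual datasets or models.

```python
# Hypothetical sketch: predict image "virality" from precomputed image features.
# `load_reddit_features()` is a placeholder; the paper's datasets/features differ.
import numpy as np
from sklearn.model_selection import train_test_split
from sklearn.svm import LinearSVC
from sklearn.metrics import accuracy_score

def load_reddit_features():
    # Placeholder: random features and virality scores stand in for real data.
    rng = np.random.default_rng(0)
    X = rng.normal(size=(1000, 512))          # e.g. CNN or GIST descriptors
    virality_score = rng.normal(size=1000)    # metadata-derived score
    y = (virality_score > 0).astype(int)      # binarize: viral vs. not viral
    return X, y

X, y = load_reddit_features()
X_tr, X_te, y_tr, y_te = train_test_split(X, y, test_size=0.2, random_state=0)
clf = LinearSVC(C=1.0).fit(X_tr, y_tr)
print("held-out accuracy:", accuracy_score(y_te, clf.predict(X_te)))
```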
When viewing objects depicted in a frame, observers prefer to view large objects, like cars, at larger visual sizes and smaller objects, like cups, at smaller visual sizes. That is, the visual size of an object that "looks best" is linked to its typical physical size in the world. Why is this the case? One intuitive possibility is that these preferences are driven by semantic knowledge: for example, when we recognize a sofa, we access our knowledge about its real-world size, which influences what size we prefer the sofa to be within the frame. However, might perceptual processing play a role in this phenomenon; that is, do features related...
The goal of this work is to characterize the representational impact that foveation operations have for machine vision systems, inspired by the foveated human visual system, which has higher acuity at the center of gaze and a texture-like encoding in the periphery. To do so, we introduce models consisting of a first-stage \textit{fixed} image transform followed by a second-stage \textit{learnable} convolutional neural network, and we vary the first-stage component. The primary model has a foveated-textural input stage, which we compare with...
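As a hedged illustration of the two-stage design the abstract names, the sketch below pairs a fixed, non-learnable image transform with a small learnable CNN. The fixed stage here is a plain Gaussian blur standing in for a foveation operation; the paper's foveated-texture transform and architecture are different.

```python
# Minimal sketch (not the paper's model): a fixed, non-learnable image transform
# followed by a small learnable CNN. The fixed stage is a Gaussian blur placeholder.
import torch
import torch.nn as nn
import torchvision.transforms.functional as TF

class FixedThenLearnable(nn.Module):
    def __init__(self, num_classes=10, blur_sigma=2.0):
        super().__init__()
        self.blur_sigma = blur_sigma  # fixed first stage: no trainable parameters
        self.cnn = nn.Sequential(     # learnable second stage
            nn.Conv2d(3, 32, 3, padding=1), nn.ReLU(),
            nn.MaxPool2d(2),
            nn.Conv2d(32, 64, 3, padding=1), nn.ReLU(),
            nn.AdaptiveAvgPool2d(1), nn.Flatten(),
            nn.Linear(64, num_classes),
        )

    def forward(self, x):
        with torch.no_grad():  # first stage stays fixed during training
            x = TF.gaussian_blur(x, kernel_size=9, sigma=self.blur_sigma)
        return self.cnn(x)

model = FixedThenLearnable()
logits = model(torch.randn(4, 3, 64, 64))
print(logits.shape)  # torch.Size([4, 10])
```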
After years of experience, humans become experts at perceiving letters. Is this visual capacity attained by learning specialized letter features, or by reusing general features previously learned in the service of object categorization? To explore this question, we first measured the perceptual similarity of letters in two behavioral tasks, visual search and letter categorization. Then, we trained deep convolutional neural networks on either 26-way letter categorization or 1000-way object categorization, as a way to operationalize possible...
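One common way to relate behavioral similarity to network features is a representational-similarity-style comparison; the sketch below shows the bare mechanics with random placeholder matrices, and is not the paper's analysis.

```python
# Illustrative sketch: compare a behavioral letter-similarity matrix with a
# model-derived similarity matrix using a rank correlation (RSA-style analysis).
# The matrices below are random placeholders, not the paper's data.
import numpy as np
from scipy.stats import spearmanr

rng = np.random.default_rng(0)
n_letters = 26
behavioral_sim = rng.random((n_letters, n_letters))
model_sim = rng.random((n_letters, n_letters))

# Use only the upper triangle (off-diagonal letter pairs) of each matrix.
iu = np.triu_indices(n_letters, k=1)
rho, p = spearmanr(behavioral_sim[iu], model_sim[iu])
print(f"Spearman rho = {rho:.3f}, p = {p:.3g}")
```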
The problem of $\textit{visual metamerism}$ is defined as finding a family of perceptually indistinguishable, yet physically different images. In this paper, we propose our NeuroFovea metamer model, a foveated generative model based on a mixture of peripheral representations and style-transfer forward-pass algorithms. Our gradient-descent-free model is parametrized by a VGG19 encoder-decoder, which allows us to encode images in a high-dimensional space and interpolate between the content and texture information with...
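The core operation the abstract mentions is encode, interpolate between content and texture, then decode. The sketch below shows that operation with tiny placeholder modules; the actual model uses a VGG19-based encoder-decoder and peripheral pooling, neither of which is implemented here.

```python
# Minimal sketch of encode -> interpolate -> decode. The encoder/decoder are
# tiny placeholders, not the VGG19 encoder-decoder used in the paper.
import torch
import torch.nn as nn

encoder = nn.Sequential(nn.Conv2d(3, 16, 3, padding=1), nn.ReLU())
decoder = nn.Sequential(nn.Conv2d(16, 3, 3, padding=1), nn.Sigmoid())

def metamer_like(content_img, texture_img, alpha=0.5):
    """Blend content and texture encodings with weight alpha, then decode."""
    z_content = encoder(content_img)
    z_texture = encoder(texture_img)
    z_mix = (1.0 - alpha) * z_content + alpha * z_texture  # latent interpolation
    return decoder(z_mix)

out = metamer_like(torch.rand(1, 3, 64, 64), torch.rand(1, 3, 64, 64), alpha=0.3)
print(out.shape)  # torch.Size([1, 3, 64, 64])
```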
This paper outlines the development and testing of a novel, feedback-enabled attention allocation aid (AAAD), which uses real-time physiological data to improve human performance in a realistic sequential visual search task. By optimizing over search duration, the aid improves search efficiency while preserving decision accuracy as the operator identifies and classifies targets within simulated aerial imagery. Specifically, using experimental eye-tracking measurements of target detectability across the visual field, we...
Previous studies have proposed image-based clutter measures that correlate with human search times and/or eye movements. However, most models do not take into account the fact that the effects of clutter interact with the foveated nature of the visual system: clutter further from the fovea has an increasingly detrimental influence on perception. Here, we introduce a new model to predict human performance in target search utilizing a forced-fixation task. We use Feature Congestion (Rosenholtz et al.) as our non-foveated clutter model, and stack a peripheral architecture on top of it for...
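As a rough illustration of weighting clutter by its distance from fixation, the sketch below scales a per-pixel clutter map by an eccentricity-dependent weight. The clutter map is random and the weighting function is a placeholder; Feature Congestion itself and the paper's peripheral architecture are not implemented here.

```python
# Illustrative sketch: combine a per-pixel clutter map with an eccentricity-
# dependent weight around a fixation point.
import numpy as np

def foveated_clutter_score(clutter_map, fixation_xy, gain=0.05):
    h, w = clutter_map.shape
    ys, xs = np.mgrid[0:h, 0:w]
    ecc = np.hypot(xs - fixation_xy[0], ys - fixation_xy[1])  # pixels from fovea
    weights = 1.0 + gain * ecc   # clutter far from fixation counts more
    return float((clutter_map * weights).mean())

clutter_map = np.random.default_rng(0).random((480, 640))
print(foveated_clutter_score(clutter_map, fixation_xy=(320, 240)))
```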
Recent work suggests that the representations learned by adversarially robust networks are more human perceptually-aligned than those of non-robust networks, as probed via image manipulations. Despite appearing closer to human visual perception, it is unclear whether the constraints in robust DNN representations match the biological constraints found in human vision. Human vision seems to rely on texture-based/summary-statistic representations in the periphery, which have been shown to explain phenomena such as crowding and performance in visual search tasks. To understand how these optimizations/representations compare...
With the advent of modern expert systems driven by deep learning that supplement human experts (e.g. radiologists, dermatologists, surveillance scanners), we analyze how and when such systems enhance human performance in a fine-grained small-target visual search task. We set up a two-session factorial experimental design in which humans visually search for targets with and without a Deep Learning (DL) system. We evaluate changes in detection performance and eye movements in the presence of the DL system, and find that the improvement with the system (computed via a Faster R-CNN with VGG16) interacts...
The main success stories of deep learning, starting with ImageNet, depend on convolutional networks, which on certain tasks perform significantly better than traditional shallow classifiers, such as support vector machines, and also better than fully connected networks; but what is so special about convolutional networks? Recent results in approximation theory have proved an exponential advantage of deep convolutional networks, with or without shared weights, in approximating functions with a hierarchical locality in their compositional structure. More recently, the...
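As a schematic illustration of the kind of "hierarchically local compositional" function the abstract refers to, the block below writes an eight-variable function as nested functions of two-variable blocks and states the approximation gap informally; the exponents are the commonly cited shallow-versus-deep rates, not figures taken from this paper.

```latex
% Schematic form of a hierarchically local compositional function of 8 variables:
% each constituent function depends on only two inputs (locality), and the nesting
% mirrors the layers of a deep convolutional network.
\[
f(x_1,\dots,x_8) \;=\;
  h_3\bigl(
    h_{21}\bigl(h_{11}(x_1,x_2),\, h_{12}(x_3,x_4)\bigr),\;
    h_{22}\bigl(h_{13}(x_5,x_6),\, h_{14}(x_7,x_8)\bigr)
  \bigr)
\]
% Informal statement of the exponential gap: approximating such an f to accuracy
% \epsilon requires on the order of \epsilon^{-2/m} parameters for a deep network
% matching the hierarchy, versus \epsilon^{-n/m} for a shallow network, where n is
% the input dimension and m the smoothness of the constituent functions.
```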
Modern high-scoring models of vision in the Brain-Score competition do not stem from Vision Transformers. However, in this paper we provide evidence against this unexpected trend by showing that a Vision Transformer (ViT) can be perceptually aligned with human visual representations: a dual-stream Transformer, a CrossViT $\textit{a la}$ Chen et al. (2021), trained under a joint rotationally-invariant and adversarial optimization procedure, yields 2nd place in the aggregate Brain-Score 2022 competition (Schrimpf et al., 2020b)...
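In the spirit of the "joint rotationally-invariant and adversarial optimization" the abstract names, the sketch below shows one training step that combines random rotation augmentation with a one-step (FGSM) adversarial perturbation. The model, epsilon, and rotation range are placeholders; the paper's procedure and the CrossViT architecture are not reproduced here.

```python
# Sketch of one training step: random rotation augmentation + FGSM perturbation.
import torch
import torch.nn as nn
import torchvision.transforms.functional as TF

model = nn.Sequential(nn.Flatten(), nn.Linear(3 * 32 * 32, 10))
opt = torch.optim.SGD(model.parameters(), lr=0.1)
loss_fn = nn.CrossEntropyLoss()

def train_step(images, labels, eps=4 / 255):
    # Rotation augmentation (encourages rotational invariance).
    angle = float(torch.empty(1).uniform_(-30, 30))
    images = TF.rotate(images, angle)

    # FGSM: one-step adversarial perturbation of the rotated images.
    images = images.clone().requires_grad_(True)
    loss_fn(model(images), labels).backward()
    adv = (images + eps * images.grad.sign()).clamp(0, 1).detach()

    # Update the model on the adversarial batch.
    opt.zero_grad()
    loss = loss_fn(model(adv), labels)
    loss.backward()
    opt.step()
    return loss.item()

print(train_step(torch.rand(8, 3, 32, 32), torch.randint(0, 10, (8,))))
```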
The spatially-varying field of the human visual system has recently received a resurgence of interest with the development of virtual reality (VR) and neural networks. The computational demands of the high-resolution rendering desired for VR can be offset by savings in the periphery, while networks trained on foveated input have shown perceptual gains in i.i.d. and o.o.d. generalization. In this paper, we present a technique that exploits the CUDA GPU architecture to efficiently generate Gaussian-based foveated images at high definition (1920x1080...
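For illustration, the sketch below builds a Gaussian-based foveated image by blending progressively blurred copies of an image according to distance from a fixation point. It is a NumPy/SciPy stand-in rather than the paper's CUDA implementation, and the blur schedule is a placeholder.

```python
# Illustrative foveation: blend progressively blurred copies of an image by
# eccentricity from a fixation point (not the paper's CUDA method).
import numpy as np
from scipy.ndimage import gaussian_filter

def foveate(image, fixation_xy, sigmas=(0.0, 2.0, 4.0, 8.0), ring_px=150):
    h, w = image.shape[:2]
    ys, xs = np.mgrid[0:h, 0:w]
    ecc = np.hypot(xs - fixation_xy[0], ys - fixation_xy[1])
    # Blur level per pixel, growing with eccentricity in rings of ring_px pixels.
    level = np.clip((ecc // ring_px).astype(int), 0, len(sigmas) - 1)
    blurred = [gaussian_filter(image, sigma=(s, s, 0)) if s > 0 else image
               for s in sigmas]
    out = np.zeros_like(image)
    for i, b in enumerate(blurred):
        out[level == i] = b[level == i]
    return out

img = np.random.default_rng(0).random((1080, 1920, 3))
print(foveate(img, fixation_xy=(960, 540)).shape)  # (1080, 1920, 3)
```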
The main success stories of deep learning in visual perception tasks, starting with ImageNet, have relied on convolutional neural networks, which on certain tasks perform significantly better than traditional shallow classifiers, such as support vector machines. Is there something special about deep convolutional networks that other learning machines do not possess? Recent results in approximation theory have shown that there is an exponential advantage of deep convolutional networks (DCN) over fully connected networks (FCN) in approximating functions with a hierarchical locality in their...
When viewing objects depicted in a frame, most of us prefer to view large objects, like sofas, at larger sizes and smaller objects, like paperclips, at smaller sizes. In general, the visual size of an object that "looks best" is linked to its typical physical size in the world (Konkle & Oliva, 2011). Why is this the case? One intuitive possibility is that these preferences are driven by semantic knowledge: for example, when we recognize a sofa, we access our knowledge about its real-world size, which influences what size we prefer the sofa to be in the frame. However, might perceptual processing play a role in this phenomenon; that...
Self-supervised learning is a powerful way to learn useful representations from natural data. It has also been suggested as one possible means of building visual representations in humans, but the specific objective and algorithm are unknown. Currently, most self-supervised methods encourage the system to learn an invariant representation of different transformations of the same image, in contrast to those of other images. However, such transformations are generally non-biologically plausible, and often consist of contrived perceptual schemes such as random cropping and color...
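The invariance objective the abstract describes is commonly implemented as a contrastive (InfoNCE-style) loss over two augmented views of the same images; the minimal sketch below shows that loss with random placeholder embeddings and a hypothetical temperature, not any specific paper's recipe.

```python
# Minimal InfoNCE-style contrastive loss: representations of matching views are
# pulled together, all other pairs are pushed apart. Embeddings are random
# placeholders and the augmentation pipeline is not shown.
import torch
import torch.nn.functional as F

def info_nce(z1, z2, temperature=0.1):
    z1, z2 = F.normalize(z1, dim=1), F.normalize(z2, dim=1)
    logits = z1 @ z2.t() / temperature          # (N, N) similarity matrix
    targets = torch.arange(z1.size(0))          # positive pair is the diagonal
    return F.cross_entropy(logits, targets)

z_view1 = torch.randn(32, 128)  # embeddings of view 1 of a batch of images
z_view2 = torch.randn(32, 128)  # embeddings of view 2 of the same images
print(info_nce(z_view1, z_view2).item())
```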
Scene context guides eye movements and facilitates search performance (Torralba et al., 2006; Chen & Zelinsky, 2006; Eckstein et al., 2006). Here, we assess how scene context modulates the effect of the number of distractors on search with real scenes. Methods: Observers (n = 64) were presented with 24 grayscale images plus 96 fillers (22.53 deg x 15.03 deg), sampled from a dataset of 1224 desk scenes photographed from multiple viewpoints with varying numbers of distractor objects. Half of the images contained the target (a computer mouse). When present, the target appeared 60% of the time next to the monitor/keyboard...
Previous studies have proposed image-based measures of clutter and correlated them with subjective judgments of perceptual clutter (Yu et al., 2014) or threshold contrasts during a search task (Rosenholtz, 2007). Here we evaluate how multiple clutter metrics (Feature Congestion, FC; Subband Entropy, SE, Rosenholtz, 2005; Freeman & Simoncelli, 1995; ProtoObject Segmentation, PS, Yu) correlate with the time required for observers to fixate a searched target. In addition, we examine their influence on target detectability as a function of retinal...
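As an illustration of the correlation analysis the abstract describes, the sketch below relates per-image clutter scores to the mean time observers took to first fixate the target. All values are random placeholders, and the clutter metrics themselves are not implemented.

```python
# Illustrative sketch: correlate per-image clutter scores with time-to-fixation.
import numpy as np
from scipy.stats import pearsonr

rng = np.random.default_rng(0)
n_images = 24
clutter_scores = rng.random(n_images)           # e.g. Feature Congestion per image
time_to_fixation = 0.5 + 2.0 * clutter_scores + rng.normal(0, 0.3, n_images)

r, p = pearsonr(clutter_scores, time_to_fixation)
print(f"Pearson r = {r:.2f}, p = {p:.3g}")
```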