NFDI4DS | UHH-SEMS - Publication Details

David A. Shamma

ORCID: 0000-0003-2399-9374

Publications

Citations

Views

---

Saved

---

About

Contact & Profiles

A5001270504

Research Areas

Video Analysis and Summarization
Advanced Image and Video Retrieval Techniques
Image Retrieval and Classification Techniques
Multimedia Communication and Technology
Innovative Human-Technology Interaction
Complex Network Analysis Techniques
Data Visualization and Analytics
Digital Games and Media
Visual Attention and Saliency Detection
Music and Audio Processing
Interactive and Immersive Displays
Multimodal Machine Learning Applications
Geographic Information Systems Studies
Scientific Computing and Data Management
Human Mobility and Location-Based Analysis
Virtual Reality Applications and Impacts
Mobile Crowdsensing and Crowdsourcing
Opinion Dynamics and Social Influence
Digital Marketing and Social Media
Augmented Reality Applications
Context-Aware Activity Recognition Systems
Artificial Intelligence in Games
Ethics and Social Impacts of AI
Big Data and Business Intelligence
Misinformation and Its Impacts

Toyota Industries (United States)
2022-2025

Toyota Research Institute
2022-2025

Yahoo (United States)
2009-2023

Centrum Wiskunde & Informatica
2016-2022

Rochester Institute of Technology
2021

Yahoo (Spain)
2010-2021

FX Palo Alto Laboratory
2017-2020

Association for Computing Machinery
2020

College of Western Idaho
2017

Yahoo (United Kingdom)
2007-2015

Visual Genome: Connecting Language and Vision Using Crowdsourced Dense Image Annotations

OPENALEX - Publications

Ranjay Krishna Yuke Zhu Oliver Groth Justin Johnson Kenji Hata and 7 more

Despite progress in perceptual tasks such as image classification, computers still perform poorly on cognitive description and question answering. Cognition is core to that involve not just recognizing, but reasoning about our visual world. However, models used tackle the rich content images for are being trained using same datasets designed tasks. To achieve success at tasks, need understand interactions relationships between objects an image. When asked "What vehicle person riding?", will...

10.1007/s11263-016-0981-7 article EN cc-by International Journal of Computer Vision 2017-02-06

YFCC100M

OPENALEX - Publications

Bart Thomée David A. Shamma Gerald Friedland Benjamin Elizalde Karl Ni and 3 more

We present the Yahoo Flickr Creative Commons 100 Million Dataset (YFCC100M), largest public multimedia collection that has ever been released. The dataset contains a total of million media objects, which approximately 99.2 are photos and 0.8 videos, all carry license. Each object in is represented by several pieces metadata, e.g. identifier, owner name, camera, title, tags, geo, source. provides comprehensive snapshot how videos were taken, described, shared over years, from inception 2004...

10.1145/2812802 article EN Communications of the ACM 2016-01-25

Image retrieval using scene graphs

OPENALEX - Publications

Justin Johnson Ranjay Krishna Michael Stark Li-Jia Li David A. Shamma and 2 more

This paper develops a novel framework for semantic image retrieval based on the notion of scene graph. Our graphs represent objects ("man", "boat"), attributes ("boat is white") and relationships between ("man standing boat"). We use these as queries to retrieve semantically related images. To this end, we design conditional random field model that reasons about possible groundings test The likelihoods are used ranking scores retrieval. introduce dataset 5,000 human-generated grounded images...

10.1109/cvpr.2015.7298990 article EN 2015-06-01

Characterizing debate performance via aggregated twitter sentiment

OPENALEX - Publications

Nicholas Diakopoulos David A. Shamma

Television broadcasters are beginning to combine social micro-blogging systems such as Twitter with television create video experiences around events. We looked at one event, the first U.S. presidential debate in 2008, conjunction aggregated ratings of message sentiment from Twitter. begin develop an analytical methodology and visual representations that could help a journalist or public affairs person better understand temporal dynamics reaction video. demonstrate visuals metrics can be...

10.1145/1753326.1753504 article EN 2010-04-10

Faces engage us

OPENALEX - Publications

Saeideh Bakhshi David A. Shamma Éric Gilbert

Photos are becoming prominent means of communication online. Despite photos' pervasive presence in social media and online world, we know little about how people interact engage with their content. Understanding photo content might signify engagement, can impact both science design, influencing production distribution. One common type that is shared on media, the photos people. From studies offline behavior, human faces powerful channels non-verbal communication. In this paper, study...

10.1145/2556288.2557403 article EN 2014-04-26

Tweet the debates

OPENALEX - Publications

David A. Shamma Lyndon Kennedy Elizabeth F. Churchill

We investigate the practice of sharing short messages (microblogging) around live media events. Our focus is on Twitter and its usage during 2008 Presidential Debates. find that analysis patterns this event can yield significant insights into semantic structure content object. Specifically, we level activity serves as a predictor changes in topics event. Further conversational cues identify key players object posts somewhat reflect discussion object, but are mostly evaluative, they express...

10.1145/1631144.1631148 article EN 2009-10-23

Visual Genome: Connecting Language and Vision Using Crowdsourced Dense Image Annotations

OPENALEX - Publications

Ranjay Krishna Yuke Zhu Oliver Groth Justin Johnson Kenji Hata and 7 more

10.48550/arxiv.1602.07332 preprint EN other-oa arXiv (Cornell University) 2016-01-01

Peaks and persistence

OPENALEX - Publications

David A. Shamma Lyndon Kennedy Elizabeth F. Churchill

A microblogged stream is delivered over time, providing an ongoing commentary of topics, trends, and issues. In this article, we present two methods finding temporal topics within these Twitter streams. Using a normalized term frequency, demonstrate how effective table contents can be extracted by localized "peaky topics". Second, find "persistent conversations" which have lower general salience but sustain persist the tweet corpus, in effect whispering conversation that lingers background....

10.1145/1958824.1958878 article EN 2011-03-19

Money talks

OPENALEX - Publications

Joseph Kaye Mary Honodel McCuistion Rebecca Gulotta David A. Shamma

How do people keep track of their money? In this paper we present a preliminary scoping study how 14 individuals in the San Francisco Bay Area earn, save, spend and understand money personal family finances. We describe practices developed for exploring sensitive topic money, then discuss three sets findings. The first is emotional component relationship have with Second, tools processes used to financial situation. Finally account unknown unpredictable nature future through decisions....

10.1145/2556288.2556975 article EN 2014-04-26

Proxemics and Social Interactions in an Instrumented Virtual Reality Workshop

OPENALEX - Publications

Julie Williamson Jie Li Vinoba Vinayagamoorthy David A. Shamma Pablo César

Virtual environments (VEs) can create collaborative and social spaces, which are increasingly important in the face of remote work travel reduction. Recent advances, such as more open widely available platforms, new possibilities to observe analyse interaction VEs. Using a custom instrumented build Mozilla Hubs measure position orientation, we conducted an academic workshop facilitate range typical activities. We analysed interactions during keynote, small group breakouts, informal...

10.1145/3411764.3445729 preprint EN 2021-05-06

Fast, Cheap, and Good

OPENALEX - Publications

Saeideh Bakhshi David A. Shamma Lyndon Kennedy Yale Song Paloma de Juan and 1 more

Animated GIFs have been around since 1987 and recently gained more popularity on social networking sites. Tumblr, a large micro blogging platform, is popular venue to share animated GIFs. Tumblr users follow blogs, generating feed or posts, choose "like' "reblog' favored posts. In this paper, we use these actions as signals analyze the engagement of over 3.9 million conclude that are significantly engaging than other kinds media. We finding with deeper visual analysis nearly 100k pair our...

10.1145/2858036.2858532 article EN 2016-05-05

Why We Filter Our Photos and How It Impacts Engagement

OPENALEX - Publications

Saeideh Bakhshi David A. Shamma Lyndon Kennedy Éric Gilbert

A variety of simple graphical filters are available to camera phone users enhance their photos on the fly; these often stylize, saturate or age a photo. In this paper, we present combination large-scale data analysis and small scale in-depth interviews understand filter-work. We look at producers’ practices photo filtering gain insights in roles play engaging consumers’ by driving social interactions. first interviewed 15 Flickr mobile app (photo producers) use perception filters. Next,...

10.1609/icwsm.v9i1.14622 article EN Proceedings of the International AAAI Conference on Web and Social Media 2021-08-03

Viral Actions: Predicting Video View Counts Using Synchronous Sharing Behaviors

OPENALEX - Publications

David A. Shamma Jude Yew Lyndon Kennedy Elizabeth F. Churchill

In this article, we present a method for predicting the view count of YouTube video using small feature set collected from synchronous sharing tool. We hypothesize that videos which have high will exhibit unique pattern when shared in environments. Using one-day sample 2,188 dyadic sessions Yahoo! Zync tool, demonstrate how to predict video's on YouTube, specifically if has over 10 million views. The prediction model is 95.8% accurate and done with relatively training set; only 15% had more...

10.1609/icwsm.v5i1.14154 article EN Proceedings of the International AAAI Conference on Web and Social Media 2021-08-03

Digital Proxemics: Designing Social and Collaborative Interaction in Virtual Environments

OPENALEX - Publications

Julie Williamson Joseph O’Hagan John A. Guerra‐Gomez John Williamson Pablo César and 1 more

Behaviour in virtual environments might be informed by our experiences physical environments, but are not constrained the same physical, perceptual, or social cues. Instead of replicating properties spaces, one can create that diverge from reality dynamically manipulating environmental, aural, and properties. This paper explores digital proxemics, which describe how we use space presence others influences behaviours, interactions, movements. First, frame open challenges proxemics terms...

10.1145/3491102.3517594 article EN CHI Conference on Human Factors in Computing Systems 2022-04-28

Multimodal Classification of Moderated Online Pro-Eating Disorder Content

OPENALEX - Publications

Stevie Chancellor Yannis Kalantidis Jessica Pater Munmun De Choudhury David A. Shamma

Social media sites are challenged by both the scale and variety of deviant behavior online. While algorithms can detect spam obscenity, behaviors that break community guidelines on some difficult because they have multimodal subtleties (images and/or text). Identifying these posts is often regulated to a few moderators. In this paper, we develop deep learning classifier jointly models textual visual characteristics pro-eating disorder content violates guidelines. Using million Tumblr photo...

10.1145/3025453.3025985 article EN 2017-05-02

Embracing Error to Enable Rapid Crowdsourcing

OPENALEX - Publications

Ranjay Krishna Kenji Hata Stephanie Chen Joshua Kravitz David A. Shamma and 2 more

Microtask crowdsourcing has enabled dataset advances in social science and machine learning, but existing schemes are too expensive to scale up with the expanding volume of data. To widen applicability crowdsourcing, we present a technique that produces extremely rapid judgments for binary categorical labels. Rather than punishing all errors, which causes workers proceed slowly deliberately, our speeds workers' point where errors acceptable even expected. We demonstrate it is possible...

10.1145/2858036.2858115 preprint EN 2016-05-05

Social VR: A New Medium for Remote Communication and Collaboration

OPENALEX - Publications

Jie Li Vinoba Vinayagamoorthy Julie Williamson David A. Shamma Pablo César

We are facing increasingly pressure on reducing travel and working remotely. Tools that support effective remote communication collaboration much needed. Social Virtual Reality (VR) is an emerging medium, which invites multiple users to join a collaborative virtual environment (VE) has the potential in natural immersive way. successfully organized CHI 2020 VR workshop virtually Mozilla Hubs, invited researchers practitioners have fruitful discussion over user representations ethics,...

10.1145/3411763.3441346 article EN 2021-05-08

Watch what I watch

OPENALEX - Publications

David A. Shamma Ryan Shaw Peter Shafton Yiming Liu

This paper presents a high-level overview of Yahoo Research Berkeley's approach to multimedia research and the ideas motivating it. is characterized primarily by shift away from building subsystems that attempt discover or understand "meaning" media content toward systems algorithms can usefully utilize information about how being used in specific contexts; semantics pragmatics. We believe that, at least for domain consumer web videos, latter provides more promising basis indexing ways...

10.1145/1290082.1290120 article EN 2007-09-24

Flexible Learning with Semantic Visual Exploration and Sequence-Based Recommendation of MOOC Videos

OPENALEX - Publications

Jian Zhao Chidansh Bhatt Matthew Cooper David A. Shamma

Massive Open Online Course (MOOC) platforms have scaled online education to unprecedented enrollments, but remain limited by their rigid, predetermined curricula. To overcome this limitation, paper contributes a visual recommender system called MOOCex. The recommends lecture videos across different courses considering both video contents and sequential inter-topic relationships mined from course syllabi; more importantly, it allows for interactive exploration of the semantic space...

10.1145/3173574.3173903 article EN 2018-04-20

Save A Tree or 6 kg of CO2? Understanding Effective Carbon Footprint Interventions for Eco-Friendly Vehicular Choices

OPENALEX - Publications

Vikram Mohanty Alexandre L. S. Filipowicz Nayeli Suseth Bravo Scott Carter David A. Shamma

From ride-hailing to car rentals, consumers are often presented with eco-friendly options. Beyond highlighting a "green" vehicle and CO2 emissions, equivalencies have been designed provide understandable amounts; we ask which will lead decisions. We conducted five scenario surveys where participants picked between regular options, testing equivalencies, social features, valence-based interventions. Further, tested car-rental embodiment gauge how an individual (needing for several days) might...

10.1145/3544548.3580675 preprint EN 2023-04-19

On LLM Wizards: Identifying Large Language Models' Behaviors for Wizard of Oz Experiments

OPENALEX - Publications

Jingchao Fang Nikos Aréchiga Keiichi Namikoshi Nayeli Suseth Bravo Candice Hogan and 1 more

The Wizard of Oz (WoZ) method is a widely adopted research approach where human ``role-plays'' not readily available technology and interacts with participants to elicit user behaviors probe the design space. With growing ability for modern large language models (LLMs) role-play, one can apply LLMs as Wizards in WoZ experiments better scalability lower cost than traditional approach. However, methodological guidance on responsibly applying systematic evaluation LLMs' role-playing are...

10.1145/3652988.3673967 preprint EN 2024-09-16

Coming Soon ...