NFDI4DS | UHH-SEMS - Publication Details

Stevan Rudinac

ORCID: 0000-0003-1904-8736

About

Contact & Profiles

OpenAlex ID: A5075331928

Research Areas

Advanced Image and Video Retrieval Techniques
Image Retrieval and Classification Techniques
Video Analysis and Summarization
Multimodal Machine Learning Applications
Advanced Graph Neural Networks
Topic Modeling
Complex Network Analysis Techniques
Human Mobility and Location-Based Analysis
Video Surveillance and Tracking Methods
Automated Road and Building Extraction
Opinion Dynamics and Social Influence
Aesthetic Perception and Analysis
Data-Driven Disease Surveillance
Music and Audio Processing
Misinformation and Its Impacts
Data Stream Mining Techniques
Visual Attention and Saliency Detection
Advanced Data Compression Techniques
Remote-Sensing Image Classification
Machine Learning and Algorithms
Anomaly Detection Techniques and Applications
Natural Language Processing Techniques
Sentiment Analysis and Opinion Mining
Social Media and Politics
Neural Networks and Applications

University of Amsterdam
2015-2024

Amsterdam University of the Arts
2014-2024

Delft University of Technology
2009-2021

University of Belgrade
2006-2007

Multimodal Popularity Prediction of Brand-related Social Media Posts

OPENALEX - Publications

Masoud Mazloom, Robert Rietveld, Stevan Rudinac, Marcel Worring, Willemijn van Dolen

Brand-related user posts on social networks are growing at a staggering rate, where users express their opinions about brands by sharing multimodal posts. However, while some posts become popular, others are ignored. In this paper, we present an approach for identifying what aspects of posts determine their popularity. We hypothesize that brand-related posts may be popular due to several cues related to factual information, sentiment, vividness and entertainment parameters about the brand. We call the ensemble of cues engagement parameters....

10.1145/2964284.2967210 article EN Proceedings of the 24th ACM International Conference on Multimedia 2016-09-29

Automatic tagging and geotagging in video collections and communities

Martha Larson, Mohammad Soleymani, Pavel Serdyukov, Stevan Rudinac, Christian Wartena, and 4 more

Automatically generated tags and geotags hold great promise to improve access to video collections and online communities. We overview three tasks offered in the MediaEval 2010 benchmarking initiative, for each, describing its use scenario, definition and the data set released. For each task, a reference algorithm is presented that was used within MediaEval 2010 and comments are included on lessons learned. The Tagging Task, Professional involves automatically matching episodes in a collection of Dutch television with subject...

10.1145/1991996.1992047 article EN 2011-04-18

Generating Visual Summaries of Geographic Areas Using Community-Contributed Images

Stevan Rudinac, Alan Hanjalić, Martha Larson

In this paper, we present a novel approach for automatic visual summarization of a geographic area that exploits user-contributed images and related explicit and implicit metadata collected from popular content-sharing websites. By means of this approach, we search for a limited number of representative but diverse images to represent the area within a certain radius around a specific location. Our approach is based on the random walk with restarts over a graph that models relations between images, features extracted from them, associated text, as well...

10.1109/tmm.2013.2237896 article EN IEEE Transactions on Multimedia 2013-01-04

HyperSAGE: Generalizing Inductive Representation Learning on Hypergraphs

Devanshu Arya, Deepak K. Gupta, Stevan Rudinac, Marcel Worring

Graphs are the most ubiquitous form of structured data representation used in machine learning. They model, however, only pairwise relations between nodes and are not designed for encoding the higher-order relations found in many real-world datasets. To model such complex relations, hypergraphs have proven to be a natural representation. Learning the node representations in a hypergraph is more complex than in a graph as it involves information propagation at two levels: within every hyperedge and across the hyperedges. Most current...

10.48550/arxiv.2010.04558 preprint EN other-oa arXiv (Cornell University) 2020-01-01

High-performance computing in healthcare: An automatic literature analysis perspective

Jieyi Li, Shuai Wang, Stevan Rudinac, Anwar Osseyran

10.1186/s40537-024-00929-2 article EN Journal Of Big Data 2024-05-02

Enhancing Interactive Image Retrieval With Query Rewriting Using Large Language Models and Vision Language Models

Hongyi Zhu, Jia-Hong Huang, Stevan Rudinac, Evangelos Kanoulas

Image search stands as a pivotal task in multimedia and computer vision, finding applications across diverse domains, ranging from internet search to medical diagnostics. Conventional image search systems operate by accepting textual or visual queries, retrieving the top-relevant candidate results from the database. However, prevalent methods often rely on single-turn procedures, introducing potential inaccuracies and limited recall. These methods also face challenges, such as vocabulary mismatch and the semantic gap, constraining their...

10.1145/3652583.3658032 preprint EN cc-by 2024-05-30

Gradient Weight-normalized Low-rank Projection for Efficient LLM Training

Jia-Hong Huang, Yixian Shen, Hongyi Zhu, Stevan Rudinac, Evangelos Kanoulas

Large Language Models (LLMs) have shown remarkable performance across various tasks, but the escalating demands on computational resources pose significant challenges, particularly in the extensive utilization of full fine-tuning for downstream tasks. To address this, parameter-efficient fine-tuning (PEFT) methods have been developed, but they often underperform compared to full fine-tuning and struggle with memory efficiency. In this work, we introduce Gradient Weight-Normalized Low-Rank Projection (GradNormLoRP), a novel approach...

10.1609/aaai.v39i23.34587 article EN Proceedings of the AAAI Conference on Artificial Intelligence 2025-04-11

Domain-Informed Negative Sampling Strategies for Dynamic Graph Embedding in Meme Stock-Related Social Networks

Yunming Hui, Inez Maria Zwetsloot, Simon Trimborn, Stevan Rudinac

10.1145/3696410.3714650 article EN 2025-04-22

Learning Crowdsourced User Preferences for Visual Summarization of Image Collections

Stevan Rudinac, Martha Larson, Alan Hanjalić

In this paper we propose a novel approach to selecting images suitable for inclusion in the visual summaries. The approach is grounded in insights about how people summarize image collections. We utilize the Amazon Mechanical Turk crowdsourcing platform to obtain a large number of manually created visual summaries as well as information about the criteria for inclusion in the summary. Based on these large-scale user tests, we propose an automatic image selection approach, which jointly utilizes the analysis of image content, context, popularity, aesthetic appeal and sentiment derived...

10.1109/tmm.2013.2261481 article EN IEEE Transactions on Multimedia 2013-05-03

Exquisitor at the Lifelog Search Challenge 2024: Blending Conversational Search with User Relevance Feedback

Omar Shahbaz Khan, Ujjwal Sharma, Hongyi Zhu, Stevan Rudinac, Björn Þór Jónsson

The past decade has seen a rapid expansion of personal and interpersonal multimedia collections. These collections offer a wealth of information about individuals, including their interests, health, and significant life events. While automated techniques can assist in structuring and organizing these collections, they often have limitations in helping users effectively navigate and find relevant items within such large datasets. The Lifelog Search Challenge (LSC) provides a valuable benchmark for evaluating...

10.1145/3643489.3661132 article EN cc-by-sa 2024-06-10

Interactive Multimodal Learning for Venue Recommendation

Jan Zahálka, Stevan Rudinac, Marcel Worring

In this paper, we propose City Melange, an interactive and multimodal content-based venue explorer. Our framework matches the interacting user to the users of social media platforms exhibiting similar taste. The data collection integrates location-based social networks such as Foursquare with general multimedia sharing platforms such as Flickr or Picasa. The user interacts with a set of images and thus implicitly with the underlying semantics. The semantic information is captured through convolutional deep net features in the visual domain and latent topics...

10.1109/tmm.2015.2480007 article EN IEEE Transactions on Multimedia 2015-09-18

Blackthorn: Large-Scale Interactive Multimodal Learning

Jan Zahálka, Stevan Rudinac, Björn Þór Jónsson, D.C. Koelma, Marcel Worring

This paper presents Blackthorn, an efficient interactive multimodal learning approach facilitating analysis of multimedia collections of up to 100 million items on a single high-end workstation. Blackthorn features efficient data compression, feature selection, and optimizations to the interactive learning process. The Ratio-64 data representation introduced in this paper only costs tens of bytes per item yet preserves most of the visual and textual semantic information with good accuracy. The optimized interactive learning model scores the Ratio-64-compressed data directly, greatly...

10.1109/tmm.2017.2755986 article EN IEEE Transactions on Multimedia 2017-09-22

Echo Chambers Exist! (But They're Full of Opposing Views)

Jonathan Bright, Nahema Marchal, Bharath Ganesh, Stevan Rudinac

The theory of echo chambers, which suggests that online political discussions take place in conditions of ideological homogeneity, has recently gained popularity as an explanation for patterns of political polarization and radicalization observed in many democratic countries. However, while micro-level experimental work has shown evidence that individuals may gravitate towards information that supports their beliefs, recent macro-level studies have cast doubt on whether this tendency generates echo chambers in practice, instead...

10.48550/arxiv.2001.11461 preprint EN other-oa arXiv (Cornell University) 2020-01-01

Multimodal Temporal Fusion Transformers Are Good Product Demand Forecasters

Maarten Sukel, Stevan Rudinac, Marcel Worring

Multimodal demand forecasting aims at predicting product demand utilizing visual, textual, and contextual information. This paper proposes a method for such forecasting using an integrated architecture composed of convolutional, graph-based, and transformer-based networks. Since traditional methods depend on historical factors like manually generated categorical information, they face challenges such as the cold start problem and handling category dynamics. To address these challenges, our method allows incorporating multimodal...

10.1109/mmul.2024.3373827 article EN IEEE Multimedia 2024-03-07

Exquisitor at the Lifelog Search Challenge 2020

Omar Shahbaz Khan, Mathias Dybkjær Larsen, Liam Alex Sonto Poulsen, Björn Þór Jónsson, Jan Zahálka, and 3 more

We present an enhanced version of Exquisitor, our interactive and scalable media exploration system. At its core, Exquisitor is an interactive learning system using relevance feedback on media items to build a model of the users' information need. Relying on efficient media representation and indexing, it facilitates real-time user interaction. The new features for the Lifelog Search Challenge 2020 include support for timeline browsing, search functionality for finding positive examples, and significant interface improvements. Participation in...

10.1145/3379172.3391718 article EN 2020-06-04

Comparison of CBIR Systems with Different Number of Feature Vector Components

Stevan Rudinac, Goran Zajić, Marija Ušćumlić, Maja Rudinac, Branimir Reljin

Content-based image retrieval (CBIR) systems with user relevance feedback are considered. The influence of the type and number of feature vector (FV) components on retrieval efficiency was investigated. We compared a CBIR system with a very small FV (only 25 components describing color and texture) with a high-dimensional FV inspired by MPEG-7 (556 coordinates describing color, texture and line directions), as well as a system using feature vector reduction (FVR) of about 90% (with 50 components from the full-length 556-component FVs). The systems were tested over the annotated Corel 1K and Corel 60K datasets. Simulation...

10.1109/smap.2007.23 article EN 2007-12-01

Finding representative and diverse community contributed images to create visual summaries of geographic areas

Stevan Rudinac, Alan Hanjalić, Martha Larson

This paper presents an automatic approach that uses community-contributed images to create representative and diverse visual summaries of specific geographic areas. Complex relations between images, extracted features, text associated with the images as well as users and their social network are modeled using a multimodal graph. To compute affinities between nodes in the graph we rely on the proven concept of random walk with restarts. The novelty of our approach lies in its use of the graph to create a diverse, yet representative, image set. Further, we introduce...

10.1145/2072298.2071950 article EN Proceedings of the 19th ACM International Conference on Multimedia 2011-11-28

Analytic Quality

Jan Zahálka, Stevan Rudinac, Marcel Worring

In this paper, we present analytic quality (AQ), a novel paradigm for the design and evaluation of multimedia analysis methods. AQ complements the existing evaluation methods based on either machine-driven benchmarks or user studies. AQ includes the notion of insight gain and the time needed to acquire it, both critical aspects of large-scale multimedia collections analysis. To incorporate insight, AQ introduces a user model. In this model, each simulated user, or artificial actor, builds its insight over time, at any time operating with multiple categories of relevance....

10.1145/2733373.2806279 article EN 2015-10-13

Graph Neural Networks for Knowledge Enhanced Visual Representation of Paintings

Athanasios Efthymiou, Stevan Rudinac, Monika Kackovic, Marcel Worring, Nachoem M. Wijnberg

We propose ArtSAGENet, a novel multimodal architecture that integrates Graph Neural Networks (GNNs) and Convolutional Neural Networks (CNNs), to jointly learn visual and semantic-based artistic representations. First, we illustrate the significant advantages of multi-task learning for fine art analysis and argue that it is a conceptually much more appropriate setting in the fine art domain than the single-task alternatives. We further demonstrate that several GNN architectures can outperform strong CNN baselines in a range of fine art analysis tasks, such as style...

10.1145/3474085.3475586 article EN Proceedings of the 29th ACM International Conference on Multimedia 2021-10-17

How Do Individuals in a Radical Echo Chamber React to Opposing Views? Evidence from a Content Analysis of Stormfront

Jonathan Bright, Nahema Marchal, Bharath Ganesh, Stevan Rudinac

Calls to “break up” radical echo chambers by injecting them with alternative viewpoints are common. Yet, thus far there is little evidence about the impact of such counter-messaging. To what extent and how do individuals who inhabit a radical echo chamber engage with messages that challenge their core beliefs? Drawing on data from the radical right forum Stormfront we address this question with a large-scale content and longitudinal analysis of users’ posting behavior, which analyses more than 35,000 English language...

10.1093/hcr/hqab020 article EN Human Communication Research 2021-11-17

Multimodal Classification of Violent Online Political Extremism Content with Graph Convolutional Networks

Stevan Rudinac, Iva Gornishka, Marcel Worring

In this paper we present a multimodal approach to categorizing user posts based on their discussion topic. To integrate heterogeneous information extracted from the posts, i.e. text, visual content and the information about user interactions with the online platform, we deploy graph convolutional networks that were recently proven effective in classification tasks on knowledge graphs. As the case study we use the analysis of violent online political extremism content, a challenging task due to a particularly high semantic level at which extremist...

10.1145/3126686.3126776 article EN 2017-10-23

Exploiting visual reranking to improve pseudo-relevance feedback for spoken-content-based video retrieval

Stevan Rudinac, Martha Larson, Alan Hanjalić

In this paper we propose an approach that utilizes visual features and conventional text-based pseudo-relevance feedback (PRF) to improve the results of semantic-theme-based video retrieval. Our visual reranking method is based on an Average Item Distance (AID) score. AID-based visual reranking is designed to improve the suitability of items at the top of the initial results list, i.e., those selected for use in query expansion. AID-based visual reranking is intended to help target items representative of the visual regularity typifying the semantic theme of the query. Experiments performed on the VideoCLEF 2008 data set and a...

10.1109/wiamis.2009.5031421 article EN 2009-05-01