NFDI4DS | UHH-SEMS - Publication Details

Damian Borth

ORCID: 0000-0002-4660-2627

About

Contact & Profiles

OpenAlex ID: A5065722787

Research Areas

Advanced Image and Video Retrieval Techniques
Neural Networks and Applications
Remote-Sensing Image Classification
Video Analysis and Summarization
Anomaly Detection Techniques and Applications
Image Retrieval and Classification Techniques
Domain Adaptation and Few-Shot Learning
Multimodal Machine Learning Applications
Music and Audio Processing
Machine Learning and Data Classification
Stock Market Forecasting Methods
Speech and Audio Processing
Speech Recognition and Synthesis
Atmospheric and Environmental Gas Dynamics
Imbalanced Data Classification Techniques
Advanced Neural Network Applications
Adversarial Robustness in Machine Learning
Air Quality Monitoring and Forecasting
Automated Road and Building Extraction
Topic Modeling
Remote Sensing and Land Use
Natural Language Processing Techniques
Computational Physics and Python Applications
Data Stream Mining Techniques
Generative Adversarial Networks and Image Synthesis

University of St. Gallen
2017-2024

Institute of Computer Science
2020-2021

Czech Academy of Sciences, Institute of Computer Science
2020

German Research Centre for Artificial Intelligence
2008-2018

University of Kaiserslautern
2008-2018

International Computer Science Institute
2014-2016

YFCC100M

OPENALEX - Publications

Bart Thomée David A. Shamma Gerald Friedland Benjamin Elizalde Karl Ni and 3 more

We present the Yahoo Flickr Creative Commons 100 Million Dataset (YFCC100M), the largest public multimedia collection that has ever been released. The dataset contains a total of 100 million media objects, of which approximately 99.2 million are photos and 0.8 million are videos, all of which carry a Creative Commons license. Each media object in the dataset is represented by several pieces of metadata, e.g. Flickr identifier, owner name, camera, title, tags, geo, media source. The collection provides a comprehensive snapshot of how photos and videos were taken, described, and shared over the years, from the inception of Flickr in 2004...

10.1145/2812802 article EN Communications of the ACM 2016-01-25

EuroSAT: A Novel Dataset and Deep Learning Benchmark for Land Use and Land Cover Classification

Patrick Helber Benjamin Bischke Andreas Dengel Damian Borth

In this paper, we present a patch-based land use and land cover classification approach using Sentinel-2 satellite images. The Sentinel-2 satellite images are openly and freely accessible, and are provided in the earth observation program Copernicus. We present a novel dataset, based on these images, that covers 13 spectral bands and is comprised of ten classes with a total of 27 000 labeled and geo-referenced images. Benchmarks are provided for this novel dataset with its spectral bands using state-of-the-art deep convolutional neural networks. An overall classification accuracy of 98.57% was achieved with the proposed novel dataset. The resulting...

10.1109/jstars.2019.2918242 article EN IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing 2019-06-14

Large-scale visual sentiment ontology and detectors using adjective noun pairs

Damian Borth Rongrong Ji Tao Chen Thomas M. Breuel Shih-Fu Chang

We address the challenge of sentiment analysis from visual content. In contrast to existing methods which infer sentiment or emotion directly from visual low-level features, we propose a novel approach based on understanding of the visual concepts that are strongly related to sentiments. Our key contribution is two-fold: first, we present a method built upon psychological theories and web mining to automatically construct a large-scale Visual Sentiment Ontology (VSO) consisting of more than 3,000 Adjective Noun Pairs (ANP). Second, we propose SentiBank,...

10.1145/2502081.2502282 article EN 2013-10-21

DeepSentiBank: Visual Sentiment Concept Classification with Deep Convolutional Neural Networks

Tao Chen Damian Borth Trevor Darrell Shih‐Fu Chang

This paper introduces a visual sentiment concept classification method based on deep convolutional neural networks (CNNs). The visual sentiment concepts are adjective noun pairs (ANPs) automatically discovered from the tags of web photos, and can be utilized as effective statistical cues for detecting emotions depicted in the images. Nearly one million Flickr images tagged with these ANPs are downloaded to train the classifiers of the concepts. We adopt the popular model of deep convolutional neural networks which recently shows great performance improvement...

10.48550/arxiv.1410.8586 preprint EN other-oa arXiv (Cornell University) 2014-01-01

Multi-Task Learning for Segmentation of Building Footprints with Deep Neural Networks

Benjamin Bischke Patrick Helber Joachim Folz Damian Borth Andreas Dengel

The increased availability of high-resolution satellite imagery allows to sense very detailed structures on the surface of our planet. Access to such information opens up new directions in the analysis of remote sensing imagery. While deep neural networks have achieved significant advances in the semantic segmentation of high-resolution images, most of the existing approaches tend to produce predictions with poor boundaries. In this paper, we address the problem of preserving semantic segmentation boundaries in high-resolution satellite imagery by introducing a novel multi-task loss. The loss leverages...

10.1109/icip.2019.8803050 article EN 2019 IEEE International Conference on Image Processing (ICIP) 2019-08-26

Introducing Eurosat: A Novel Dataset and Deep Learning Benchmark for Land Use and Land Cover Classification

Patrick Helber Benjamin Bischke Andreas Dengel Damian Borth

In this paper, we address the challenge of land use and land cover classification using Sentinel-2 satellite images. The key contributions are as follows. We present a novel dataset based on Sentinel-2 satellite images covering 13 different spectral bands and consisting of 10 classes with in total 27,000 labeled images. We evaluate state-of-the-art deep Convolutional Neural Networks (CNNs) on this novel dataset with its different spectral bands. We also evaluate deep CNNs on existing remote sensing datasets and compare the obtained results. With the proposed novel dataset, we achieved an overall accuracy of 98.57%. The system...

10.1109/igarss.2018.8519248 article EN IGARSS 2018 - 2018 IEEE International Geoscience and Remote Sensing Symposium 2018-07-01

SentiBank

Damian Borth Tao Chen Rongrong Ji Shih‐Fu Chang

A picture is worth one thousand words, but what words should be used to describe the sentiment and emotions conveyed in the increasingly popular social multimedia? We demonstrate a novel system which combines sound structures from psychology and the folksonomy extracted from social multimedia to develop a large visual sentiment ontology consisting of 1,200 concepts and associated classifiers called SentiBank. Each concept, defined as an Adjective Noun Pair (ANP), is made of an adjective strongly indicating emotions and a noun corresponding to objects or scenes...

10.1145/2502081.2502268 article EN 2013-10-21

Self-supervised Vision Transformers for Land-cover Segmentation and Classification

Linus Scheibenreif Joëlle Hanna Michael Mommert Damian Borth

Transformer models have recently approached or even surpassed the performance of ConvNets on computer vision tasks like classification and segmentation. To a large degree, these successes have been enabled by the use of large-scale labelled image datasets for supervised pre-training. This poses a significant challenge for the adaption of vision Transformers to domains where datasets with millions of labelled samples are not available. In this work, we bridge the gap between ConvNets and Transformers for Earth observation by self-supervised pre-training on large-scale unlabelled remote sensing...

10.1109/cvprw56347.2022.00148 article EN 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops (CVPRW) 2022-06-01

Masked Vision Transformers for Hyperspectral Image Classification

Linus Scheibenreif Michael Mommert Damian Borth

Transformer architectures have become state-of-the-art models in computer vision and natural language processing. To a significant degree, their success can be attributed to self-supervised pre-training on large scale unlabeled datasets. This work investigates the use of self-supervised masked image reconstruction to advance transformer models for hyperspectral remote sensing imagery. To facilitate self-supervised pre-training, we build a large dataset of unlabeled hyperspectral observations from the EnMAP satellite and systematically investigate modifications of the vision transformer architecture...

10.1109/cvprw59228.2023.00210 article EN 2023 IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops (CVPRW) 2023-06-01

Neural Plasticity-Inspired Foundation Model for Observing the Earth Crossing Modalities

Zhitong Xiong Yi Wang Fahong Zhang Adam J. Stewart Joëlle Hanna and 5 more

The development of foundation models has revolutionized our ability to interpret the Earth's surface using satellite observational data. Traditional models have been siloed, tailored to specific sensors or data types like optical, radar, and hyperspectral, each with its own unique characteristics. This specialization hinders the potential for a holistic analysis that could benefit from the combined strengths of these diverse data sources. Our novel approach introduces the Dynamic One-For-All (DOFA) model, leveraging...

10.48550/arxiv.2403.15356 preprint EN arXiv (Cornell University) 2024-03-22

Detection of Anomalies in Large Scale Accounting Data using Deep Autoencoder Networks

Marco Schreyer Timur Sattarov Damian Borth Andreas Dengel Bernd Reimer

Learning to detect fraud in large-scale accounting data is one of the long-standing challenges in financial statement audits or fraud investigations. Nowadays, the majority of applied techniques refer to handcrafted rules derived from known fraud scenarios. While fairly successful, these rules exhibit the drawback that they often fail to generalize beyond known fraud scenarios and fraudsters gradually find ways to circumvent them. To overcome this disadvantage and inspired by the recent success of deep learning we propose the application of deep autoencoder neural...

10.48550/arxiv.1709.05254 preprint EN other-oa arXiv (Cornell University) 2017-01-01

EuroSAT: A Novel Dataset and Deep Learning Benchmark for Land Use and Land Cover Classification

Patrick Helber Benjamin Bischke Andreas Dengel Damian Borth

In this paper, we address the challenge of land use and land cover classification using Sentinel-2 satellite images. The Sentinel-2 satellite images are openly and freely accessible and provided in the Earth observation program Copernicus. We present a novel dataset based on Sentinel-2 satellite images covering 13 spectral bands and consisting of 10 classes with in total 27,000 labeled and geo-referenced images. We provide benchmarks for this novel dataset with its spectral bands using state-of-the-art deep Convolutional Neural Networks (CNNs). With the proposed novel dataset, we achieved an overall classification accuracy of 98.57%. The resulting classification system...

10.48550/arxiv.1709.00029 preprint EN other-oa arXiv (Cornell University) 2017-01-01

Analysis and forecasting of trending topics in online media streams

Tim Althoff Damian Borth J.J. van Hees Andreas Dengel

Among the vast information available on the web, social media streams capture what people currently pay attention to and how they feel about certain topics. Awareness of such trending topics plays a crucial role in multimedia systems such as trend aware recommendation and automatic vocabulary selection for video concept detection systems. Correctly utilizing trending topics requires a better understanding of their various characteristics in different social media streams. To this end, we present the first comprehensive study across three major...

10.1145/2502081.2502117 article EN 2013-10-21

Contextual Enrichment of Remote-Sensed Events with Social Media Streams

Benjamin Bischke Damian Borth Christian Schulze Andreas Dengel

The availability of satellite images for academic or commercial purposes is increasing rapidly due to efforts made by governmental agencies (NASA, ESA) to publish such data openly or by commercial startups (PlanetLabs) to provide real-time satellite data. Beyond many commercial applications, satellite data is helpful to create situation awareness in disaster recovery and emergency situations such as wildfires, earthquakes, or flooding. To fully utilize such data sources, we present a scalable system for the contextual enrichment of satellite images by crawling and analyzing multimedia content from social...

10.1145/2964284.2984063 article EN Proceedings of the 24th ACM International Conference on Multimedia 2016-09-29

Real-time Analysis and Visualization of the YFCC100m Dataset

Sebastian Kalkowski Christian Schulze Andreas Dengel Damian Borth

With the Yahoo Flickr Creative Commons 100 Million (YFCC100m) dataset, a novel dataset was introduced to the computer vision and multimedia research community. To maximize the benefit for the research community and utilize its potential, this dataset has to be made accessible by tools allowing to search for target concepts within the dataset and a mechanism to browse images and videos of the dataset. Following best practice from data collections, such as ImageNet and MS COCO, this paper presents means of accessibility for the YFCC100m dataset. This includes a global analysis of the dataset and an online browser...

10.1145/2814815.2814820 article EN 2015-10-30

Multi-Task Learning for Segmentation of Building Footprints with Deep Neural Networks

Benjamin Bischke Patrick Helber Joachim Folz Damian Borth Andreas Dengel

The increased availability of high resolution satellite imagery allows to sense very detailed structures on the surface of our planet. Access to such information opens up new directions in the analysis of remote sensing imagery. However, at the same time this raises a set of new challenges for existing pixel-based prediction methods, such as semantic segmentation approaches. While deep neural networks have achieved significant advances in the semantic segmentation of high resolution images in the past, most of the existing approaches tend to produce predictions with poor boundaries. In this paper,...

10.48550/arxiv.1709.05932 preprint EN other-oa arXiv (Cornell University) 2017-01-01

What do Deep Networks Like to See?

Sebastian Palacio Joachim Folz J.J. van Hees Federico Raue Damian Borth and 1 more

We propose a novel way to measure and understand convolutional neural networks by quantifying the amount of input signal they let in. To do this, an autoencoder (AE) was fine-tuned on gradients from a pre-trained classifier with fixed parameters. We compared the reconstructed samples from AEs that were fine-tuned on a set of image classifiers (AlexNet, VGG16, ResNet-50, Inception v3) and found substantial differences. The AE learns which aspects of the input space to preserve and which ones to ignore, based on the information encoded in the backpropagated gradients....

10.1109/cvpr.2018.00328 article EN 2018-06-01

CONTRASTIVE SELF-SUPERVISED DATA FUSION FOR SATELLITE IMAGERY

Linus Scheibenreif Michael Mommert Damian Borth

Self-supervised learning has great potential for the remote sensing domain, where unlabelled observations are abundant, but labels are hard to obtain. This work leverages unlabelled multi-modal remote sensing data for augmentation-free contrastive self-supervised learning. Deep neural network models are trained to maximize the similarity of latent representations obtained with different sensing techniques from the same location, while distinguishing them from other locations. We showcase this idea with two self-supervised data fusion methods and compare against...

10.5194/isprs-annals-v-3-2022-705-2022 article EN cc-by ISPRS annals of the photogrammetry, remote sensing and spatial information sciences 2022-05-17

Toward Global Estimation of Ground-Level NO2 Pollution With Deep Learning and Remote Sensing

Linus Scheibenreif Michael Mommert Damian Borth

Air pollution is a central environmental problem in countries around the world. It contributes to climate change through the emission of greenhouse gases, and adversely impacts the health of billions of people. Despite its importance, detailed information about the spatial and temporal distribution of pollutants is complex to obtain. Ground-level monitoring stations are sparse, and approaches for modeling air pollution rely on extensive datasets which are unavailable for many locations. We introduce three techniques for the estimation of air pollution to overcome these...

10.1109/tgrs.2022.3160827 article EN IEEE Transactions on Geoscience and Remote Sensing 2022-01-01

The Placing Task

Jaeyoung Choi Bart Thomée Gerald Friedland Liangliang Cao Karl Ni and 6 more

The Placing Task is a yearly challenge offered by the MediaEval Multimedia Benchmarking Initiative that requires participants to develop algorithms that automatically predict the geo-location of social media videos and images. We introduce a recent development of a new standardized web-scale geo-tagged dataset for Placing Task 2014, which contains 5.5 million photos and 35,000 videos. This standardized benchmark with a large persistent dataset allows the research community to easily evaluate new algorithms and to analyze their performance with respect to the state-of-the-art approaches....

10.1145/2661118.2661125 article EN 2014-11-03

Large-Scale Deep Learning on the YFCC100M Dataset

Karl Ni Roger Pearce Kofi Boakye Brian Van Essen Damian Borth and 2 more

We present a work-in-progress snapshot of learning with a 15 billion parameter deep learning network on HPC architectures applied to the largest publicly available natural image and video dataset released to-date. Recent advancements in unsupervised deep neural networks suggest that scaling up such networks in both model and training dataset size can yield significant improvements in the learning of concepts at the highest layers. We train our three-layer deep neural network on the Yahoo! Flickr Creative Commons 100M dataset. The dataset comprises approximately 99.2 million images and 800,000...

10.48550/arxiv.1502.03409 preprint EN other-oa arXiv (Cornell University) 2015-01-01

FinDiff: Diffusion Models for Financial Tabular Data Generation

Timur Sattarov Marco Schreyer Damian Borth

The sharing of microdata, such as fund holdings and derivative instruments, by regulatory institutions presents a unique challenge due to strict data confidentiality and privacy regulations. These challenges often hinder the ability of both academics and practitioners to conduct collaborative research effectively. The emergence of generative models, particularly diffusion models, capable of synthesizing data mimicking the underlying distributions of real-world data presents a compelling solution. This work introduces Financial Tabular Diffusion...

10.1145/3604237.3626876 article EN cc-by 2023-11-25