Kurt Debattista

ORCID: 0000-0003-2982-5199
Publications
Citations
Views
---
Saved
---
About
Contact & Profiles
Research Areas
  • Image Enhancement Techniques
  • Advanced Vision and Imaging
  • Computer Graphics and Visualization Techniques
  • Color Science and Applications
  • Visual Attention and Saliency Detection
  • Image and Video Quality Assessment
  • Visual perception and processing mechanisms
  • 3D Surveying and Cultural Heritage
  • 3D Shape Modeling and Analysis
  • Color perception and design
  • Industrial Vision Systems and Defect Detection
  • Multisensory perception and integration
  • Virtual Reality Applications and Impacts
  • Advanced Image Processing Techniques
  • Educational Games and Gamification
  • Parallel Computing and Optimization Techniques
  • Image Processing and 3D Reconstruction
  • Advanced Neural Network Applications
  • Data Visualization and Analytics
  • Advanced Image and Video Retrieval Techniques
  • 3D Modeling in Geospatial Applications
  • Video Surveillance and Tracking Methods
  • Explainable Artificial Intelligence (XAI)
  • Remote Sensing and LiDAR Applications
  • Recycling and Waste Management Techniques

University of Warwick
2016-2025

University of Sheffield
2024

University of the West of England
2022

University of Waterloo
2022

Brno University of Technology
2019

CentraleSupélec
2017

Université Paris-Sud
2017

Université Paris-Saclay
2017

Centre National de la Recherche Scientifique
2017

Télécom Paris
2017

Abstract High dynamic range (HDR) imaging provides the capability of handling real world lighting as opposed to traditional low (LDR) which struggles accurately represent images with higher range. However, most content is still available only in LDR. This paper presents a method for generating HDR from LDR based on deep Convolutional Neural Networks (CNNs) termed ExpandNet. ExpandNet accepts input and generates an expanded end‐to‐end fashion. The model attempts reconstruct missing...

10.1111/cgf.13340 article EN Computer Graphics Forum 2018-05-01

Dense captioning provides detailed captions of complex visual scenes. While a number successes have been achieved in recent years, there are still two broad limitations: 1) most existing methods adopt an encoder-decoder framework, where the contextual information is sequentially encoded using long short-term memory (LSTM). However, forget gate mechanism LSTM makes it vulnerable when dealing with sequence and 2) vast majority prior arts consider regions interests (RoIs) equally important,...

10.1109/tnnls.2022.3152990 article EN publisher-specific-oa IEEE Transactions on Neural Networks and Learning Systems 2022-03-11

Dense captioning generates more detailed spoken descriptions for complex visual scenes. Despite several promising leads, existing methods still have two broad limitations: 1) The vast majority of prior arts only consider contextual clues during but ignore potentially important textual context; 2) current imbalanced learning mechanisms limit the diversity vocabulary learned from dictionary, thus giving rise to low language-learning efficiency. To alleviate these gaps, in this paper, we...

10.1109/tmm.2023.3241517 article EN IEEE Transactions on Multimedia 2023-01-01

This is a repository copy of Virtual category learning: semi-supervised learning method for dense prediction with extremely limited labels.

10.1109/tpami.2024.3367416 article EN IEEE Transactions on Pattern Analysis and Machine Intelligence 2024-02-20

Dense captioning creates diverse Region of Interests (RoIs) descriptions for complex visual scenes. While promising results have been obtained, several issues persist. In particular: 1) it is hard to find the optimal parameters artificially designed modules (e.g., non-maximum suppression (NMS)) causing redundancies and fewer interactions benefit two sub-tasks RoI detection captioning; 2) absence a multi-scale decoder in current methods hinders acquisition scale-invariant features, thus...

10.1109/tmm.2024.3369863 article EN IEEE Transactions on Multimedia 2024-01-01

In recent years many Tone Mapping Operators (TMOs) have been presented in order to display High Dynamic Range Images (HDRI) on typical devices. TMOs compress the luminance range while trying maintain contrast. The dual of tone mapping, inverse expands a Low Image (LDRI) into HDRI. HDRIs contain broader physical values that can be perceived by human visual system. majority today's media is stored low dynamic range. Inverse (iTMOs) could thus potentially revive all this content for use high...

10.1145/1174429.1174489 article EN 2006-11-29

In recent years, there has been a notable surge in the adoption of weakly-supervised learning for medical image segmentation, utilizing scribble annotation as means to potentially reduce costs. However, inherent characteristics labeling, marked by incompleteness, subjectivity, and lack standardization, introduce inconsistencies into annotations. These become significant challenges network's process, ultimately affecting performance segmentation. To address this challenge, we propose creating...

10.1109/tip.2025.3530787 article EN IEEE Transactions on Image Processing 2025-01-01

Perceiving and understanding cyber-attacks can be a difficult task. This problem is widely recognized welldocumented, more effective techniques are needed to aid cyber-attack perception. Attack modeling (AMTs), such as attack graphs fault trees, useful visual aids that perception; however, there little empirical or comparative research which evaluates the effectiveness of these methods. paper reports results an evaluation between adapted graph method tree standard determine two methods in...

10.1109/tifs.2017.2771238 article EN IEEE Transactions on Information Forensics and Security 2017-11-08

Deep learning-based semi-supervised learning (SSL) algorithms are promising in reducing the cost of manual annotation clinicians by using unlabelled data, when developing medical image segmentation tools. However, to date, most existing treat labelled images and separately ignore explicit connection between them; this disregards essential shared information thus hinders further performance improvements. To mine images, we introduce a class-specific representation extraction approach, which...

10.1109/tmi.2022.3213372 article EN IEEE Transactions on Medical Imaging 2022-10-11

Increasing plastic recycling rates is key to addressing pollution. New technologies such as chemometric analysis of spectral data have shown great promises in improving the sorting efficiency boost rates. In this work, a novel deep learning architecture, PolymerSpectraDecisionNet (PSDN) was developed, consisting convolutional neural networks, residual networks and inception decision tree structure. To better represent conditions industry, models were built identify most widely recycled...

10.1016/j.resconrec.2022.106718 article EN cc-by-nc Resources Conservation and Recycling 2022-10-21

Abstract Sheet metal stamping is widely used for high-volume production. Despite the wide adoption, it can lead to defects in manufactured components, making their quality unacceptable. Because of variety that occur on final product, human inspectors are frequently employed detect them. However, they be unreliable and costly, particularly at speeds match rate. In this paper, we propose an automatic inspection framework process based computer vision deep learning techniques. The low cost,...

10.1007/s00170-022-10763-6 article EN cc-by The International Journal of Advanced Manufacturing Technology 2023-01-21

The computation of high-fidelity images in real-time remains one the key challenges for computer graphics. Recent work has shown that by understanding human visual system, selective rendering may be used to render only those parts which viewer is attending at high quality and rest scene a much lower quality. This can result significant reduction computational time, without being aware difference. Selective guided models typically form 2D saliency map, predict where user will looking any...

10.1145/1108590.1108595 article EN 2006-01-25

Abstract In the last few years, researchers in field of High Dynamic Range (HDR) Imaging have focused on providing tools for expanding Low (LDR) content generation HDR images due to growing popularity applications, such as photography and rendering via Image‐Based Lighting, imminent arrival displays consumer market. LDR expansion is required lack fast reliable level capture still videos. Furthermore, expansion, will allow re‐use legacy stills, videos applications created, over century more,...

10.1111/j.1467-8659.2009.01541.x article EN Computer Graphics Forum 2009-10-26

Contemporary multi-modal trackers achieve strong performance by leveraging complex backbones and fusion strategies, but this comes at the cost of computational efficiency, limiting their deployment in resource-constrained settings. On other hand, compact are more efficient often suffer from reduced due to limited feature representation. To mitigate gap between trackers, we introduce a cross-modality distillation framework. This framework includes complementarity-aware mask autoencoder...

10.1109/tpami.2025.3555485 article EN IEEE Transactions on Pattern Analysis and Machine Intelligence 2025-01-01
Coming Soon ...