NFDI4DS | UHH-SEMS - Publication Details

Fast, reliable head tracking under varying illumination: an approach based on registration of texture-mapped 3D models

OPENALEX - Publications

Marco La Cascia Stan Sclaroff Vassilis Athitsos

A technique for 3D head tracking under varying illumination is proposed. The head is modeled as a texture mapped cylinder. Tracking is formulated as an image registration problem in the cylinder's texture map image. The resulting dynamic texture map provides a stabilized view of the face that can be used as input to many existing 2D techniques for face recognition, facial expressions analysis, lip reading, and eye tracking. To solve the registration problem with lighting variation and head motion, the residual registration error is modeled as a linear combination of texture warping templates and orthogonal illumination templates. Fast...

10.1109/34.845375 article EN IEEE Transactions on Pattern Analysis and Machine Intelligence 2000-04-01

3D skeleton-based human action classification: A survey

Liliana Lo Presti Marco La Cascia

10.1016/j.patcog.2015.11.019 article EN Pattern Recognition 2015-12-02

ImageRover: a content-based image browser for the World Wide Web

Stan Sclaroff Leonid Taycher Marco La Cascia

ImageRover is a search by image content navigation tool for the world wide web. To gather images expediently, the image collection subsystem utilizes a distributed fleet of WWW robots running on different computers. The robots gather information about the images they find, computing the appropriate image decompositions and indices, and store this extracted information in vector form for searches based on image content. At search time, users can iteratively guide the search through the selection of relevant examples. Search performance is made efficient through the use of an approximate, optimized k-d tree...

10.1109/ivl.1997.629714 article EN 1997-01-01

Combining textual and visual cues for content-based image retrieval on the World Wide Web

Marco La Cascia Sunjay Sethi Stan Sclaroff

A system is proposed that combines textual and visual statistics in a single index vector for content-based search of a WWW image database. Textual statistics are captured in vector form using latent semantic indexing (LSI) based on the text in the containing HTML document. Visual statistics are captured in vector form using color and orientation histograms. By using an integrated approach, it becomes possible to take advantage of statistical couplings between the content of the document (latent semantic content) and the contents of the images (visual statistics). The combined approach allows improved...

10.1109/ivl.1998.694480 article EN 2002-11-27

Depth map generation by image classification

Sebastiano Battiato Salvatore Curti Marco La Cascia Marcello Tortora Emiliano Scordato

This paper presents a novel and fully automatic technique to estimate depth information from a single input image. The proposed method is based on a new image classification technique able to classify digital images (also in Bayer pattern format) as indoor, outdoor with geometric elements, or outdoor without geometric elements. Using the information collected in the classification step, a suitable depth map is estimated. The proposed technique is fully unsupervised and is able to generate a depth map from a single view of the scene, requiring low computational resources.

10.1117/12.526634 article EN Proceedings of SPIE, the International Society for Optical Engineering/Proceedings of SPIE 2004-04-16

Is text preprocessing still worth the time? A comparative survey on the influence of popular preprocessing methods on Transformers and traditional classifiers

Marco Siino Ilenia Tinnirello Marco La Cascia

With the advent of modern pre-trained Transformers, text preprocessing has started to be neglected and not specifically addressed in recent NLP literature. However, both from a linguistic and from a computer science point of view, we believe that even when using modern Transformers, text preprocessing can significantly impact the performance of a classification model. We want to investigate and compare, through this study, how preprocessing impacts the Text Classification (TC) performance of modern and traditional classification models. We report and discuss the preprocessing techniques found in the literature and their most recent variants or applications...

10.1016/j.is.2023.102342 article EN cc-by Information Systems 2023-12-23

Unifying Textual and Visual Cues for Content-Based Image Retrieval on the World Wide Web

Stan Sclaroff Marco La Cascia Saratendu Sethi Leonid Taycher

A system is proposed that combines textual and visual statistics in a single index vector for content-based search of a WWW image database. Textual statistics are captured in vector form using latent semantic indexing based on the text in the containing HTML document. Visual statistics are captured in vector form using color and orientation histograms. By using an integrated approach, it becomes possible to take advantage of statistical couplings between the content of the document (latent semantic content) and the contents of the images (visual statistics). The combined approach allows improved performance...

10.1006/cviu.1999.0765 article EN cc-by-nc-nd Computer Vision and Image Understanding 1999-07-01

A risk evaluation framework for the best maintenance strategy: The case of a marine salt manufacture firm

Silvia Carpitella Ilyas Mzougui Julio Benítez Fortunato Carpitella Antonella Certa and 2 more

10.1016/j.ress.2020.107265 article EN Reliability Engineering & System Safety 2020-10-06

Fake News Spreaders Detection: Sometimes Attention Is Not All You Need

Marco Siino Elisa Di Nuovo Ilenia Tinnirello Marco La Cascia

Guided by a corpus linguistics approach, in this article we present a comparative evaluation of State-of-the-Art (SotA) models, with a special focus on Transformers, to address the task of Fake News Spreaders (i.e., users that share Fake News) detection. First, we explore the reference multilingual dataset for the considered task, exploiting corpus linguistics techniques, such as the chi-square test, keywords and Word Sketch. Second, we perform experiments on several models for Natural Language Processing. Third, using the most recent...

10.3390/info13090426 article EN cc-by Information 2022-09-09

JACOB: just a content-based query system for video databases

Marco La Cascia Edoardo Ardizzone

The increasing development of advanced multimedia applications requires new technologies for organizing and retrieving by content databases of still digital images or digital video sequences. The authors describe JACOB, a prototypal system allowing content-based browsing and querying in video databases. JACOB automatically splits a video into a sequence of shots, extracts a few representative frames (said r-frames) from each shot and computes r-frame descriptors based on features like color and texture. No user action is required during...

10.1109/icassp.1996.543585 article EN 2002-12-23

OPENALEX - Publications

Edoardo Ardizzone Marco La Cascia

10.1023/a:1009630331620 article EN Multimedia Tools and Applications 1997-01-01

3D stereoscopic image pairs by depth-map generation

Sebastiano Battiato Alessandro Capra Salvatore Curti Marco La Cascia

This work presents a new unsupervised technique aimed at generating stereoscopic views by estimating depth information from a single input image. Using a single input image, vanishing lines/points are extracted using a few heuristics to generate an approximated depth map. The depth map is then used to generate stereo pairs. The overall method is well suited for real time application and works also on CFA (colour filtering array) data acquired by consumer imaging devices. Experimental results on a large dataset are reported.

10.1109/3dpvt.2004.10 article EN International Symposium on 3D Data Processing, Visualization and Transmission 2004-09-06

The public mental representations of deepfake technology: An in-depth qualitative exploration through Quora text data analysis

Barbara Caci Giulia Giordano Marianna Alesi Ambra Gentile Chiara Agnello and 6 more

The advent of deepfake technology has raised significant concerns regarding its impact on individuals' cognitive processes and beliefs, considering the pervasive relationships between technology and human cognition. This study delves into the psychological literature surrounding deepfakes, focusing on people's public representation of this emerging technology and highlighting prevailing themes, opinions, and emotions. Under media framing, the theoretical framework is crucial in shaping schemas of the technology. A qualitative method has been applied...

10.1371/journal.pone.0313605 article EN cc-by PLoS ONE 2024-12-30

Fast, reliable head tracking under varying illumination

Marco La Cascia Stan Sclaroff

An improved technique for 3D head tracking under varying illumination conditions is proposed. The head is modeled as a texture mapped cylinder. Tracking is formulated as an image registration problem in the cylinder's texture map image. To solve the registration problem in the presence of lighting variation and head motion, the residual error of registration is modeled as a linear combination of texture warping templates and orthogonal illumination templates. Fast and stable on-line tracking is then achieved via regularized weighted least squares minimization of the registration error. The regularization term tends to limit potential ambiguities that...

10.1109/cvpr.1999.787001 article EN 2003-01-20

3D stereoscopic image pairs by depth-map generation

Sebastiano Battiato Alessandro Capra Salvatore Curti Marco La Cascia

This work presents a new unsupervised technique aimed at generating stereoscopic views by estimating depth information from a single input image. Using a single input image, vanishing lines/points are extracted using a few heuristics to generate an approximated depth map. The depth map is then used to generate stereo pairs. The overall method is well suited for real time application and works also on CFA (colour filtering array) data acquired by consumer imaging devices. Experimental results on a large dataset are reported.

10.1109/tdpvt.2004.1335185 article EN Proceedings. 2nd International Symposium on 3D Data Processing, Visualization and Transmission, 2004. 3DPVT 2004. 2004-11-08

Hankelet-based dynamical systems modeling for 3D action recognition

Liliana Lo Presti Marco La Cascia Stan Sclaroff Octavia Camps

10.1016/j.imavis.2015.09.007 article EN publisher-specific-oa Image and Vision Computing 2015-10-25

ABBIE: Attention-Based BI-Encoders for Predicting Where to Split Compound Sanskrit Words

Irfan Ali Liliana Lo Presti Igor Spanò Marco La Cascia

10.5220/0013155300003890 article EN Proceedings of the 14th International Conference on Agents and Artificial Intelligence 2025-01-01

From Foundations to GPT in Text Classification: A Comprehensive Survey on Current Approaches and Future Trends

Marco Siino Ilenia Tinnirello Marco La Cascia

10.1561/1500000107 article EN Foundations and Trends® in Information Retrieval 2025-01-01

Video indexing using MPEG motion compensation vectors

Edoardo Ardizzone Marco La Cascia A. Avanzato Arcangelo Bruna

In recent years a lot of work has been done on color, textural, structural and semantic indexing of "content-based" video databases. Motion-based video indexing has been less explored, with approaches generally based on the analysis of optical flows. Compressed videos require the decompression of the sequences and the computation of optical flows, two computationally heavy steps. In this paper we propose some methods to index videos by motion features (mainly related to camera motion) and by motion-based spatial segmentation of frames, in a fully automatic way. Our idea is to use...

10.1109/mmcs.1999.778574 article EN 2003-01-20

Boosting Hankel matrices for face emotion recognition and pain detection

Liliana Lo Presti Marco La Cascia

10.1016/j.cviu.2016.10.007 article EN Computer Vision and Image Understanding 2016-10-20

Video indexing using optical flow field

Edoardo Ardizzone Marco La Cascia

The increasing development of advanced multimedia applications requires new technologies for organizing and retrieving by content databases of digital video. Several content-based features (color, texture, motion, etc.) are needed to perform a reliable content-based retrieval. We present a method for automatic motion-based video indexing. A prototypal system has been developed to prove the validity of our approach. Our system automatically splits a video into a sequence of shots, extracts a few representative frames (said r-frames) from each shot and computes...

10.1109/icip.1996.560876 article EN 2002-12-24

Head tracking via robust registration in texture map images

Marco La Cascia John Isidoro Stan Sclaroff

A novel method for 3D head tracking in the presence of large head rotations and facial expression changes is described. Tracking is formulated in terms of color image registration in the texture map of a 3D surface model. Model appearance is recursively updated via image mosaicking in the texture map as the head orientation varies. The resulting dynamic texture map provides a stabilized view of the face that can be used as input to many existing 2D techniques for face recognition, facial expressions analysis, lip reading, and eye tracking. Parameters are estimated via a robust minimization procedure;...

10.1109/cvpr.1998.698653 article EN 2002-11-27