NFDI4DS | UHH-SEMS - Publication Details

Michail Vlachos

ORCID: 0000-0003-1008-5290

Publications

Citations

Views

---

Saved

---

About

Contact & Profiles

A5091785745

Research Areas

Time Series Analysis and Forecasting
Data Management and Algorithms
Algorithms and Data Compression
Anomaly Detection Techniques and Applications
Explainable Artificial Intelligence (XAI)
Advanced Clustering Algorithms Research
Image Retrieval and Classification Techniques
Advanced Database Systems and Queries
Music and Audio Processing
Adversarial Robustness in Machine Learning
Topic Modeling
Neural Networks and Applications
Advanced Graph Neural Networks
Data Stream Mining Techniques
Machine Learning and Data Classification
Complex Systems and Time Series Analysis
Advanced Steganography and Watermarking Techniques
Sparse and Compressive Sensing Techniques
Natural Language Processing Techniques
Recommender Systems and Techniques
Video Analysis and Summarization
Privacy-Preserving Technologies in Data
Cell Image Analysis Techniques
Machine Learning and Algorithms
Text Readability and Simplification

University of Lausanne
2019-2024

National and Kapodistrian University of Athens
2024

Institute of Communication and Computer Systems
2022

Los Alamitos Medical Center
2020

IBM Research - Zurich
2009-2019

IBM (United States)
2007-2018

IBM Research - Thomas J. Watson Research Center
2006-2017

University of California, Riverside
2002-2006

National Technical University of Athens
2004

Discovering similar multidimensional trajectories

OPENALEX - Publications

Michail Vlachos George Kollios Dimitrios Gunopulos

We investigate techniques for analysis and retrieval of object trajectories in two or three dimensional space. Such data usually contain a large amount noise, that has made previously used metrics fail. Therefore, we formalize non-metric similarity functions based on the longest common subsequence (LCSS), which are very robust to noise furthermore provide an intuitive notion between by giving more weight similar portions sequences. Stretching sequences time is allowed, as well global...

10.1109/icde.2002.994784 article EN 2003-06-25

Indexing multi-dimensional time-series with support for multiple distance measures

OPENALEX - Publications

Michail Vlachos Marios Hadjieleftheriou Dimitrios Gunopulos Eamonn Keogh

Although most time-series data mining research has concentrated on providing solutions for a single distance function, in this work we motivate the need index structure that can support multiple measures. Our specific area of interest is efficient retrieval and analysis trajectory similarities. Trajectory datasets are very common environmental applications, mobility experiments, video surveillance especially important discovery certain biological patterns. primary similarity measure based...

10.1145/956750.956777 article EN 2003-08-24

Identifying similarities, periodicities and bursts for online search queries

OPENALEX - Publications

Michail Vlachos Christopher Meek Zografoula Vagena Dimitrios Gunopulos

We present several methods for mining knowledge from the query logs of MSN search engine. Using logs, we build a time series each word or phrase (e.g., 'Thanksgiving' 'Christmas gifts') where elements are number times that is issued on day. All describe use sequences this form and can be applied to data generally. Our primary goal discovery semantically similar queries do so by identifying with demand patterns. Utilizing best Fourier coefficients energy omitted components, improve upon...

10.1145/1007568.1007586 article EN 2004-06-13

On Periodicity Detection and Structural Periodic Similarity

OPENALEX - Publications

Michail Vlachos Philip S. Yu Vittorio Castelli

This work motivates the need for more flexible structural similarity measures between time-series sequences, which are based on extraction of important periodic features. Specifically, we present non-parametric methods accurate periodicity detection and introduce new distance sequences. The goal these tools techniques to assist in detecting, monitoring visualizing changes. It is our belief that can be directly applicable manufacturing industry preventive maintenance medical sciences...

10.1137/1.9781611972757.40 article EN 2005-01-09

LB_Keogh supports exact indexing of shapes under rotation invariance with arbitrary representations and distance measures

OPENALEX - Publications

Eamonn Keogh Wei Li Xiaopeng Xi Sang‐Hee Lee Michail Vlachos

The matching of two-dimensional shapes is an important problem with applications in domains as diverse biometrics, industry, medicine and anthropology. distance measure used must be invariant to many distortions, including scale, offset, noise, partial occlusion, etc. Most these distortions are relatively easy handle, either the representation data or similarity used. However rotation invariance seems uniquely difficult. Current approaches typically try achieve data, at expense...

10.5555/1182635.1164203 article EN Very Large Data Bases 2006-09-01

Non-linear dimensionality reduction techniques for classification and visualization

OPENALEX - Publications

Michail Vlachos Carlotta Domeniconi Dimitrios Gunopulos George Kollios Nick Koudas

In this paper we address the issue of using local embeddings for data visualization in two and three dimensions, classification. We advocate their use on basis that they provide an efficient mapping procedure from original dimension data, to a lower intrinsic dimension. depict how can accurately capture user's perception similarity high-dimensional purposes. Moreover, exploit low-dimensional provided by these embeddings, develop new classification techniques, show experimentally accuracy is...

10.1145/775047.775143 article EN 2002-07-23

Indexing Multidimensional Time-Series

OPENALEX - Publications

Michail Vlachos Marios Hadjieleftheriou Dimitrios Gunopulos Eamonn Keogh

10.1007/s00778-004-0144-2 article EN The VLDB Journal 2005-07-22

Online amnesic approximation of streaming time series

OPENALEX - Publications

Themis Palpanas Michail Vlachos Eamonn Keogh Dimitrios Gunopulos Wagner Truppel

The past decade has seen a wealth of research on time series representations, because the manipulation, storage, and indexing large volumes raw data is impractical. vast majority concentrated representations that are calculated in batch mode represent each value with approximately equal fidelity. However, increasing deployment mobile devices real sensors brought home need for can be incrementally updated, approximate fidelity proportional to its age. latter property allows us answer queries...

10.1109/icde.2004.1320009 article EN 2004-09-28

Supporting exact indexing of arbitrarily rotated shapes and periodic time series under Euclidean and warping distance measures

OPENALEX - Publications

Eamonn Keogh Wei Li Xiaopeng Xi Michail Vlachos Sang‐Hee Lee and 1 more

10.1007/s00778-008-0111-4 article EN The VLDB Journal 2008-10-09

Rotation invariant distance measures for trajectories

OPENALEX - Publications

Michail Vlachos Dimitrios Gunopulos Gautam Das

For the discovery of similar patterns in 1D time-series, it is very typical to perform a normalization data (for example transformation so that follow zero mean and unit standard deviation). Such transformations can reveal latent are commonly used datamining applications. However, when dealing with multidimensional which appear naturally applications such as video-tracking, motion-capture etc, motion also be expressed at different orientations. It therefore imperative provide support for...

10.1145/1014052.1014144 article EN 2004-08-22

Global distance-based segmentation of trajectories

OPENALEX - Publications

Aris Anagnostopoulos Michail Vlachos Marios Hadjieleftheriou Eamonn Keogh Philip S. Yu

This work introduces distance-based criteria for segmentation of object trajectories. Segmentation leads to simplification the original objects into smaller, less complex primitives that are better suited storage and retrieval purposes. Previous on trajectory attacked problem locally, segmenting separately each database. Therefore, they did not directly optimize inter-object separability, which is necessary mining operations such as searching, clustering, classification large databases. In...

10.1145/1150402.1150411 article EN 2006-08-20

Domain-Driven, Actionable Knowledge Discovery

OPENALEX - Publications

Longbing Cao Chengqi Zhang Qiang Yang David Bell Michail Vlachos and 8 more

Data mining increasingly faces complex challenges in the real-life world of business problems and needs. The gap between expectations R&D results this area involves key aspects field, such as methodologies, targeted problems, pattern interestingness, infrastructure support. Both researchers practitioners are realizing importance domain knowledge to close develop actionable for real user

10.1109/mis.2007.67 article EN IEEE Intelligent Systems 2007-07-01

NET-FLi

OPENALEX - Publications

Francesco Fusco Marc Ph. Stoecklin Michail Vlachos

The ever-increasing number of intrusions in public and commercial networks has created the need for high-speed archival solutions that continuously store streaming network data to enable forensic analysis auditing. However, "turning back clock" post-attack analyses is not a trivial task. first major challenge solution sustain archiving under extremely insertion rates. Moreover, archives be stored format compressed but still amenable indexing. above requirements make general-purpose databases...

10.14778/1920841.1921011 article EN Proceedings of the VLDB Endowment 2010-09-01

Scalable and Interpretable Product Recommendations via Overlapping Co-Clustering

OPENALEX - Publications

Reinhard Heckel Michail Vlachos Thomas Parnell Celestine Duenner

We consider the problem of generating interpretable recommendations by identifying overlapping co-clusters clients and products, based only on positive or implicit feedback. Our approach is applicable very large datasets because it exhibits almost linear complexity in input examples number co-clusters. show, both real industrial data publicly available datasets, that recommendation accuracy our algorithm competitive to state-of-art matrix factorization techniques. In addition, technique has...

10.1109/icde.2017.149 article EN 2017-04-01

Robust similarity measures for mobile object trajectories

OPENALEX - Publications

Michail Vlachos Dimitrios Gunopulos George Kollios

We investigate techniques for similarity analysis of spatio-temporal trajectories mobile objects. Such data may contain a large number outliers, which degrade the performance Euclidean and time warping distance. Therefore, we propose use non-metric distance functions based on longest common subsequence (LCSS), in conjunction with sigmoidal matching function. Finally, compare these new methods to various L/sub p/ norms also (for real synthetic data) present experimental results that validate...

10.1109/dexa.2002.1045983 article EN Proceedings. 15th International Workshop on Database and Expert Systems Applications, 2004. 2004-04-23

The threshold join algorithm for top-k queries in distributed sensor networks

OPENALEX - Publications

Demetrios Zeinalipour-Yazti Zografoula Vagena Dimitrios Gunopulos Vana Kalogeraki Vassilis J. Tsotras and 3 more

In this paper we present the Threshold Join Algorithm (TJA), which is an efficient TOP-k query processing algorithm for distributed sensor networks. The objective of a top-k to find k highest ranked answers user defined similarity function. evaluation such in network environment associated with transfer data over extremely expensive communication medium. TJA uses non-uniform threshold on queried attribute order minimize number tuples that have be transferred towards querying node....

10.1145/1080885.1080896 article EN 2005-01-01

Computing Correlation Anomaly Scores Using Stochastic Nearest Neighbors

OPENALEX - Publications

Tsuyoshi Idé Spiros Papadimitriou Michail Vlachos

This paper addresses the task of change analysis correlated multi-sensor systems. The goal is to compute anomaly score each sensor when we know that system has some potential difference from a reference state. Examples include validating proper performance various car sensors in automobile industry. We solve this problem based on neighborhood preservation principle -If working normally, graph almost invariant against fluctuations experimental conditions. Here defined correlation between...

10.1109/icdm.2007.12 article EN 2007-10-01

Linear-complexity relaxed word Mover's distance with GPU acceleration

OPENALEX - Publications

Kubilay Atasu Thomas Parnell Celestine Dünner Manolis Sifalakis Haralampos Pozidis and 4 more

The amount of unstructured text-based data is growing every day. Querying, clustering, and classifying this big requires similarity computations across large sets documents. Whereas low-complexity metrics are available, attention has been shifting towards more complex methods that achieve a higher accuracy. In particular, the Word Mover's Distance (WMD) method proposed by Kusner et al. promising new approach, but its time complexity grows cubically with number unique words in Relaxed (RWMD)...

10.1109/bigdata.2017.8258005 article EN 2021 IEEE International Conference on Big Data (Big Data) 2017-12-01

A Survey of Deep Learning: From Activations to Transformers

OPENALEX - Publications

Johannes Schneider Michail Vlachos

10.5220/0012404300003636 article EN cc-by-nc-nd Proceedings of the 14th International Conference on Agents and Artificial Intelligence 2024-01-01

Elastic Translation Invariant Matching of Trajectories

OPENALEX - Publications

Michail Vlachos George Kollios Dimitrios Gunopulos

10.1007/s10994-005-5830-9 article EN Machine Learning 2005-02-01

Streaming Time Series Summarization Using User-Defined Amnesic Functions

OPENALEX - Publications

Themis Palpanas Michail Vlachos Eamonn Keogh Dimitrios Gunopulos

The past decade has seen a wealth of research on time series representations. vast majority concentrated representations that are calculated in batch mode and represent each value with approximately equal fidelity. However, the increasing deployment mobile devices real sensors brought home need for can be incrementally updated, approximate data fidelity proportional to its age. latter property allows us answer queries about recent greater precision, since many domains information is more...

10.1109/tkde.2007.190737 article EN IEEE Transactions on Knowledge and Data Engineering 2008-05-29

Coming Soon ...