Spyros Zoupanos

ORCID: 0000-0002-6069-5241
Publications
Citations
Views
---
Saved
---
About
Contact & Profiles
Research Areas
  • Advanced Database Systems and Queries
  • Peer-to-Peer Network Technologies
  • Advanced Data Storage Technologies
  • Data Management and Algorithms
  • Semantic Web and Ontologies
  • Caching and Content Delivery
  • Natural Language Processing Techniques
  • Web Data Mining and Analysis
  • Data Mining Algorithms and Applications
  • Topic Modeling
  • Opportunistic and Delay-Tolerant Networks
  • Distributed and Parallel Computing Systems
  • Text and Document Classification Technologies
  • Scientific Computing and Data Management
  • Service-Oriented Architecture and Web Services
  • IoT and Edge/Fog Computing
  • Machine Learning in Materials Science
  • Catalytic Processes in Materials Science
  • Algorithms and Data Compression
  • Network Security and Intrusion Detection
  • Data Quality and Management
  • Distributed Sensor Networks and Detection Algorithms
  • Cooperative Communication and Network Coding
  • Machine Learning and Algorithms

Ionian University
2022

École Polytechnique Fédérale de Lausanne
2020

Max Planck Institute for Informatics
2011-2013

Inria Saclay - Île de France
2008-2011

Laboratoire de Recherche en Informatique
2010-2011

Université Paris-Sud
2008-2011

Max Planck Society
2011

Institut national de recherche en informatique et en automatique
2008-2010

Knowledge Integration (United Kingdom)
2007-2009

Materials Cloud is a platform designed to enable open and seamless sharing of resources for computational science, driven by applications in materials modelling. It hosts 1) archival dissemination services raw curated data, together with their provenance graph, 2) modelling virtual machines, 3) tools data analytics, pre-/post-processing, 4) educational materials. Data citable archived persistently, providing comprehensive embodiment the FAIR principles that extends workflows. leverages AiiDA...

10.1038/s41597-020-00637-5 article EN cc-by Scientific Data 2020-09-08

Abstract The ever-growing availability of computing power and the sustained development advanced computational methods have contributed much to recent scientific progress. These developments present new challenges driven by sheer amount calculations data manage. Next-generation exascale supercomputers will harden these challenges, such that automated scalable solutions become crucial. In years, we been developing AiiDA (aiida.net), a robust open-source high-throughput infrastructure...

10.1038/s41597-020-00638-4 article EN cc-by Scientific Data 2020-09-08

Mobile networks experience a tremendous increase in data volume and user density due to the massive number of coexisting users devices. An efficient technique alleviate this issue is bring closer by exploiting cache-aided edge nodes, such as fixed mobile access points, even Meanwhile, fusion machine learning wireless offers new opportunities for network optimization when traditional approaches fail or incur high complexity. Among various categories, reinforcement provides autonomous...

10.1109/access.2022.3140719 article EN cc-by IEEE Access 2022-01-01

Frequent sequence mining is one of the fundamental building blocks in data mining. While problem has been extensively studied, few available techniques are sufficiently scalable to handle datasets with billions sequences; such large-scale arise, for instance, text and session analysis. In this paper, we propose MG-FSM, a algorithm frequent on MapReduce. MG-FSM can so-called "gap constraints", which be used limit output controlled set sequences. At its heart, partitions input database way...

10.1145/2463676.2465285 article EN 2013-06-22

SIGMOD 2008 was the first database conference that offered to test submitters' programs against their data verify experiments published. This paper discusses rationale for this effort, community's reaction, our experiences, and advice future similar efforts.

10.1145/1374780.1374791 article EN ACM SIGMOD Record 2008-03-01

We consider the problem of rewriting XQuery queries using multiple materialized views. The dialect we use to express views and corresponds tree patterns (returning data from several nodes, at different granularities, ranging node identifiers full XML subtrees) with value joins. provide correct complete algorithms for finding minimal rewritings, in which no view is redundant. Our work extends state art by considering more flexible than mostly XPath 1.0 dialects previously considered, powerful...

10.1109/icde.2011.5767915 article EN 2011-04-01

The evolution of natural language processing (NLP) has drastically improved numerous applications in terms quality results and speed, like the use semantic search modern engines. NLP highly benefited from recent developments word sentence embeddings which enable transformation complex tasks, such as similarity or Question Answering (Q&A), into much simpler to perform vector comparisons. However, new problems resulting transformations have also challenging tasks address efficient comparison...

10.1145/3549737.3549752 article EN 2022-09-07

We present the WebContent platform for managing distributed repositories of XML and semantic Web data. The allows integrating various data processing building blocks (crawling, translation, annotation, full-text search, structured querying, querying), presented as services, into a large-scale efficient platform. Calls to services are combined inside ActiveXML [8] documents, which documents including service calls. An optimizer is used to: ( i ) efficiently distribute computations among...

10.14778/1454159.1454191 article EN Proceedings of the VLDB Endowment 2008-08-01

The Web has become a platform of choice for the deployment complex applications involving several business partners. Typically, such interoperate by means services, exchanging XML information. We present OptimAX, an optimization service that applies at static level (prior to enacting application) in order rewrite it into one whose execution will be more performant. OptimAX builds on ActiveXML (AXML) data-centric composition language, and demonstrates how database-style techniques can...

10.1109/icwe.2008.11 article EN 2008-07-01

Knowledge harvesting enables the automated construction of large knowledge bases. In this work, we made a first attempt to harvest spatio-temporal from news archives construct trajectories individual entities for entity tracking. Our approach consists an extraction and disambiguation module fact generation which produce pertinent trajectory records textual sources. The evaluation on 20 years' New York Times article corpus showed that our methods are effective scalable.

10.1145/1963192.1963265 preprint EN 2011-03-28

Mash-ups are being used in various Web-based applications of Web 2.0 which combine instantly information from different sources. Active XML (AXML, short) language is a tool for decentralized, data-centric service integration. AXML document includes calls to services that may be either simple request-responses long running subscriptions. Being fully composable and allowing resource sharing makes ideal mash-up style In this demo we present how can as specification, optimization distributed...

10.1109/icde.2008.4497622 article EN 2008-04-01

The proliferation of electronic content has notably lead to the apparition large corpora interrelated structured documents (such as HTML and XML Web pages) semantic annotations (typically expressed in RDF), which further complement these documents. Documents may be authored independently by different users or programs. We present AnnoVIP, a peer-to-peer platform, capable efficiently exploiting multitude annotated documents, based on innovative materialized views.

10.1109/icde.2010.5447755 article EN 2022 IEEE 38th International Conference on Data Engineering (ICDE) 2010-01-01

Mobile networks are experiencing tremendous increase in data volume and user density. An efficient technique to alleviate this issue is bring the closer users by exploiting caches of edge network nodes, such as fixed or mobile access points even devices. Meanwhile, fusion machine learning wireless offers a viable way for optimization opposed traditional approaches which incur high complexity, fail provide optimal solutions. Among various categories, reinforcement operates an online...

10.48550/arxiv.2105.05564 preprint EN cc-by-nc-nd arXiv (Cornell University) 2021-01-01

The growing volumes of XML data sources on the Web or produced by enterprises, organizations etc. raise many performance challenges for management applications. In this work, we are concerned with distributed, peer-to-peer large corpora documents, based distributed hash table (or DHT, in short) overlay networks. We present ViP2P (standing Views Peer-to-Peer), a platform sharing documents structured P2P network infrastructure (DHT). At core stand materialized views, defined arbitrary queries,...

10.48550/arxiv.1112.2610 preprint EN other-oa arXiv (Cornell University) 2011-01-01
Coming Soon ...