NFDI4DS | UHH-SEMS - Publication Details

Vassilis Tsiaras

ORCID: 0000-0002-6677-1946

Publications

Citations

Views

---

Saved

---

About

Contact & Profiles

A5049252564

Research Areas

Speech Recognition and Synthesis
Music and Audio Processing
Speech and Audio Processing
Neural dynamics and brain function
Functional Brain Connectivity Studies
Data Visualization and Analytics
EEG and Brain-Computer Interfaces
Bioinformatics and Genomic Networks
Computational Geometry and Mesh Generation
Advanced Adaptive Filtering Techniques
Computer Graphics and Visualization Techniques
Voice and Speech Disorders
Constraint Satisfaction and Optimization
Speech and dialogue systems
Neural Networks and Applications
Multimedia Communication and Technology
Semantic Web and Ontologies
Data Management and Algorithms
Blind Source Separation Techniques
Biomedical Text Mining and Ontologies
Gene Regulatory Network Analysis
Structural Health Monitoring Techniques
Fractal and DNA sequence analysis
Embedded Systems Design Techniques
Video Analysis and Summarization

University of Crete
2008-2024

Technical University of Crete
2014-2016

Foundation for Research and Technology Hellas
2008-2013

Aristotle University of Thessaloniki
2010

Extracting biomarkers of autism from MEG resting-state functional connectivity networks

OPENALEX - Publications

Vassilis Tsiaras Panagiotis G. Simos Roozbeh Rezaie Bhavin R. Sheth Eleftherios Garyfallidis and 2 more

10.1016/j.compbiomed.2011.04.004 article EN Computers in Biology and Medicine 2011-05-22

ON the Use of Wavenet as a Statistical Vocoder

OPENALEX - Publications

Nagaraj Adiga Vassilis Tsiaras Yannis Stylianou

In this paper, we explore the possibility of using WaveNet architecture as a statistical vocoder. that case, generation speech waveforms is locally conditioned only by acoustic features. Focusing on single speaker case at moment, investigate impact local conditions well amount data available for training. Furthermore, variations are considered and discussed in context our work. We compare work against very recent which also used vocoder same data. More specifically, two female male speakers...

10.1109/icassp.2018.8462393 article EN 2018-04-01

GlotNet—A Raw Waveform Model for the Glottal Excitation in Statistical Parametric Speech Synthesis

OPENALEX - Publications

Lauri Juvela Bajibabu Bollepalli Vassilis Tsiaras Paavo Alku

Recently, generative neural network models which operate directly on raw audio, such as WaveNet, have improved the state of art in text-to-speech synthesis (TTS). Moreover, there is increasing interest using these statistical vocoders for generating speech waveforms from various acoustic features. However, also a need to reduce model complexity, without compromising quality. Previously, glottal pulseforms (i.e., time-domain corresponding source human voice production mechanism) been...

10.1109/taslp.2019.2906484 article EN IEEE/ACM Transactions on Audio Speech and Language Processing 2019-03-27

Assessment of Linear and Nonlinear Synchronization Measures for Analyzing EEG in a Mild Epileptic Paradigm

OPENALEX - Publications

Vangelis Sakkalis Ciprian Doru Giurcăneanu Petros Xanthopoulos Michalis Zervakis Vassilis Tsiaras and 3 more

<para xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink"> Epilepsy is one of the most common brain disorders and may result in dysfunction cognitive disturbances. Epileptic seizures usually begin childhood without being accommodated by damage are tolerated drugs that produce no dysfunction. In this study, function evaluated children with mild epileptic controlled antiepileptic drugs. Under prism, we propose a concise technical framework combining...

10.1109/titb.2008.923141 article EN IEEE Transactions on Information Technology in Biomedicine 2009-03-09

Speaker-independent Raw Waveform Model for Glottal Excitation

OPENALEX - Publications

Lauri Juvela Vassilis Tsiaras Bajibabu Bollepalli Manu Airaksinen Junichi Yamagishi and 1 more

Recent speech technology research has seen a growing interest in using WaveNets as statistical vocoders, i.e., generating waveforms from acoustic features.These models have been shown to improve the generated quality over classical vocoders many tasks, such text-to-speech synthesis and voice conversion.Furthermore, conditioning with features allows sharing waveform generator model across multiple speakers without additional speaker codes.However, multi-speaker WaveNet require large amounts...

10.21437/interspeech.2018-1635 article EN Interspeech 2022 2018-08-28

Speech Enhancement for Noise-Robust Speech Synthesis Using Wasserstein GAN

OPENALEX - Publications

Nagaraj Adiga Yannis Pantazis Vassilis Tsiaras Yannis Stylianou

10.21437/interspeech.2019-2648 article EN Interspeech 2022 2019-09-13

Optimal brain network synchrony visualization: Application in an alcoholism paradigm

OPENALEX - Publications

Vangelis Sakkalis Vassilis Tsiaras Michalis Zervakis Ioannis G. Tollis

Although Electroencephalographic (EEG) signal synchronization studies have been a topic of increasing interest lately, there is no similar effort in the visualization such measures. In this direction graph-theoretic approach devised to study and stress coupling dynamics task-performing dynamical networks proposed. Both linear nonlinear interdependence measures are investigated an alcoholism paradigm during mental rehearsal pictures, which known reflect impairment. More specifically, widely...

10.1109/iembs.2007.4353283 article EN Conference proceedings 2007-08-01

Assessment of neural dynamic coupling and causal interactions between independent EEG components from cognitive tasks using linear and nonlinear methods

OPENALEX - Publications

Vangelis Sakkalis Vassilis Tsiaras Kostas Michalopoulos Michalis Zervakis

Over the past few years there has been an increased interest in studying underlying neural mechanism of cognitive brain activity. In this direction, we study activity based on its independent components instead EEG signal itself. Both linear and nonlinear synchronization measures are applied to components, which free volume conduction effects background noise. More specifically, a robust state-space generalized assessment method recently introduced partial directed coherence investigated...

10.1109/iembs.2008.4650028 article EN 2008-08-01

A Non-Causal FFTNet Architecture for Speech Enhancement

OPENALEX - Publications

Muhammed Shifas P.V. Nagaraj Adiga Vassilis Tsiaras Yannis Stylianou

In this paper, we suggest a new parallel, non-causal and shallow waveform domain architecture for speech enhancement based on FFTNet, neural network generating high quality audio waveform. contrast to other approaches like WaveNet, FFTNet uses an initial wide dilation pattern. Such better represents the long term correlated structure of in time domain, where noise is usually highly non-correlated, therefore it suitable enhancement. To further strengthen feature architecture, present sample...

10.21437/interspeech.2019-2622 article EN Interspeech 2022 2019-09-13

Prediction of Peak Oxygen Uptake From a Maximal Treadmill Test in 12- to 18-Year-Old Active Male Adolescents

OPENALEX - Publications

Vassilis Tsiaras Andreas Zafeiridis Κωνσταντίνα Δίπλα Kostas Patras Anastasios D. Georgoulis and 1 more

The aims were to develop and validate a VO 2peak prediction equation from treadmill running test in active male adolescents. Eighty-eight athletes (12–18 yrs.) performed maximal exercise on assess the actual 20m Shuttle-Run-Test (20mST). A step-wise linear regression analysis was used following for estimation of (mL·kg −1 ·min ) = 35.477 + 1.832 × duration min - 0.010 body mass kg developed. cross-validation statistics were: R .54, CE 0.1 mL·kg , SEE 2.5 (4.6%), TE 2.6 (4.9%). values (CE,...

10.1123/pes.22.4.624 article EN Pediatric Exercise Science 2010-11-01

Video and audio based detection of filled hesitation pauses in classroom lectures

OPENALEX - Publications

Vassilis Tsiaras Costas Panagiotakis Yannis Stylianou

In this paper we study the detection of hesitation filled pauses in oral presentations university lectures taught Greek language and recorded using a tablet PC via specialized software. We suggest hierarchical approach fusing video data with audio for increasing precision rate our system. The method works at frame level rather than usual segmental more accurate synchronization after removing detected hesitations. Audio characteristics are modeled Gaussian Mixture Models while stationarity is...

10.5281/zenodo.41616 article EN European Signal Processing Conference 2009-08-24

DAGmaps: Space Filling Visualization of Directed Acyclic Graphs

OPENALEX - Publications

Vassilis Tsiaras Sofia Triantafilou Ioannis G. Tollis

Gene Ontology information related to the biological role of genes is organized in a hierarchical manner that can be represented by directed acyclic graph (DAG). Space filling visualizations, such as treemaps, have capacity display thousands items legibly limited space via two-dimensional rectangular map. Treemaps been used visualize first transforming DAG into tree. However this transformation has several undesirable effects producing trees with large number nodes and scattering rectangles...

10.7155/jgaa.00190 article EN cc-by Journal of Graph Algorithms and Applications 2009-01-01

BrainNetVis: An Open-Access Tool to Effectively Quantify and Visualize Brain Networks

OPENALEX - Publications

Eleni Christodoulou Vangelis Sakkalis Vassilis Tsiaras Ioannis G. Tollis

This paper presents BrainNetVis, a tool which serves brain network modelling and visualization, by providing both quantitative qualitative measures of interconnectivity. It emphasizes the needs that led to creation this presenting similar works in field describing how our contributes existing scenery. also describes methods used for calculation graph metrics (global vertex metrics), carry information. To make clear understandable, we use an exemplar dataset throughout paper, on calculations...

10.1155/2011/747290 article EN Computational Intelligence and Neuroscience 2011-01-01

BrainNetVis: Analysis and visualization of brain functional networks

OPENALEX - Publications

Vassilis Tsiaras Dimitrios Andreou Ioannis G. Tollis

BrainNetVis is an application, written in Java, that displays and analyzes synchronization networks from brain signals. The program implements a number of network indices visualization techniques. We demonstrate its use through case study left hand foot motor imagery. data sets were provided by the Berlin BCI group. Using this we managed to find differences between average comparing them with idle state network.

10.1109/iembs.2009.5334489 article EN Annual International Conference of the IEEE Engineering in Medicine and Biology Society 2009-09-01

Linear dynamical models in speech synthesis

OPENALEX - Publications

Vassilis Tsiaras Ranniery Maia Vassilios Diakoloukas Yannis Stylianou Vassilios Digalakis

Hidden Markov models (HMMs) are becoming the dominant approach for text-to-speech synthesis (TTS). HMMs provide an attractive acoustic modeling scheme which has been exhaustively investigated and developed many years. Modern HMM-based speech synthesizers have approached quality of best state-of-the-art unit selection systems. However, we believe that statistical parametric not reached its potential, since limited by several assumptions do apply to properties speech. We, therefore, propose in...

10.1109/icassp.2014.6853606 article EN 2014-05-01

Global Variance in Speech Synthesis With Linear Dynamical Models

OPENALEX - Publications

Vassilis Tsiaras Ranniery Maia Vassilios Diakoloukas Yannis Stylianou Vassilios Digalakis

Linear Dynamical Models (LDMs) have been used in speech synthesis recently as an alternative to hidden Markov models (HMMs). Among the advantages of LDMs are ability capture dynamics and achievement synthesized quality similar HMM-based systems on a smaller footprint. However, such HMM case, produce over-smoothed trajectories parameters, resulting muffled synthetic speech. Inspired by problem found synthesis, where naturalness is greatly improved when global variance (GV) compensated, this...

10.1109/lsp.2016.2580672 article EN IEEE Signal Processing Letters 2016-06-14

Memory Efficient Neural Speech Synthesis Based on FastSpeech2 Using Attention Free Transformer

OPENALEX - Publications

Eirini Sisamaki Vassilis Tsiaras Yannis Stylianou

10.23919/eusipco63174.2024.10714999 article EN 2024-08-26

Towards a linear dynamical model based speech synthesizer

OPENALEX - Publications

Vassilis Tsiaras Ranniery Maia Vassilios Diakoloukas Yannis Stylianou Vassilios Digalakis

10.21437/interspeech.2015-308 article EN Interspeech 2022 2015-09-06

Speech Intelligibility Enhancement Based on a Non-causal Wavenet-like Model

OPENALEX - Publications

Muhammed Shifas PV Vassilis Tsiaras Yannis Stylianou

Low speech intelligibility in noisy listening conditions makes more difficult our communication with others.Various strategies have been suggested to modify a signal before it is presented environment the goal increase its intelligibility.A state-of-the art approach, referred as Spectral Shaping and Dynamic Range Compression (SS-DRC), relies on modifying spectral temporal structure of clean has shown considerably improve conditions.In this paper, we present non-causal Wavenet-like model for...

10.21437/interspeech.2018-2119 article EN Interspeech 2022 2018-08-28

DAGmaps and ε-Visibility Representations for DAGs: Algorithms and Characterizations

OPENALEX - Publications

Vassilis Tsiaras Ioannis G. Tollis

DAGmaps are space lling visualizations of DAGs that generalize treemaps. Deciding whether or not a DAG admits DAGmap is NPcomplete. Although any layered planar one-dimensional there was no complete characterization the class admit DAGmap. In this paper we prove if and only it directed -visibility representation. Then characterize representations. This consists downward straight-line drawing such all source sink vertices assigned to external face. Finally show denes three-dimensional

10.7155/jgaa.00262 article EN cc-by Journal of Graph Algorithms and Applications 2012-01-01

Coming Soon ...