Vassilis Tsiaras

ORCID: 0000-0002-6677-1946
Publications
Citations
Views
---
Saved
---
About
Contact & Profiles
Research Areas
  • Speech Recognition and Synthesis
  • Music and Audio Processing
  • Speech and Audio Processing
  • Neural dynamics and brain function
  • Functional Brain Connectivity Studies
  • Data Visualization and Analytics
  • EEG and Brain-Computer Interfaces
  • Bioinformatics and Genomic Networks
  • Computational Geometry and Mesh Generation
  • Advanced Adaptive Filtering Techniques
  • Computer Graphics and Visualization Techniques
  • Voice and Speech Disorders
  • Constraint Satisfaction and Optimization
  • Speech and dialogue systems
  • Neural Networks and Applications
  • Multimedia Communication and Technology
  • Semantic Web and Ontologies
  • Data Management and Algorithms
  • Blind Source Separation Techniques
  • Biomedical Text Mining and Ontologies
  • Gene Regulatory Network Analysis
  • Structural Health Monitoring Techniques
  • Fractal and DNA sequence analysis
  • Embedded Systems Design Techniques
  • Video Analysis and Summarization

University of Crete
2008-2024

Technical University of Crete
2014-2016

Foundation for Research and Technology Hellas
2008-2013

Aristotle University of Thessaloniki
2010

In this paper, we explore the possibility of using WaveNet architecture as a statistical vocoder. that case, generation speech waveforms is locally conditioned only by acoustic features. Focusing on single speaker case at moment, investigate impact local conditions well amount data available for training. Furthermore, variations are considered and discussed in context our work. We compare work against very recent which also used vocoder same data. More specifically, two female male speakers...

10.1109/icassp.2018.8462393 article EN 2018-04-01

Recently, generative neural network models which operate directly on raw audio, such as WaveNet, have improved the state of art in text-to-speech synthesis (TTS). Moreover, there is increasing interest using these statistical vocoders for generating speech waveforms from various acoustic features. However, also a need to reduce model complexity, without compromising quality. Previously, glottal pulseforms (i.e., time-domain corresponding source human voice production mechanism) been...

10.1109/taslp.2019.2906484 article EN IEEE/ACM Transactions on Audio Speech and Language Processing 2019-03-27

<para xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink"> Epilepsy is one of the most common brain disorders and may result in dysfunction cognitive disturbances. Epileptic seizures usually begin childhood without being accommodated by damage are tolerated drugs that produce no dysfunction. In this study, function evaluated children with mild epileptic controlled antiepileptic drugs. Under prism, we propose a concise technical framework combining...

10.1109/titb.2008.923141 article EN IEEE Transactions on Information Technology in Biomedicine 2009-03-09

Recent speech technology research has seen a growing interest in using WaveNets as statistical vocoders, i.e., generating waveforms from acoustic features.These models have been shown to improve the generated quality over classical vocoders many tasks, such text-to-speech synthesis and voice conversion.Furthermore, conditioning with features allows sharing waveform generator model across multiple speakers without additional speaker codes.However, multi-speaker WaveNet require large amounts...

10.21437/interspeech.2018-1635 article EN Interspeech 2022 2018-08-28

Although Electroencephalographic (EEG) signal synchronization studies have been a topic of increasing interest lately, there is no similar effort in the visualization such measures. In this direction graph-theoretic approach devised to study and stress coupling dynamics task-performing dynamical networks proposed. Both linear nonlinear interdependence measures are investigated an alcoholism paradigm during mental rehearsal pictures, which known reflect impairment. More specifically, widely...

10.1109/iembs.2007.4353283 article EN Conference proceedings 2007-08-01

Over the past few years there has been an increased interest in studying underlying neural mechanism of cognitive brain activity. In this direction, we study activity based on its independent components instead EEG signal itself. Both linear and nonlinear synchronization measures are applied to components, which free volume conduction effects background noise. More specifically, a robust state-space generalized assessment method recently introduced partial directed coherence investigated...

10.1109/iembs.2008.4650028 article EN 2008-08-01

In this paper, we suggest a new parallel, non-causal and shallow waveform domain architecture for speech enhancement based on FFTNet, neural network generating high quality audio waveform. contrast to other approaches like WaveNet, FFTNet uses an initial wide dilation pattern. Such better represents the long term correlated structure of in time domain, where noise is usually highly non-correlated, therefore it suitable enhancement. To further strengthen feature architecture, present sample...

10.21437/interspeech.2019-2622 article EN Interspeech 2022 2019-09-13

The aims were to develop and validate a VO 2peak prediction equation from treadmill running test in active male adolescents. Eighty-eight athletes (12–18 yrs.) performed maximal exercise on assess the actual 20m Shuttle-Run-Test (20mST). A step-wise linear regression analysis was used following for estimation of (mL·kg −1 ·min ) = 35.477 + 1.832 × duration min - 0.010 body mass kg developed. cross-validation statistics were: R .54, CE 0.1 mL·kg , SEE 2.5 (4.6%), TE 2.6 (4.9%). values (CE,...

10.1123/pes.22.4.624 article EN Pediatric Exercise Science 2010-11-01

In this paper we study the detection of hesitation filled pauses in oral presentations university lectures taught Greek language and recorded using a tablet PC via specialized software. We suggest hierarchical approach fusing video data with audio for increasing precision rate our system. The method works at frame level rather than usual segmental more accurate synchronization after removing detected hesitations. Audio characteristics are modeled Gaussian Mixture Models while stationarity is...

10.5281/zenodo.41616 article EN European Signal Processing Conference 2009-08-24

Gene Ontology information related to the biological role of genes is organized in a hierarchical manner that can be represented by directed acyclic graph (DAG). Space filling visualizations, such as treemaps, have capacity display thousands items legibly limited space via two-dimensional rectangular map. Treemaps been used visualize first transforming DAG into tree. However this transformation has several undesirable effects producing trees with large number nodes and scattering rectangles...

10.7155/jgaa.00190 article EN cc-by Journal of Graph Algorithms and Applications 2009-01-01

This paper presents BrainNetVis, a tool which serves brain network modelling and visualization, by providing both quantitative qualitative measures of interconnectivity. It emphasizes the needs that led to creation this presenting similar works in field describing how our contributes existing scenery. also describes methods used for calculation graph metrics (global vertex metrics), carry information. To make clear understandable, we use an exemplar dataset throughout paper, on calculations...

10.1155/2011/747290 article EN Computational Intelligence and Neuroscience 2011-01-01

BrainNetVis is an application, written in Java, that displays and analyzes synchronization networks from brain signals. The program implements a number of network indices visualization techniques. We demonstrate its use through case study left hand foot motor imagery. data sets were provided by the Berlin BCI group. Using this we managed to find differences between average comparing them with idle state network.

10.1109/iembs.2009.5334489 article EN Annual International Conference of the IEEE Engineering in Medicine and Biology Society 2009-09-01

Hidden Markov models (HMMs) are becoming the dominant approach for text-to-speech synthesis (TTS). HMMs provide an attractive acoustic modeling scheme which has been exhaustively investigated and developed many years. Modern HMM-based speech synthesizers have approached quality of best state-of-the-art unit selection systems. However, we believe that statistical parametric not reached its potential, since limited by several assumptions do apply to properties speech. We, therefore, propose in...

10.1109/icassp.2014.6853606 article EN 2014-05-01

Linear Dynamical Models (LDMs) have been used in speech synthesis recently as an alternative to hidden Markov models (HMMs). Among the advantages of LDMs are ability capture dynamics and achievement synthesized quality similar HMM-based systems on a smaller footprint. However, such HMM case, produce over-smoothed trajectories parameters, resulting muffled synthetic speech. Inspired by problem found synthesis, where naturalness is greatly improved when global variance (GV) compensated, this...

10.1109/lsp.2016.2580672 article EN IEEE Signal Processing Letters 2016-06-14

Low speech intelligibility in noisy listening conditions makes more difficult our communication with others.Various strategies have been suggested to modify a signal before it is presented environment the goal increase its intelligibility.A state-of-the art approach, referred as Spectral Shaping and Dynamic Range Compression (SS-DRC), relies on modifying spectral temporal structure of clean has shown considerably improve conditions.In this paper, we present non-causal Wavenet-like model for...

10.21437/interspeech.2018-2119 article EN Interspeech 2022 2018-08-28

DAGmaps are space lling visualizations of DAGs that generalize treemaps. Deciding whether or not a DAG admits DAGmap is NPcomplete. Although any layered planar one-dimensional there was no complete characterization the class admit DAGmap. In this paper we prove if and only it directed -visibility representation. Then characterize representations. This consists downward straight-line drawing such all source sink vertices assigned to external face. Finally show denes three-dimensional

10.7155/jgaa.00262 article EN cc-by Journal of Graph Algorithms and Applications 2012-01-01
Coming Soon ...