NFDI4DS | UHH-SEMS - Publication Details

Online LiDAR-SLAM for Legged Robots with Robust Registration and Deep-Learned Loop Closure

OPENALEX - Publications

Milad Ramezani Georgi Tinchev Iuganov Egor Maurice Fallon

In this paper, we present a 3D factor-graph LiDAR-SLAM system which incorporates state-of-the-art deeply learned feature-based loop closure detector to enable legged robot localize and map in industrial environments. Point clouds are accumulated using an inertial-kinematic state estimator before being aligned ICP registration. To close loops use proposal mechanism matches individual segments between clouds. We trained descriptor offline match these segments. The efficiency of our method...

10.1109/icra40945.2020.9196769 article EN 2020-05-01

Universal Neural Vocoding with Parallel Wavenet

OPENALEX - Publications

Yunlong Jiao Adam Gabryś Georgi Tinchev Bartosz Putrycz Daniel Korzekwa and 1 more

We present a universal neural vocoder based on Parallel WaveNet, with an additional conditioning network called Audio Encoder. Our offers real-time high-quality speech synthesis wide range of use cases. tested it 43 internal speakers diverse age and gender, speaking 20 languages in 17 unique styles, which 7 voices 5 styles were not exposed during training. show that the proposed significantly outperforms speaker-dependent vocoders overall. also several existing architectures terms...

10.1109/icassp39728.2021.9414444 article EN ICASSP 2022 - 2022 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) 2021-05-13

Learning to See the Wood for the Trees: Deep Laser Localization in Urban and Natural Environments on a CPU

OPENALEX - Publications

Georgi Tinchev Adrián Peñate-Sánchez Maurice Fallon

Localization in challenging, natural environments such as forests or woodlands is an important capability for many applications from guiding a robot navigating along forest trail to monitoring vegetation growth with handheld sensors. In this work we explore laser-based localization both urban and environments, which suitable online applications. We propose deep learning approach capable of meaningful descriptors directly 3D point clouds by comparing triplets (anchor, positive negative...

10.1109/lra.2019.2895264 article EN IEEE Robotics and Automation Letters 2019-01-25

Predicting Alignment Risk to Prevent Localization Failure

OPENALEX - Publications

Simona Nobili Georgi Tinchev Maurice Fallon

During localization and mapping the success of point cloud registration can be compromised when there is an absence geometric features or constraints in corridors across doorways, volumes scanned only partly overlap, due to occlusions constrictions between subsequent observations. This work proposes a strategy predict prevent laser-based failure. Our solution relies on explicit analysis content prior registration. A model predicting risk failed alignment learned by analysing degree spatial...

10.1109/icra.2018.8462890 article EN 2018-05-01

SKD: Keypoint Detection for Point Clouds Using Saliency Estimation

OPENALEX - Publications

Georgi Tinchev Adrián Peñate-Sánchez Maurice Fallon

We present SKD, a novel keypoint detector that uses saliency to determine the best candidates from point cloud for tasks such as registration and reconstruction. The approach can be applied any differentiable deep learning descriptor by using gradients of with respect 3D position input points measure their saliency. is combined original context information in neural network, which trained learn robust candidates. key intuition behind this keypoints are not extracted solely result geometry...

10.1109/lra.2021.3065224 article EN IEEE Robotics and Automation Letters 2021-03-11

Seeing the Wood for the Trees: Reliable Localization in Urban and Natural Environments

OPENALEX - Publications

Georgi Tinchev Simona Nobili Maurice Fallon

In this work we introduce Natural Segmentation and Matching (NSM), an algorithm for reliable localization, using laser, in both urban natural environments. Current state-of-the-art global approaches do not generalize well to structure-poor vegetated areas such as forests or orchards. these environments clutter perceptual aliasing prevents repeatable extraction of distinctive landmarks between different test runs. forests, tree trunks are distinctive, foliage intertwines there is a complete...

10.1109/iros.2018.8594042 article EN 2021 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS) 2018-10-01

InstaLoc: One-shot Global Lidar Localisation in Indoor Environments through Instance Learning

OPENALEX - Publications

Lintong Zhang Sundara Tejaswi Digumarti Georgi Tinchev Maurice Fallon

Localization for autonomous robots in prior maps is crucial their functionality.This paper offers a solution to this problem indoor environments called InstaLoc, which operates on an individual lidar scan localize it within map.We draw inspiration from how humans navigate and position themselves by recognizing the layout of distinctive objects structures.Mimicking human approach, InstaLoc identifies matches object instances scene with those map.As far as we know, first method use panoptic...

10.15607/rss.2023.xix.070 article EN 2023-07-10

Modelling Low-Resource Accents Without Accent-Specific TTS Frontend

OPENALEX - Publications

Georgi Tinchev Marta Czarnowska Kamil Rafał Deja Kayoko Yanagisawa Marius Cotescu

This work focuses on modelling a speaker's accent that does not have dedicated text-to-speech (TTS) frontend, including grapheme-to-phoneme (G2P) module. Prior accents assumes phonetic transcription is available for the target accent, which might be case low-resource, regional accents. In our work, we propose an approach whereby first augment data to sound like donor voice via conversion, then train multi-speaker multi-accent TTS model combination of recordings and synthetic data, generate...

10.1109/icassp49357.2023.10095773 article EN ICASSP 2022 - 2022 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) 2023-05-05

Seeing the Wood for the Trees: Reliable Localization in Urban and Natural Environments

OPENALEX - Publications

Georgi Tinchev Simona Nobili Maurice Fallon

In this work we introduce Natural Segmentation and Matching (NSM), an algorithm for reliable localization, using laser, in both urban natural environments. Current state-of-the-art global approaches do not generalize well to structure-poor vegetated areas such as forests or orchards. these environments clutter perceptual aliasing prevents repeatable extraction of distinctive landmarks between different test runs. forests, tree trunks are distinctive, foliage intertwines there is a complete...

10.48550/arxiv.1809.02846 preprint EN other-oa arXiv (Cornell University) 2018-01-01

Universal Neural Vocoding with Parallel WaveNet

OPENALEX - Publications

Yunlong Jiao Adam Gabryś Georgi Tinchev Bartosz Putrycz Daniel Korzekwa and 1 more

We present a universal neural vocoder based on Parallel WaveNet, with an additional conditioning network called Audio Encoder. Our offers real-time high-quality speech synthesis wide range of use cases. tested it 43 internal speakers diverse age and gender, speaking 20 languages in 17 unique styles, which 7 voices 5 styles were not exposed during training. show that the proposed significantly outperforms speaker-dependent vocoders overall. also several existing architectures terms...

10.48550/arxiv.2102.01106 preprint EN other-oa arXiv (Cornell University) 2021-01-01

Modelling low-resource accents without accent-specific TTS frontend

OPENALEX - Publications

Georgi Tinchev Marta Czarnowska Kamil Rafał Deja Kayoko Yanagisawa Marius Cotescu

This work focuses on modelling a speaker's accent that does not have dedicated text-to-speech (TTS) frontend, including grapheme-to-phoneme (G2P) module. Prior accents assumes phonetic transcription is available for the target accent, which might be case low-resource, regional accents. In our work, we propose an approach whereby first augment data to sound like donor voice via conversion, then train multi-speaker multi-accent TTS model combination of recordings and synthetic data, generate...

10.48550/arxiv.2301.04606 preprint EN other-oa arXiv (Cornell University) 2023-01-01

Real-time LIDAR localization in natural and urban environments

OPENALEX - Publications

Georgi Tinchev Adrián Peñate-Sánchez Maurice Fallon

Localization is a key challenge in many robotics applications. In this work we explore LIDAR-based global localization both urban and natural environments develop method suitable for online application. Our approach leverages efficient deep learning architecture capable of compact point cloud descriptors directly from 3D data. The uses an feature space representation set segmented clouds to match between the current scene prior map. We show that down-sampling inner layers network can...

10.48550/arxiv.2301.13583 preprint EN other-oa arXiv (Cornell University) 2023-01-01

InstaLoc: One-shot Global Lidar Localisation in Indoor Environments through Instance Learning

OPENALEX - Publications

Lintong Zhang Tejaswi Digumarti Georgi Tinchev Maurice Fallon

Localization for autonomous robots in prior maps is crucial their functionality. This paper offers a solution to this problem indoor environments called InstaLoc, which operates on an individual lidar scan localize it within map. We draw inspiration from how humans navigate and position themselves by recognizing the layout of distinctive objects structures. Mimicking human approach, InstaLoc identifies matches object instances scene with those As far as we know, first method use panoptic...

10.48550/arxiv.2305.09552 preprint EN cc-by-nc-sa arXiv (Cornell University) 2023-01-01

Diffusion-based accent modelling in speech synthesis

OPENALEX - Publications

Kamil Rafał Deja Georgi Tinchev Marta Czarnowska Marius Cotescu Jasha Droppo

10.21437/interspeech.2023-154 article EN Interspeech 2022 2023-08-14

Cross-lingual Knowledge Distillation via Flow-based Voice Conversion for Robust Polyglot Text-To-Speech

OPENALEX - Publications

Dariusz Piotrowski Renard Korzeniowski Alessio Falai Sebastian Cygert Kamil Pokora and 3 more

In this work, we introduce a framework for cross-lingual speech synthesis, which involves an upstream Voice Conversion (VC) model and downstream Text-To-Speech (TTS) model. The proposed consists of 4 stages. the first two stages, use VC to convert utterances in target locale voice speaker. third stage, converted data is combined with linguistic features durations from recordings language, are then used train single-speaker acoustic Finally, last stage entails training locale-independent...

10.48550/arxiv.2309.08255 preprint EN other-oa arXiv (Cornell University) 2023-01-01

SKD: Keypoint Detection for Point Clouds using Saliency Estimation

OPENALEX - Publications

Georgi Tinchev Adrián Peñate-Sánchez Maurice Fallon

We present SKD, a novel keypoint detector that uses saliency to determine the best candidates from point cloud for tasks such as registration and reconstruction. The approach can be applied any differentiable deep learning descriptor by using gradients of with respect 3D position input points measure their saliency. is combined original context information in neural network, which trained learn robust candidates. key intuition behind this keypoints are not extracted solely result geometry...

10.48550/arxiv.1912.04943 preprint EN other-oa arXiv (Cornell University) 2019-01-01