Jan Holub

ORCID: 0000-0003-3022-2694
Research Areas
  • Algorithms and Data Compression
  • Network Packet Processing and Optimization
  • Natural Language Processing Techniques
  • Genomics and Phylogenetic Studies
  • semigroups and automata theory
  • DNA and Biological Computing
  • Data Management and Algorithms
  • Advanced Database Systems and Queries
  • Web Data Mining and Analysis
  • Diverse Musicological Studies
  • Advanced Data Compression Techniques
  • RNA and protein synthesis mechanisms
  • Musicology and Musical Analysis
  • Error Correcting Code Techniques
  • Information Technology Governance and Strategy
  • Fractal and DNA sequence analysis
  • Music and Audio Processing
  • Music Technology and Sound Studies
  • Plant and Fungal Interactions Research
  • Data Mining Algorithms and Applications
  • Cellular Automata and Applications
  • Plant Virus Research Studies
  • Urbanization and City Planning
  • Cancer-related molecular mechanisms research
  • Evolutionary Algorithms and Applications

Czech Technical University in Prague
2014-2023

Jan Evangelista Purkyně University in Ústí nad Labem
2019

Research Institute of Inorganic Chemistry
2019

The constant advances in sequencing technology have redefined the way genome sequencing is performed. They are able to produce tens of millions of short sequences (reads) during a single experiment, and with a much lower cost than previously possible. Due to this massive amount of data, efficient algorithms for mapping these reads to a reference genome are in great demand, and recently there has been ample work publishing such algorithms. In this paper, we study a different version of this problem: mapping reads to a dynamically changing genomic sequence. We propose...

10.1109/bibm.2010.5706551 article EN 2010 IEEE International Conference on Bioinformatics and Biomedicine (BIBM) 2010-12-01

10.1016/j.jda.2006.10.003 article EN publisher-specific-oa Journal of Discrete Algorithms 2006-12-07

10.1016/j.ipl.2009.11.010 article EN Information Processing Letters 2009-11-28

Prefix-free parsing (PFP) was introduced by Boucher et al. (2019) as a preprocessing step to ease the computation of Burrows-Wheeler Transforms (BWTs) of genomic databases. Given a string S, it produces a dictionary D and a parse P of overlapping phrases such that BWT(S) can be computed from D and P in time and workspace bounded in terms of their combined size |PFP(S)|. In practice D and P are significantly smaller than S, and computing BWT(S) from them is more efficient than computing it from S directly, at least when S is the concatenation of many genomes. In this paper, we consider...

10.1137/1.9781611976472.5 article EN Society for Industrial and Applied Mathematics eBooks 2021-01-01
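
As a rough illustration of the dictionary-and-parse idea in the abstract above, the following Python sketch splits a string into overlapping phrases at "trigger" windows. It is a simplified sketch under stated assumptions, not the authors' implementation: the function names, the window length `w`, the modulus `p`, the sentinel characters `#` and `$`, and the use of `zlib.crc32` in place of a rolling hash are all choices made here for illustration.

```python
import zlib


def prefix_free_parse(s: str, w: int = 2, p: int = 3):
    """Return (dictionary, parse) for s.

    Assumptions of this sketch: w >= 1, and '#' and '$' do not occur in s.
    """
    t = "#" + s + "$" * w          # pad with sentinels
    last = len(t) - w              # start of the final window
    # Phrase boundaries: position 0, every window whose hash is 0 mod p,
    # and the final window of sentinels.
    bounds = [0]
    for i in range(1, last):
        if zlib.crc32(t[i:i + w].encode()) % p == 0:
            bounds.append(i)
    bounds.append(last)
    # Consecutive phrases overlap by exactly w characters.
    phrases = [t[a:b + w] for a, b in zip(bounds, bounds[1:])]
    dictionary = sorted(set(phrases))
    rank = {ph: r for r, ph in enumerate(dictionary)}
    parse = [rank[ph] for ph in phrases]
    return dictionary, parse


def reconstruct(dictionary, parse, w: int = 2) -> str:
    """Invert prefix_free_parse by gluing phrases and dropping the overlaps."""
    t = dictionary[parse[0]]
    for r in parse[1:]:
        t += dictionary[r][w:]
    return t[1:len(t) - w]         # strip '#' and the trailing '$' padding


if __name__ == "__main__":
    D, P = prefix_free_parse("GATTACAGATTACAGATTACA")
    print(D, P, reconstruct(D, P))
```

On repetitive input the same phrases recur, so the dictionary tends to stay small while the parse records only phrase ranks; that is the intuition behind the size claim in the abstract.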

DCA (Data Compression using Antidictionaries) is a novel lossless data compression method working on bit streams presented by Crochemore et al. DCA takes advantage of words that do not occur as factors in the text, i.e. that are forbidden. Due to these forbidden words (antiwords), some symbols in the text can be predicted. We build the antidictionary using a suffix array in time O(k * N log N), where k is the maximal antiword length. The length of the suffix array and the LCP array constructed over a binary alphabet will be 8 times the length of the input text. Still the memory...

10.1109/dcc.2008.95 article EN DCC 2008-03-01
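
To make the antiword idea in the abstract above concrete, here is a minimal Python sketch. It enumerates minimal forbidden words by brute force rather than with the suffix array construction the abstract describes, so it only suits tiny inputs; the function names and the bound `k` as a parameter are assumptions of this sketch.

```python
from itertools import product


def minimal_forbidden_words(text: str, k: int, alphabet: str = "01") -> set:
    """Brute-force antiwords of length <= k.

    A word is a minimal forbidden word if it does not occur in text but
    both its longest proper prefix and longest proper suffix do.
    """
    found = set()
    for length in range(1, k + 1):
        for letters in product(alphabet, repeat=length):
            word = "".join(letters)
            if word not in text and word[:-1] in text and word[1:] in text:
                found.add(word)
    return found


def predict_next(prefix: str, antiwords: set):
    """Predict the next bit of a binary text from its already seen prefix.

    If the prefix ends with u and u+b is an antiword, the next bit cannot
    be b, so it must be the other bit. Returns None if nothing applies.
    """
    for word in antiwords:
        if prefix.endswith(word[:-1]):
            return "1" if word[-1] == "0" else "0"
    return None


if __name__ == "__main__":
    anti = minimal_forbidden_words("01101", 4)
    print(sorted(anti))               # antiwords of "01101" up to length 4
    print(predict_next("0110", anti))  # "00" is forbidden, so the next bit is "1"
```

A predicted bit does not need to be stored, which is where the compression comes from.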

Motivation: The many thousands of high-quality genomes available nowadays imply a shift from single genome to pan-genomic analyses. A basic algorithmic building brick for such a scenario is online search over a collection of similar texts, a problem with surprisingly few solutions presented so far. Results: We present SOPanG, a simple tool for exact pattern matching over an elastic-degenerate string, a recently proposed simplified model for the pan-genome. Thanks to bit-parallelism, it achieves speeds above 400...

10.1093/bioinformatics/bty506 article EN Bioinformatics 2018-06-20
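
The matching problem named in the abstract above can be sketched naively in Python. This is not SOPanG and uses no bit-parallelism; it is a plain set-based sketch that assumes an elastic-degenerate string is given as a list of segments, each a collection of variant strings (the empty string is allowed), and that the pattern is non-empty. The function name and the choice to report the indices of segments in which a match ends are assumptions of this sketch.

```python
def ed_search(segments, pattern: str):
    """Return indices of segments in which some occurrence of pattern ends.

    active holds the lengths of pattern prefixes that are matched up to
    the previous segment boundary along some choice of variants.
    """
    active = set()
    hits = []
    for idx, variants in enumerate(segments):
        nxt = set()
        found = False
        for v in variants:
            # Continue matches crossing the boundary; 0 is a fresh start
            # at the first character of the variant.
            for j in active | {0}:
                rest = pattern[j:]
                if v.startswith(rest):
                    found = True              # pattern completes inside v
                elif rest.startswith(v):
                    nxt.add(j + len(v))       # v is consumed, match continues
            # Fresh starts strictly inside the variant.
            for s in range(1, len(v)):
                tail = v[s:]
                if tail.startswith(pattern):
                    found = True
                elif pattern.startswith(tail):
                    nxt.add(len(tail))
        if found:
            hits.append(idx)
        active = nxt - {0}
    return hits


if __name__ == "__main__":
    ed = [["AC"], ["G", "T", ""], ["TA"]]
    print(ed_search(ed, "CTA"))  # matches via the empty variant: AC + "" + TA
```

A bit-parallel tool would keep the set of active prefix lengths in machine words instead of a Python set; the state being tracked is the same.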

Nowadays, decreasing cost and better accessibility of sequencing methods have enabled studies of genetic variation between individuals of the same species and also between two related species. This has led to a rapid increase in biological data consisting of sequences that are very similar to each other, these sequences usually being stored together in one database. We propose a compression method based on a Wavelet Tree FM-index optimized for a set of similar sequences. The method is based on tracking single changes (together with their context) between every...

10.1109/dcc.2014.47 article EN Data Compression Conference 2014-03-01

Raspberry plants, valued for their fruits, are vulnerable to a range of viruses that adversely affect their yield and quality. Utilizing high-throughput sequencing (HTS), we identified a novel virus, tentatively named raspberry enamovirus 1 (RaEV1), in three distinct raspberry plants. This study provides a comprehensive characterization of RaEV1, focusing on its genomic structure, phylogeny, and possible transmission routes. Analysis of nearly complete genomes from 14 RaEV1 isolates highlighted regions of variance,...

10.3390/v15122281 article EN cc-by Viruses 2023-11-21

10.1016/j.tcs.2017.04.008 article EN publisher-specific-oa Theoretical Computer Science 2017-05-02

We introduce quasi suffix arrays as a generalization of suffix arrays for character strings. We show that a quasi suffix array encodes enough of the structure of the string to be a useful construct in many applications where the full power of the suffix array is not necessary, notably in problems that do not require lexicographical order, for example, pattern-matching or calculation of repeating substrings. We are interested in quasi suffix arrays, as we believe that they can be calculated by simple, fast, and space efficient algorithms. As a first step towards this goal, we describe a family of DIST algorithms...

10.5555/998223.998228 article EN Journal of automata, languages and combinatorics 2003-07-01

Covid-19 related restrictions bring new challenges to all aspects and phases of higher education. At universities, new remote formats are deployed or even developed both for lectures and laboratory exercises. This article addresses and describes the experience with remote lessons during the pandemic time at the Department of Measurement of the Czech Technical University in Prague, Faculty of Electrical Engineering. It introduces the best practices exercised using Home Lab, which helps with distance teaching of practical electronic classes....

10.1016/j.measen.2021.100121 article EN cc-by-nc-nd Measurement Sensors 2021-09-22

10.1016/s0304-3975(00)00064-5 article EN Theoretical Computer Science 2000-10-01

A virome screen was performed on a new breeding line, KB1, of blackcurrant. Rhabdovirus-like particles were observed by electron microscopy in ultrathin sections of flower stalks, and the complete genome sequence of a novel virus, provisionally named blackcurrant rhabdovirus 2 (BCRV2), was determined and verified using high-throughput sequencing. The genomic organization of BCRV2 was characteristic of cytorhabdoviruses (family Rhabdoviridae) and included seven genes: 3′-N-P′-P-P3-M-G-L-5′. BLASTP analysis revealed...

10.3390/v14112456 article EN cc-by Viruses 2022-11-06