Lucas B. Rocha

ORCID: 0000-0002-9606-2569
Publications
Citations
Views
---
Saved
---
About
Contact & Profiles
Research Areas
  • Algorithms and Data Compression
  • Genomics and Phylogenetic Studies
  • Machine Learning in Bioinformatics
  • Genome Rearrangement Algorithms
  • Advanced Proteomics Techniques and Applications
  • Coding theory and cryptography
  • graph theory and CDMA systems

Universidade Federal de Mato Grosso do Sul
2018-2024

Abstract In computational biology, mapping a sequence s onto graph G poses significant challenge. One possible approach to tackling this problem is find walk p in that spells most similar . This challenge formally known as the Graph Sequence Mapping Problem ( GSMP ). paper, we delve into an alternative formulation De Bruijn BSMP Both problems have three variants: changes only sequence, graph, and both graph. We concentrate on addressing variant involving literature, when does not allow...

10.1101/2024.02.15.580401 preprint EN bioRxiv (Cold Spring Harbor Laboratory) 2024-02-15

Abstract In computational biology, mapping a sequence s onto graph G is significant challenge. One possible approach to addressing this problem identify walk p in that spells which most similar . This known as the Graph Sequence Mapping Problem ( GSMP ). paper, we study an alternative formulation, namely De Bruijn BSMP ), can be stated follows: given and k (where ≥ 2), find according distance metric. We present both exact algorithms approximate heuristics for solving problem, using edit...

10.1101/2023.02.05.527069 preprint EN cc-by-nc-nd bioRxiv (Cold Spring Harbor Laboratory) 2023-02-07

Given two sets of sequences A and B, the Substring Specific problem is to find all minimum substrings in having distance at least k for each subsequence B. This work addresses three new implementations Maaß algorithm when Hamming considered: a naive cubic-time quadratic-time algorithms. We run tests compare running time these another recently described implementation that uses edit distance. In addition, we conducted preliminary testing on large Tara Ocean database, looking efficient...

10.5753/bsb_estendido.2018.8801 article EN 2018-10-30

An important problem in Computational Biology is to determine genetic markers, substrings of a set sequences that do not occur on other sets. Applications for this include finding small specific regions primer design and find organisms or metagenomes. Genetic markers can be addressed by the Specific Substring Problem - SSP which consists all minimal given with at least k differences among another sequence set. Since spend quadratic time when Hamming distance considered we have, general,...

10.1109/bibe.2019.00052 article EN 2019-10-01
Coming Soon ...