SpanSeq: similarity-based sequence data splitting method for improved development and assessment of deep learning projects

Similarity (geometry) Biological data Sequence (biology)
DOI: 10.1093/nargab/lqae106 Publication Date: 2024-08-16T10:09:54Z
ABSTRACT
The use of deep learning models in computational biology has increased massively recent years, and it is expected to continue with the current advances fields such as Natural Language Processing. These models, although able draw complex relations between input target, are also inclined learn noisy deviations from pool data used during their development. In order assess performance on unseen (their capacity
SUPPLEMENTAL MATERIAL
Coming soon ....
REFERENCES (65)
CITATIONS (3)