SpanSeq: similarity-based sequence data splitting method for improved development and assessment of deep learning projects
Similarity (geometry)
Biological data
Sequence (biology)
DOI:
10.1093/nargab/lqae106
Publication Date:
2024-08-16T10:09:54Z
AUTHORS (5)
ABSTRACT
The use of deep learning models in computational biology has increased massively recent years, and it is expected to continue with the current advances fields such as Natural Language Processing. These models, although able draw complex relations between input target, are also inclined learn noisy deviations from pool data used during their development. In order assess performance on unseen (their capacity
SUPPLEMENTAL MATERIAL
Coming soon ....
REFERENCES (65)
CITATIONS (3)
EXTERNAL LINKS
PlumX Metrics
RECOMMENDATIONS
FAIR ASSESSMENT
Coming soon ....
JUPYTER LAB
Coming soon ....