A Convolutional Neural Network-Based Approach for the Rapid Annotation of Molecularly Diverse Natural Products

Cyclic Biological Products 0303 health sciences Tumor Magnetic Resonance Spectroscopy Neural Networks Cheminformatics General Chemistry Cyanobacteria Peptides, Cyclic Cell Line 3. Good health Machine Learning Computer 03 medical and health sciences Cell Line, Tumor Chemical Sciences Humans Neural Networks, Computer Peptides Cancer
DOI: 10.1021/jacs.9b13786 Publication Date: 2020-02-11T17:06:56Z
ABSTRACT
This report describes the first application of the novel NMR-based machine learning tool "Small Molecule Accurate Recognition Technology" (SMART 2.0) for mixture analysis and subsequent accelerated discovery and characterization of new natural products. The concept was applied to the extract of a filamentous marine cyanobacterium known to be a prolific producer of cytotoxic natural products. This environmental Symploca extract was roughly fractionated, and then prioritized and guided by cancer cell cytotoxicity, NMR-based SMART 2.0, and MS2-based molecular networking. This led to the isolation and rapid identification of a new chimeric swinholide-like macrolide, symplocolide A, as well as the annotation of swinholide A, samholides A-I, and several new derivatives. The planar structure of symplocolide A was confirmed to be a structural hybrid between swinholide A and luminaolide B by 1D/2D NMR and LC-MS2 analysis. A second example applies SMART 2.0 to the characterization of structurally novel cyclic peptides, and compares this approach to the recently appearing "atomic sort" method. This study exemplifies the revolutionary potential of combined traditional and deep learning-assisted analytical approaches to overcome longstanding challenges in natural products drug discovery.
SUPPLEMENTAL MATERIAL
Coming soon ....
REFERENCES (55)
CITATIONS (152)
EXTERNAL LINKS
PlumX Metrics
RECOMMENDATIONS
FAIR ASSESSMENT
Coming soon ....
JUPYTER LAB
Coming soon ....