NFDI4DS | UHH-SEMS - Publication Details

Effect of Atmospheric Aging on Soot Particle Toxicity in Lung Cell Models at the Air–Liquid Interface: Differential Toxicological Impacts of Biogenic and Anthropogenic Secondary Organic Aerosols (SOAs)

OPENALEX - Publications

Svenja Offer, Elena Hartner, Sebastiano Di Bucchianico, Christoph Bisig, Stefanie Bauer, and 38 more

Background: Secondary organic aerosols (SOAs) formed from anthropogenic or biogenic gaseous precursors in the atmosphere substantially contribute to the ambient fine particulate matter [PM ≤2.5 μm in aerodynamic diameter (PM2.5)] burden, which has been associated with adverse human health effects. However, there is only limited evidence on their differential toxicological impact. Objectives: We aimed to discriminate toxicological effects of aerosols generated by atmospheric aging on combustion soot particles (SPs) of gaseous biogenic (β-pinene)...

10.1289/ehp9413 article EN public-domain Environmental Health Perspectives 2022-02-01

PureCLIP: capturing target-specific protein–RNA interaction footprints from single-nucleotide CLIP-seq data

Sabrina Krakau, Hugues Richard, Annalisa Marsico

The iCLIP and eCLIP techniques facilitate the detection of protein–RNA interaction sites at high resolution, based on diagnostic events at crosslink sites. However, previous methods do not explicitly model the specifics of iCLIP and eCLIP truncation patterns and possible biases. We developed PureCLIP (https://github.com/skrakau/PureCLIP), a hidden Markov model based approach, which simultaneously performs peak-calling and individual crosslink site detection. It incorporates non-specific background signal and, for the first time, non-specific sequence biases. On both...

10.1186/s13059-017-1364-2 article EN cc-by Genome Biology 2017-12-01

pysster: classification of biological sequences by learning sequence and structure motifs with convolutional neural networks

Stefan Budach, Annalisa Marsico

Convolutional neural networks (CNNs) have been shown to perform exceptionally well in a variety of tasks, including biological sequence classification. Available implementations, however, are usually optimized for a particular task and difficult to reuse. To enable researchers to utilize these networks more easily, we implemented pysster, a Python package for training CNNs on biological sequence data. Sequences are classified by learning sequence and structure motifs, and the package offers an automated hyper-parameter optimization procedure and options to visualize...

10.1093/bioinformatics/bty222 article EN cc-by-nc Bioinformatics 2018-04-05

SND1 binds SARS-CoV-2 negative-sense RNA and promotes viral RNA synthesis through NSP9

Nora Schmidt, Sabina Ganskih, Yuanjie Wei, Alexander Gabel, Sebastian Zielinski, and 26 more

Regulation of viral RNA biogenesis is fundamental to productive SARS-CoV-2 infection. To characterize host RNA-binding proteins (RBPs) involved in this process, we biochemically identified proteins bound to genomic and subgenomic SARS-CoV-2 RNAs. We find that the host protein SND1 binds the 5' end of negative-sense viral RNA and is required for SARS-CoV-2 RNA synthesis. SND1-depleted cells form smaller replication organelles and display diminished virus growth kinetics. We discover that NSP9, a viral RBP and direct SND1 interaction partner, is covalently linked to the 5' ends of positive- and negative-sense RNAs...

10.1016/j.cell.2023.09.002 article EN cc-by Cell 2023-10-01

TriPepSVM: de novo prediction of RNA-binding proteins based on short amino acid motifs

Annkatrin Bressin, Roman Schulte-Sasse, Davide Figini, Erika C. Urdaneta, Benedikt M. Beckmann, and 1 more

In recent years, hundreds of novel RNA-binding proteins (RBPs) have been identified, leading to the discovery of novel RNA-binding domains. Furthermore, unstructured or disordered low-complexity regions of RBPs have been identified to play an important role in interactions with nucleic acids. However, these advances in understanding RBPs are limited mainly to eukaryotic species, and we have only limited tools to faithfully predict RNA-binders in bacteria. Here, we describe a support vector machine-based method, called TriPepSVM, for the prediction of RNA-binding proteins....

10.1093/nar/gkz203 article EN cc-by Nucleic Acids Research 2019-03-18

Kinetics of Xist-induced gene silencing can be predicted from combinations of epigenetic and genomic features

Lisa Barros de Andrade e Sousa, Iris H. Jonkers, Laurène Syx, Ilona Dunkel, Julie Chaumeil, and 7 more

To initiate X-Chromosome inactivation (XCI), the long noncoding RNA Xist mediates chromosome-wide gene silencing of one X Chromosome in female mammals to equalize gene dosage between the sexes. The efficiency of gene silencing is highly variable across genes, with some genes even escaping XCI in somatic cells. A gene's susceptibility to Xist-mediated silencing appears to be determined by a complex interplay of epigenetic and genomic features; however, the underlying rules remain poorly understood. We have quantified gene silencing kinetics at the level of the nascent...

10.1101/gr.245027.118 article EN cc-by-nc Genome Research 2019-06-07

Exposure to naphthalene and β-pinene-derived secondary organic aerosol induced divergent changes in transcript levels of BEAS-2B cells

Michal Pardo, Svenja Offer, Elena Hartner, Sebastiano Di Bucchianico, Christoph Bisig, and 41 more

The health effects of exposure to secondary organic aerosols (SOAs) are still limited. Here, we investigated and compared the toxicities of soot particles (SP) coated with β-pinene SOA (SOAβPin-SP) and SP coated with naphthalene SOA (SOANap-SP) in a human bronchial epithelial cell line (BEAS-2B) residing at the air–liquid interface. SOAβPin-SP mostly contained oxygenated aliphatic compounds from β-pinene photooxidation, whereas SOANap-SP contained a significant fraction of oxygenated aromatic products under similar conditions. Following exposure,...

10.1016/j.envint.2022.107366 article EN cc-by Environment International 2022-06-21

Towards in silico CLIP-seq: predicting protein-RNA interaction via sequence-to-signal learning

Marc Horlacher, Nils Wagner, Lambert Moyon, Klara Kuret, Nicolas Goedert, and 5 more

We present RBPNet, a novel deep learning method, which predicts the CLIP-seq crosslink count distribution from RNA sequence at single-nucleotide resolution. By training on up to a million regions, RBPNet achieves high generalization on eCLIP, iCLIP and miCLIP assays, outperforming state-of-the-art classifiers. RBPNet performs bias correction by modeling the raw signal as a mixture of the protein-specific and background signal. Through model interrogation via Integrated Gradients, RBPNet identifies predictive...

10.1186/s13059-023-03015-7 article EN cc-by Genome Biology 2023-08-04

Long ncRNA A-ROD activates its target gene DKK1 at its release from chromatin

Evgenia Ntini, Annita Louloupi, Julia Liz, José M. Muiño, Annalisa Marsico, and 1 more

Long ncRNAs are often enriched in the nucleus and at chromatin, but whether their dissociation from chromatin is important for their role in transcription regulation is unclear. Here, we group long ncRNAs using epigenetic marks, expression and strength of chromosomal interactions; we find that long ncRNAs transcribed from loci engaged in strong long-range chromosomal interactions are less abundant at chromatin, suggesting the release from chromatin as a crucial functional aspect of long ncRNAs in transcription regulation of their target genes. To gain mechanistic insight into this, we functionally validate the long ncRNA A-ROD, which...

10.1038/s41467-018-04100-3 article EN cc-by Nature Communications 2018-04-18

ssHMM: extracting intuitive sequence-structure motifs from high-throughput RNA-binding protein data

David N. Heller, Ralf Krestel, Uwe Ohler, Martin Vingron, Annalisa Marsico

RNA-binding proteins (RBPs) play an important role in RNA post-transcriptional regulation and recognize target RNAs via sequence-structure motifs. The extent to which RNA structure influences protein binding in the presence or absence of a sequence motif is still poorly understood. Existing RNA motif finders either take the structure of the RNA only partially into account, or employ models which are not directly interpretable as sequence-structure motifs. We developed ssHMM, an RNA motif finder based on a hidden Markov model (HMM) and Gibbs sampling which fully captures the relationship between...

10.1093/nar/gkx756 article EN cc-by-nc Nucleic Acids Research 2017-08-17

A systematic benchmark of machine learning methods for protein–RNA interaction prediction

Marc Horlacher, Giulia Cantini, Julian Hesse, Patrick Schinke, Nicolas Goedert, and 3 more

RNA-binding proteins (RBPs) are central actors of RNA post-transcriptional regulation. Experiments to profile binding sites of RBPs in vivo are limited to transcripts expressed in the experimental cell type, creating the need for computational methods to infer missing binding information. While numerous machine-learning based methods have been developed for this task, their use of heterogeneous training and evaluation datasets across different sets of RBPs and CLIP-seq protocols makes a direct comparison of their performance difficult....

10.1093/bib/bbad307 article EN cc-by Briefings in Bioinformatics 2023-08-26

MeMotif: a database of linear motifs in α-helical transmembrane proteins

Annalisa Marsico, Kerstin Scheubert, Anne Tuukkanen, Andreas Henschel, Christof Winter, and 2 more

Membrane proteins are important for many processes in the cell and are used as main drug targets. The increasing number of high-resolution structures available makes, for the first time, a characterization of local structural and functional motifs in α-helical transmembrane proteins possible. MeMotif (http://projects.biotec.tu-dresden.de/memotif) is a database and wiki which collects more than 2000 known and novel computationally predicted linear motifs in α-helical transmembrane proteins. Motifs are fully described in terms of several features and are editable. The motifs contained in the database can be...

10.1093/nar/gkp1042 article EN cc-by-nc Nucleic Acids Research 2009-11-11

Identification of 170 New Long Noncoding RNAs in Schistosoma mansoni

Victor Fernandes de Oliveira, Lauro A. G. Moares, Ester Alves Mota, Liana K. Jannotti-Passos, Paulo Marcos Zech Coelho, and 5 more

Long noncoding RNAs (lncRNAs) are transcripts generally longer than 200 nucleotides with no or poor protein coding potential, and most of their functions are also poorly characterized. Recently, an increasing number of studies have shown that lncRNAs can be involved in various critical biological processes such as organism development or cancer progression. Little, however, is known about their effects in helminth parasites, such as Schistosoma mansoni. Here, we present a computational pipeline to identify...

10.1155/2018/1264697 article EN cc-by BioMed Research International 2018-07-11

A novel pattern recognition algorithm to classify membrane protein unfolding pathways with high-throughput single-molecule force spectroscopy

Annalisa Marsico, Dirk Labudde, K. Tanuj Sapra, Daniel J. Müller, Michael Schroeder

Motivation: Misfolding of membrane proteins plays an important role in many human diseases such as retinitis pigmentosa, hereditary deafness and diabetes insipidus. Little is known about membrane proteins as there are only very few high-resolution structures. Single-molecule force spectroscopy is a novel technique, which measures the force necessary to pull a protein out of a membrane. Such force curves contain valuable information on the protein structure, conformation, and inter- and intra-molecular forces. High-throughput force spectroscopy experiments...

10.1093/bioinformatics/btl293 article EN Bioinformatics 2007-01-15

Structural fragment clustering reveals novel structural and functional motifs in α-helical transmembrane proteins

Annalisa Marsico, Andreas Henschel, Christof Winter, Anne Tuukkanen, Boris Vassilev, and 2 more

A large proportion of an organism's genome encodes for membrane proteins. Membrane proteins are important for many cellular processes, and several diseases can be linked to mutations in them. With the tremendous growth of sequence data, there is an increasing need to reliably identify membrane proteins from sequence, to functionally annotate them, and to correctly predict their topology. We introduce a technique called structural fragment clustering, which learns sequential motifs from 3D structural fragments. From over 500,000 fragments, we obtain...

10.1186/1471-2105-11-204 article EN cc-by BMC Bioinformatics 2010-04-26

Identifying lncRNA-mediated regulatory modules via ChIA-PET network analysis

Denise Thiel, Nataša Djurdjevac Conrad, Evgenia Ntini, Ria X. Peschutter, Heike Siebert, and 1 more

Although several studies have provided insights into the role of long non-coding RNAs (lncRNAs), the majority of them have unknown function. Recent evidence has shown the importance of both lncRNAs and chromatin interactions in transcriptional regulation. Although network-based methods, mainly exploiting gene-lncRNA co-expression, have been applied to characterize lncRNA function by means of 'guilt-by-association', no strategy exists so far which identifies mRNA-lncRNA functional modules based on the 3D chromatin interaction graph. To...

10.1186/s12859-019-2900-8 article EN cc-by BMC Bioinformatics 2019-05-29

PureCLIP: capturing target-specific protein-RNA interaction footprints from single-nucleotide CLIP-seq data

Sabrina Krakau, Hugues Richard, Annalisa Marsico

iCLIP and eCLIP techniques facilitate the detection of protein-RNA interaction sites at high resolution, based on diagnostic events at crosslink sites. However, previous methods do not explicitly model the specifics of iCLIP and eCLIP truncation patterns and possible biases. We developed PureCLIP, a hidden Markov model based approach, which simultaneously performs peak calling and individual crosslink site detection. It incorporates RNA abundances and, for the first time, non-specific sequence biases. On both simulated and real data, PureCLIP is more...

10.1101/146704 preprint EN bioRxiv (Cold Spring Harbor Laboratory) 2017-06-07

Network-Based Methods and Other Approaches for Predicting lncRNA Functions and Disease Associations

Rosario M. Piro, Annalisa Marsico

10.1007/978-1-4939-8982-9_12 article EN Methods in Molecular Biology 2019-01-01

Genome-wide measurement of RNA dissociation from chromatin classifies transcripts by their dynamics and reveals rapid dissociation of enhancer lncRNAs

Evgenia Ntini, Stefan Budach, Ulf Andersson Ørom, Annalisa Marsico

10.1016/j.cels.2023.09.005 article EN publisher-specific-oa Cell Systems 2023-10-01

pysster: Classification of Biological Sequences by Learning Sequence and Structure Motifs with Convolutional Neural Networks

Stefan Budach, Annalisa Marsico

Summary: Convolutional neural networks (CNNs) have been shown to perform exceptionally well in a variety of tasks, including biological sequence classification. Available implementations, however, are usually optimized for a particular task and difficult to reuse. To enable researchers to utilize these networks more easily, we implemented pysster, a Python package for training CNNs on biological sequence data. Sequences are classified by learning sequence and structure motifs, and the package offers an automated hyper-parameter optimization procedure...

10.1101/230086 preprint EN cc-by-nc-nd bioRxiv (Cold Spring Harbor Laboratory) 2017-12-06

Predicting enhancers using a small subset of high confidence examples and co-training

Matthew R. Huska, Anna Ramisch, Martin Vingron, Annalisa Marsico

Enhancers are important regulatory regions located throughout the genome, primarily in non-coding regions. Several experimental methods have been developed over the last several years to identify their location, but the search space is large and the overlap between the putative enhancer regions identified using these methods tends to be very small. Computational methods for enhancer prediction often use one set of experimentally identified enhancer regions as input, and therefore rely critically on their correctness. We chose to take a different approach, and start with high confidence...

10.7287/peerj.preprints.2407v1 preprint EN 2016-09-01

Predicting enhancers using a small subset of high confidence examples and co-training

Matthew R. Huska, Anna Ramisch, Martin Vingron, Annalisa Marsico

Enhancers are important regulatory regions located throughout the genome, primarily in non-coding regions. Several experimental methods have been developed over the last several years to identify their location, but the search space is large and the overlap between the putative enhancer regions identified using these methods tends to be very small. Computational methods for enhancer prediction often use one set of experimentally identified enhancer regions as input, and therefore rely critically on their correctness. We chose to take a different approach, and start with high confidence...

10.7287/peerj.preprints.2407 preprint EN 2016-09-01