NFDI4DS | UHH-SEMS - Publication Details

Pietro Pinoli

ORCID: 0000-0001-9786-2851

Publications

Citations

Views

---

Saved

---

About

Contact & Profiles

A5074094001

Research Areas

Gene expression and cancer classification
Bioinformatics and Genomic Networks
Genomics and Phylogenetic Studies
SARS-CoV-2 and COVID-19 Research
Biomedical Text Mining and Ontologies
Cancer Genomics and Diagnostics
Computational Drug Discovery Methods
Algorithms and Data Compression
vaccines and immunoinformatics approaches
Genomics and Chromatin Dynamics
Machine Learning in Bioinformatics
Scientific Computing and Data Management
COVID-19 Clinical Research Studies
Fiber-reinforced polymer composites
PARP inhibition in cancer therapy
Single-cell and spatial transcriptomics
Epigenetics and DNA Methylation
Semantic Web and Ontologies
Genetics, Bioinformatics, and Biomedical Research
Evolutionary Algorithms and Applications
Additive Manufacturing and 3D Printing Technologies
Bacteriophages and microbial interactions
Data Mining Algorithms and Applications
RNA modifications and cancer
Mechanical Behavior of Composites

Politecnico di Milano
2016-2025

Stanford University
2023

Center for Genomic Science
2015

Italian Institute of Technology
2015

University of Cyprus
2013

Chinese University of Hong Kong
2013

Applied Multilayers (United Kingdom)
2013

Lockheed Martin (United States)
1968-1982

Science Research Laboratory
1969-1975

GenoMetric Query Language: a novel approach to large-scale genomic data management

OPENALEX - Publications

Marco Masseroli Pietro Pinoli Francesco Venco Abdulrahman Kaitoua Vahid Jalili and 3 more

Abstract Motivation: Improvement of sequencing technologies and data processing pipelines is rapidly providing data, with associated high-level features, many individual genomes in multiple biological clinical conditions. They allow for data-driven genomic, transcriptomic epigenomic characterizations, but require state-of-the-art ‘big data’ computing strategies, abstraction levels beyond available tool capabilities. Results: We propose a high-level, declarative GenoMetric Query Language...

10.1093/bioinformatics/btv048 article EN Bioinformatics 2015-02-03

Employing a systematic approach to biobanking and analyzing clinical and genetic data for advancing COVID-19 research

OPENALEX - Publications

Sergio Daga Chiara Fallerini Margherita Baldassarri Francesca Fava Floriana Valentino and 22 more

Within the GEN-COVID Multicenter Study, biospecimens from more than 1000 SARS-CoV-2 positive individuals have thus far been collected in Biobank (GCB). Sample types include whole blood, plasma, serum, leukocytes, and DNA. The GCB links samples to detailed clinical data available Patient Registry (GCPR). It includes hospitalized patients (74.25%), broken down into intubated, treated by CPAP-biPAP, with O

10.1038/s41431-020-00793-7 article EN cc-by European Journal of Human Genetics 2021-01-17

Processing of big heterogeneous genomic datasets for tertiary analysis of Next Generation Sequencing data

OPENALEX - Publications

Marco Masseroli Arif Canakoglu Pietro Pinoli Abdulrahman Kaitoua Andrea Gulino and 6 more

We previously proposed a paradigm shift in genomic data management, based on the Genomic Data Model (GDM) for mediating existing formats and GenoMetric Query Language (GMQL) supporting, at high level of abstraction, extraction most common data-driven computations required by tertiary analysis Next Generation Sequencing datasets. Here, we present new GMQL-based system with enhanced accessibility, portability, scalability performance.The has well-designed modular architecture featuring: (i) an...

10.1093/bioinformatics/bty688 article EN Bioinformatics 2018-08-06

ViruSurf: an integrated database to investigate viral sequences

OPENALEX - Publications

Arif Canakoglu Pietro Pinoli Anna Bernasconi Tommaso Alfonsi Damianos P. Melidis and 1 more

ViruSurf, available at http://gmql.eu/virusurf/, is a large public database of viral sequences and integrated curated metadata from heterogeneous sources (RefSeq, GenBank, COG-UK NMDC); it also exposes computed nucleotide amino acid variants, called original sequences. A GISAID-specific ViruSurf database, http://gmql.eu/virusurf_gisaid/, offers subset these functionalities. Given the current pandemic outbreak, SARS-CoV-2 data are collected four sources; but contains other virus species...

10.1093/nar/gkaa846 article EN cc-by Nucleic Acids Research 2020-09-21

Matrix Factorization-based Technique for Drug Repurposing Predictions

OPENALEX - Publications

Gaia Ceddia Pietro Pinoli Stefano Ceri Marco Masseroli

Classical drug design methodologies are hugely costly and time-consuming, with approximately 85% of the new proposed molecules failing in first three phases FDA approval process. Thus, strategies to find alternative indications for already approved drugs that leverage computational methods crucial relevance. We previously demonstrated efficacy Non-negative Matrix Tri-Factorization, a method allows exploiting both data integration machine learning, infer novel drugs. In this work, we present...

10.1109/jbhi.2020.2991763 article EN IEEE Journal of Biomedical and Health Informatics 2020-05-01

SARS-CoV-2 viremia and COVID-19 mortality: A prospective observational study

OPENALEX - Publications

Andrea Giacomelli Elena Righini Valeria Micheli Pietro Pinoli Anna Bernasconi and 6 more

Background SARS-CoV-2 viremia has been found to be a potential prognostic factor in patients hospitalized for COVID-19. Objective We aimed assess the association between and mortality COVID-19 during different epidemic periods. Methods A prospective registry was queried extract all with an available performed at hospital admission March 2020 January 2022. assessed by means of GeneFinderTM Plus RealAmp Kit assay ELITe MGB ® using <45 cycle threshold define positivity. Uni multivariable...

10.1371/journal.pone.0281052 article EN cc-by PLoS ONE 2023-04-28

Modeling and interoperability of heterogeneous genomic big data for integrative processing and querying

OPENALEX - Publications

Marco Masseroli Abdulrahman Kaitoua Pietro Pinoli Stefano Ceri

10.1016/j.ymeth.2016.09.002 article EN Methods 2016-09-14

VirusViz: comparative analysis and effective visualization of viral nucleotide and amino acid variants

OPENALEX - Publications

Anna Bernasconi Andrea Gulino Tommaso Alfonsi Arif Canakoglu Pietro Pinoli and 2 more

Abstract Variant visualization plays an important role in supporting the viral evolution analysis, extremely valuable during COVID-19 pandemic. VirusViz is a web-based application for comparing variants of selected populations and their sub-populations; it primarily focused on SARS-CoV-2 variants, although tool also supports other species (SARS-CoV, MERS-CoV, Dengue, Ebola). As input, imports results queries extracting metadata from large database ViruSurf, which integrates information about...

10.1093/nar/gkab478 article EN cc-by Nucleic Acids Research 2021-05-24

The ENCODE Imputation Challenge: a critical assessment of methods for cross-cell type imputation of epigenomic profiles

OPENALEX - Publications

Jacob Schreiber Carles Boix Jin wook Lee Hongyang Li Yuanfang Guan and 37 more

A promising alternative to comprehensively performing genomics experiments is to, instead, perform a subset of and use computational methods impute the remainder. However, identifying best imputation what measures meaningfully evaluate performance are open questions. We address these questions by analyzing 23 from ENCODE Imputation Challenge. find that evaluations challenging confounded distributional shifts differences in data collection processing over time, amount available data,...

10.1186/s13059-023-02915-y article EN cc-by Genome biology 2023-04-18

Probabilistic Latent Semantic Analysis for prediction of Gene Ontology annotations

OPENALEX - Publications

Marco Masseroli Davide Chicco Pietro Pinoli

Consistency and completeness of biomolecular annotations is a keypoint correct interpretation biological experiments. Yet, the associations between genes (or proteins) features correctly annotated are just some all existing ones. As time goes by, they increase in number become more useful, but remain incomplete them incorrect. To support quicken their time-consuming curation procedure to improve consistence available annotations, computational methods that able supply ranked list predicted...

10.1109/ijcnn.2012.6252767 article EN 2022 International Joint Conference on Neural Networks (IJCNN) 2012-06-01

Computational algorithms to predict Gene Ontology annotations

OPENALEX - Publications

Pietro Pinoli Davide Chicco Marco Masseroli

Gene function annotations, which are associations between a gene and term of controlled vocabulary describing functional features, paramount importance in modern biology. Datasets these such as the ones provided by Ontology Consortium, used to design novel biological experiments interpret their results. Despite importance, sources information have some known issues. They incomplete, since knowledge is far from being definitive it rapidly evolves, erroneous annotations may be present. Since...

10.1186/1471-2105-16-s6-s4 article EN cc-by BMC Bioinformatics 2015-04-17

Latent Dirichlet Allocation based on Gibbs Sampling for gene function prediction

OPENALEX - Publications

Pietro Pinoli Davide Chicco Marco Masseroli

Gene function annotations are key elements in biology and bioinformatics. A typical annotation is the association between a gene feature term that describes functional of by using controlled vocabulary (e.g. Ontology (GO) term). Unfortunately, available contain errors biologically validated ones incomplete definition, since new knowledge continuously discovered. Thus, computational algorithms which able to provide ranked lists predicted an excellent contribution bioinformatics research....

10.1109/cibcb.2014.6845514 article EN 2014-05-01

Framework for Supporting Genomic Operations

OPENALEX - Publications

Abdulrahman Kaitoua Pietro Pinoli Michele Bertoni Stefano Ceri

Next Generation Sequencing (NGS) is a family of technologies for reading the DNA or RNA, capable producing whole genome sequences at an impressive speed, and causing revolution both biological research medical practice. In this exciting scenario, while huge number specialized bio-informatics programs extract information from sequences, there increasing need new generation systems frameworks integrating such information, providing holistic answers to needs biologists clinicians. To respond...

10.1109/tc.2016.2603980 article EN IEEE Transactions on Computers 2016-08-29

Investigating Deep Learning Based Breast Cancer Subtyping Using Pan-Cancer and Multi-Omic Data

OPENALEX - Publications

Francisco Cristovao Silvia Cascianelli Arif Canakoglu Mark Carman Luca Nanni and 2 more

Breast Cancer comprises multiple subtypes implicated in prognosis. Existing stratification methods rely on the expression quantification of small gene sets. Next Generation Sequencing promises large amounts omic data next years. In this scenario, we explore potential machine learning and, particularly, deep for breast cancer subtyping. Due to paucity publicly available data, leverage pan-cancer and non-cancer design semi-supervised settings. We make use multi-omic including microRNA...

10.1109/tcbb.2020.3042309 article EN IEEE/ACM Transactions on Computational Biology and Bioinformatics 2020-12-03

Enhanced probabilistic latent semantic analysis with weighting schemes to predict genomic annotations

OPENALEX - Publications

Pietro Pinoli Davide Chicco Marco Masseroli

Genomic annotations with functional controlled terms, such as the Gene Ontology (GO) ones, are paramount in modern biology. Yet, they known to be incomplete, since current biological knowledge is far definitive. In this scenario, computational methods that able support and quicken curation of these can very useful. a previous work, we discussed benefits using Probabilistic Latent Semantic Analysis algorithm order predict novel GO annotations, compared some Singular Value Decomposition (SVD)...

10.1109/bibe.2013.6701702 article EN 2013-11-01

Cross-organism learning method to discover new gene functionalities

OPENALEX - Publications

Giacomo Domeniconi Marco Masseroli Gianluca Moro Pietro Pinoli

10.1016/j.cmpb.2015.12.002 article EN Computer Methods and Programs in Biomedicine 2015-12-19

Discovering New Gene Functionalities from Random Perturbations of Known Gene Ontological Annotations

OPENALEX - Publications

Giacomo Domeniconi Marco Masseroli Gianluca Moro Pietro Pinoli

Genomic annotations describing functional features of genes and proteins through controlled terminologies ontologies are extremely valuable, especially for computational analyses aimed at inferring new biomedical knowledge. Thanks to the biology revolution led by introduction novel DNA sequencing technologies, several repositories such have becoming available in last decade; among them, ones including Gene Ontology most relevant. Nevertheless, set genomic is incomplete, only some represent...

10.5220/0005087801070116 article EN cc-by-nc-nd 2014-01-01

Metadata management for scientific databases

OPENALEX - Publications

Pietro Pinoli Stefano Ceri Davide Martinenghi Luca Nanni

Most scientific databases consist of datasets (or sources) which in turn include samples files) with an identical structure schema). In many cases, are associated rich metadata, describing the process that leads to building them (e.g.: experimental conditions used during sample generation). Metadata typically computations just for initial data selection; at most, metadata about query results is recovered after executing query, and its by post-processing. this way, a large body information...

10.1016/j.is.2018.10.002 article EN cc-by-nc-nd Information Systems 2018-11-15

ViruClust: direct comparison of SARS-CoV-2 genomes and genetic variants in space and time

OPENALEX - Publications

Luca Cilibrasi Pietro Pinoli Anna Bernasconi Arif Canakoglu Matteo Chiara and 1 more

The ongoing evolution of SARS-CoV-2 and the rapid emergence variants concern at distinct geographic locations have relevant implications for implementation strategies controlling COVID-19 pandemic. Combining growing body data evidence on potential functional mutations can suggest highly effective methods prioritization novel concern, e.g. increasing in frequency locally and/or globally. However, these analyses may be complex, requiring integration different resources. We claim need a...

10.1093/bioinformatics/btac030 article EN Bioinformatics 2022-01-13

The molecular basis of the anticancer effect of statins

OPENALEX - Publications

Giovanni Buccioli Carolina Testa Emanuela Jacchetti Pietro Pinoli Stephana Carelli and 2 more

Abstract Statins, widely used cardiovascular drugs that lower cholesterol by inhibiting HMG-CoA reductase, have been increasingly recognized for their potential anticancer properties. This study elucidates the underlying mechanism, revealing statins exploit Synthetic Lethality, a principle where co-occurrence of two non-lethal events leads to cell death. Our computational analysis approximately 37,000 SL pairs identified as targeting genes involved in with metastatic genes. In vitro...

10.1038/s41598-024-71240-6 article EN cc-by Scientific Reports 2024-08-31

Evaluating cloud frameworks on genomic applications

OPENALEX - Publications

Michele Bertoni Stefano Ceri Abdulrahman Kaitoua Pietro Pinoli

We are developing a new, holistic data management system for genomics, which uses cloud-based computing querying thousands of heterogeneous genomic datasets. In our project, it is essential to leverage upon modern cloud framework, so as encode query expressions into high-level operations provided by the framework. After releasing first implementation using Pig and Hadoop 1, we currently targeting Spark Flink, two emerging frameworks general-purpose big analytics. While appears have stronger...

10.1109/bigdata.2015.7363756 article EN 2021 IEEE International Conference on Big Data (Big Data) 2015-10-01

EpiSurf: metadata-driven search server for analyzing amino acid changes within epitopes of SARS-CoV-2 and other viral species

OPENALEX - Publications

Anna Bernasconi Luca Cilibrasi Ruba Al Khalaf Tommaso Alfonsi Stefano Ceri and 2 more

EpiSurf is a Web application for selecting viral populations of interest and then analyzing how their amino acid changes are distributed along epitopes. Viral sequences searched within ViruSurf, which stores curated metadata imported from the most widely used deposition sources databases (GenBank, COVID-19 Genomics UK (COG-UK) Global initiative on sharing all influenza data (GISAID)). Epitopes open source Immune Epitope Database or directly proposed by users indicating start stop positions...

10.1093/database/baab059 article EN cc-by Database 2021-09-01

Coming Soon ...