Emanuel Weitschek

ORCID: 0000-0002-8045-2925
Publications
Citations
Views
---
Saved
---
About
Contact & Profiles
Research Areas
  • Gene expression and cancer classification
  • Genomics and Phylogenetic Studies
  • Cancer Genomics and Diagnostics
  • Machine Learning in Bioinformatics
  • Algorithms and Data Compression
  • Bioinformatics and Genomic Networks
  • Molecular Biology Techniques and Applications
  • Ferroelectric and Negative Capacitance Devices
  • Identification and Quantification in Food
  • Environmental DNA in Biodiversity Studies
  • RNA modifications and cancer
  • Epigenetics and DNA Methylation
  • RNA and protein synthesis mechanisms
  • Scientific Computing and Data Management
  • Genetic factors in colorectal cancer
  • EEG and Brain-Computer Interfaces
  • Advanced Biosensing Techniques and Applications
  • Functional Brain Connectivity Studies
  • ECG Monitoring and Analysis
  • Semantic Web and Ontologies
  • Biomedical Text Mining and Ontologies
  • Biosensors and Analytical Detection
  • Evolutionary Algorithms and Applications
  • Data Mining Algorithms and Applications
  • Advanced Data Storage Technologies

UniNettuno University
2015-2024

National Research Council
2012-2019

National Academies of Sciences, Engineering, and Medicine
2012-2018

Istituto di Analisi dei Sistemi ed Informatica Antonio Ruberti
2009-2018

Institute for Systems Analysis
2018

Roma Tre University
2011-2014

Recently diverged species are challenging for identification, yet they frequently of special interest scientifically as well from a regulatory perspective. DNA barcoding has proven instrumental in especially insects and vertebrates, but the identification recently it been reported to be problematic some cases. Problems mostly due incomplete lineage sorting or simply lack 'barcode gap' probably related large effective population size and/or low mutation rate. Our objective was compare six...

10.1371/journal.pone.0030490 article EN cc-by PLoS ONE 2012-01-17

The identification of early and stage-specific biomarkers for Alzheimer's disease (AD) is critical, as the development disease-modification therapies may depend on discovery validation such markers. reliable depends new diagnostic algorithms to computationally exploit information in large biological datasets. To identify potential from mRNA expression profile data, we used Logic Mining method unbiased analysis a microarray dataset anti-NGF AD11 transgenic mouse model. gene brain regions was...

10.3233/jad-2011-101881 article EN Journal of Alzheimer s Disease 2011-05-30

Alzheimer's Disease (AD) is a neurodegenaritive disorder characterized by progressive dementia, for which actually no cure known. An early detection of patients affected AD can be obtained analyzing their electroencephalography (EEG) signals, show reduction the complexity, perturbation synchrony, and slowing down rhythms. In this work, we apply procedure that exploits feature extraction classification techniques to EEG whose aim distinguish patient from ones Mild Cognitive Impairment (MCI)...

10.1186/s12911-018-0613-y article EN cc-by BMC Medical Informatics and Decision Making 2018-05-31

Electroencephalography (EEG) signal analysis is a fast, inexpensive, and accessible technique to detect the early stages of dementia, such as Mild Cognitive Impairment (MCI) Alzheimer’s disease (AD). In last years, EEG has become an important topic research extract suitable biomarkers determine subject’s cognitive impairment. this work, we propose novel simple efficient method able features with finite response filter (FIR) in double time domain order discriminate among patients affected by...

10.3390/app12115413 article EN cc-by Applied Sciences 2022-05-26

Specific fragments, coming from short portions of DNA (e.g., mitochondrial, nuclear, and plastid sequences), have been defined as Barcode can be used markers for organisms the main life kingdoms. Species classification with sequences has proven effective on different organisms. Indeed, specific gene regions identified Barcode: COI in animals, rbcL matK plants, ITS fungi. The problem assigns an unknown specimen to a known species by analyzing its Barcode. This task supported reliable methods...

10.1186/1756-0381-7-4 article EN cc-by BioData Mining 2014-04-11

Abstract BLOG (Barcoding with LOG ic) is a diagnostic and character‐based DNA Barcode analysis method. Its aim to classify specimens species based on sequences supervised machine learning approach, using classification rules that compactly characterize in terms of locations key nucleotides. The 2.0 software, its fundamental modules, online/offline user interfaces recent improvements are described. These affect both methodology software design, lead the availability different releases website...

10.1111/1755-0998.12073 article EN Molecular Ecology Resources 2013-01-28

Abstract Background According to many field experts, specimens classification based on morphological keys needs be supported with automated techniques the analysis of DNA fragments. The most successful results in this area are those obtained from a particular fragment mitochondrial DNA, gene cytochrome c oxidase I (COI) (the "barcode"). Since 2004 Consortium for Barcode Life (CBOL) promotes collection barcode and development methods analyze several tasks, among which identification rules...

10.1186/1471-2105-10-s14-s7 article EN cc-by BMC Bioinformatics 2009-11-01

Alzheimer's Disease (AD) and its preliminary stage - Mild Cognitive Impairment (MCI) are the most widespread neurodegenerative disorders, their investigation remains an open challenge. ElectroEncephalography (EEG) appears as a non-invasive repeatable technique to diagnose brain abnormalities. Despite technical advances, analysis of EEG spectra is usually carried out by experts that must manually perform laborious interpretations. Computational methods may lead quantitative these signals...

10.1109/cidm.2014.7008655 article EN 2014-12-01

Data extraction and integration methods are becoming essential to effectively access take advantage of the huge amounts heterogeneous genomics clinical data increasingly available. In this work, we focus on The Cancer Genome Atlas, a comprehensive archive tumoral containing results high-throughout experiments, mainly Next Generation Sequencing, for more than 30 cancer types.We propose TCGA2BED software tool search retrieve TCGA data, convert them in structured BED format their seamless use...

10.1186/s12859-016-1419-5 article EN cc-by BMC Bioinformatics 2017-01-03

Abstract Motivation: Nowadays, knowledge extraction methods from Next Generation Sequencing data are highly requested. In this work, we focus on RNA-seq gene expression analysis and specifically case–control studies with rule-based supervised classification algorithms that build a model able to discriminate cases controls. State of the art compute single contains few features (genes). On contrary, our goal is elicit higher amount by computing many models, therefore identify most genes...

10.1093/bioinformatics/btv635 article EN cc-by-nc Bioinformatics 2015-10-30

In the Next Generation Sequencing (NGS) era a large amount of biological data is being sequenced, analyzed, and stored in many public databases, whose interoperability often required to allow an enhanced accessibility. The combination heterogeneous NGS genomic open challenge: analysis from different experiments fundamental practice for study diseases. this work, we propose combine DNA methylation RNA sequencing at gene level supervised knowledge extraction cancer.We retrieve datasets Cancer...

10.1186/s13040-018-0184-6 article EN cc-by BioData Mining 2018-10-25

The wide spread of electronic data collection in medical environments leads to an exponential growth clinical extracted from heterogeneous patient samples. Collecting, managing, integrating and analyzing these are essential activities order shed light on diseases related therapies. major issues analysis the incompleteness (missing values), different adopted measure scales, integration disparate procedures. Therefore, main challenges managing data, discovering patients interactions, sources....

10.1109/dexa.2013.42 article EN 2013-08-01

Next Generation Sequencing technologies have produced a substantial increase of publicly available genomic data and related clinical/biospecimen information. New models methods to easily access, integrate search them effectively are needed. An effort was made by the Genomic Data Commons (GDC), which defined strict procedures for harmonizing clinical cancer, created GDC portal with its application programming interface (API). In this work, we enhance harmonization applying state art model...

10.3390/app10186367 article EN cc-by Applied Sciences 2020-09-12

Big Data technologies have significantly increased the possibility for sellers to adopt personalisation strategies, especially in digital markets. Among such price discrimination, a practice where same commodity is sold at different prices, either customer or customers, stands out. Particularly, online airline ticket market has risen attention of economists recent studies, both because its specificity and high data availability. This manuscript enters debate analyses an original way. Indeed,...

10.3390/jtaer16060126 article EN cc-by Journal of theoretical and applied electronic commerce research 2021-09-09

Microarray Logic Analyzer (MALA) is a clustering and classification software, particularly engineered for microarray gene expression analysis. The aims of MALA are to cluster the profiles in order reduce amount data be analyzed classify experiments. To fulfil this objective uses machine learning process based methodology, that relies on 1) Discretization, 2) Gene clustering, 3) Feature selection, 4) Formulas computation,5) Classification. In paper we describe software design, different...

10.1109/dexa.2012.29 article EN 2012-09-01

Next Generation Sequencing (NGS) machines extract from a biological sample large number of short DNA fragments (reads). These reads are then used for several applications, e.g., sequence reconstruction, assembly, gene expression profiling, mutation analysis. We propose method to evaluate the similarity between reads. This does not rely on alignment and it is based distance frequencies their substrings fixed dimensions (k-mers). compare this alignment-free with measures derived two methods:...

10.1186/1756-0500-7-869 article EN cc-by BMC Research Notes 2014-01-01

Alignment-free algorithms can be used to estimate the similarity of biological sequences and hence are often applied phylogenetic reconstruction genomes. Most these rely on comparing frequency all distinct substrings fixed length (k-mers) that occur in analyzed sequences. In this paper, we present Logic Alignment Free (LAF), a method combines alignment-free techniques rule-based classification order assign samples their taxa. This searches for minimal subset k-mers whose relative frequencies...

10.1186/s13040-015-0073-1 article EN cc-by BioData Mining 2015-06-01

Continuous improvements in next generation sequencing technologies led to ever-increasing collections of genomic sequences, which have not been easily characterized by biologists, and whose analysis requires huge computational effort. The classification species emerged as one the main applications DNA has addressed with several approaches, e.g., multiple alignments-, phylogenetic trees-, statistical- character-based methods.We propose a supervised method based on genetic algorithm identify...

10.1186/s13040-016-0116-2 article EN cc-by BioData Mining 2016-12-01

Differences in genomic sequences are crucial for the classification of viruses into different species. In this work, viral DNA belonging to human polyomaviruses BKPyV, JCPyV, KIPyV, WUPyV, and MCPyV analyzed using a logic data mining method order identify nucleotides which able distinguish five polyomaviruses. The approach presented work is successful as it discovers several rules that effectively characterize studied individuated separate precisely one type from other assign an unknown...

10.1186/1743-422x-9-58 article EN cc-by Virology Journal 2012-03-02

The high growth of Next Generation Sequencing data currently demands new knowledge extraction methods. In particular, the RNA sequencing gene expression experimental technique stands out for case-control studies on cancer, which can be addressed with supervised machine learning techniques able to extract human interpretable models composed genes, and their relation investigated disease. State art rule-based classifiers are designed a single classification model, possibly few relevant genes....

10.1186/s12859-018-2299-7 article EN cc-by BMC Bioinformatics 2018-10-01

Alzheimer's Disease (AD) is the most widespread and incurable neurodegenerative disorder, together with its preliminary stage - Mild Cognitive Impairment (MCI) detection still remains a challenging issue. Electroencephalography (EEG) non-invasive repeatable technique to diagnose brain abnormalities. However, analysis of EEG spectra carried out manually by experts effective computer science methods extract relevant information from these signals become necessity. Through data mining approach,...

10.1109/bibm.2018.8621473 article EN 2021 IEEE International Conference on Bioinformatics and Biomedicine (BIBM) 2018-12-01
Coming Soon ...