Asa Ben‐Hur

ORCID: 0000-0001-8269-6942
Publications
Citations
Views
---
Saved
---
About
Contact & Profiles
Research Areas
  • Bioinformatics and Genomic Networks
  • Machine Learning in Bioinformatics
  • RNA Research and Splicing
  • RNA modifications and cancer
  • RNA and protein synthesis mechanisms
  • Protein Structure and Dynamics
  • Gene expression and cancer classification
  • Computational Drug Discovery Methods
  • Genomics and Phylogenetic Studies
  • Genomics and Chromatin Dynamics
  • Biomedical Text Mining and Ontologies
  • Advanced Clustering Algorithms Research
  • Face and Expression Recognition
  • Integrated Circuits and Semiconductor Failure Analysis
  • Advancements in Photolithography Techniques
  • Particle accelerators and beam dynamics
  • Computability, Logic, AI Algorithms
  • Cellular Automata and Applications
  • Prion Diseases and Protein Misfolding
  • Gene Regulatory Network Analysis
  • Evolutionary Algorithms and Applications
  • Machine Learning in Materials Science
  • Theoretical and Computational Physics
  • Machine Learning and Algorithms
  • Neurological diseases and metabolism

Colorado State University
2016-2025

Sir Charles Gairdner Hospital
2023

University of Washington
2004-2008

École Nationale Supérieure des Mines de Paris
2007

Stanford University
2001-2006

Bradley University
2006

Howard Hughes Medical Institute
2005

University of Zurich
2005

Technion – Israel Institute of Technology
1999-2003

Hebrew University of Jerusalem
1996

We present a novel clustering method using the approach of support vector machines. Data points are mapped by means Gaussian kernel to high dimensional feature space, where we search for minimal enclosing sphere. This sphere, when back data can separate into several components, each cluster points. simple algorithm identifying these clusters. The width controls scale at which is probed while soft margin constant helps coping with outliers and overlapping structure dataset explored varying...

10.5555/944790.944807 article EN Journal of Machine Learning Research 2002-03-01
Predrag Radivojac Wyatt T. Clark Tal Oron Alexandra M. Schnoes Tobias Wittkop and 95 more Artem Sokolov Kiley Graim Christopher S. Funk Karin Verspoor Asa Ben‐Hur Gaurav Pandey Jeffrey M. Yunes Ameet Talwalkar Susanna Repo Michael L Souza Damiano Piovesan Rita Casadio Zheng Wang Jianlin Cheng Hai Fang Julian Gough Patrik Koskinen Petri Törönen Jussi Nokso-Koivisto Liisa Holm Domenico Cozzetto Daniel Buchan Kevin Bryson David T. Jones Bhakti Limaye Harshal Inamdar Avik Datta Sunitha K Manjari Rajendra Joshi Meghana Chitale Daisuke Kihara Andreas Martin Lisewski Serkan Erdin Eric Venner Olivier Lichtarge Robert Rentzsch Haixuan Yang Alfonso E. Romero Prajwal Bhat Alberto Paccanaro Tobias Hamp Rebecca Kaßner Stefan Seemayer Esmeralda Vicedo Christian Schaefer Dominik Achten Florian Auer Ariane C. Boehm Tatjana Braun Maximilian Hecht B. Mark Heron Peter Hönigschmid Thomas A. Hopf Stefanie Kaufmann Michael Kiening Denis Krompaß Cedric Landerer Yannick Mahlich Manfred Roos Jari Björne Tapio Salakoski Andrew Wong Hagit Shatkay Fanny Gatzmann I. Sommer Mark N. Wass Michael J.E. Sternberg Nives Škunca Fran Supek Matko Bošnjak Panče Panov Sašo Džeroski Tomislav Šmuc Yiannis Kourmpetis Aalt D. J. van Dijk Cajo J. F. ter Braak Yuanpeng Zhou Qingtian Gong Xinran Dong Weidong Tian Marco Falda Paolo Fontana Enrico Lavezzo Barbara Di Camillo Stefano Toppo Liang Lan Nemanja Djuric Yuhong Guo Slobodan Vučetić Amos Bairoch Michal Linial Patricia C. Babbitt Steven E. Brenner Christine Orengo Burkhard Rost

Automated annotation of protein function is challenging. As the number sequenced genomes rapidly grows, overwhelming majority products can only be annotated computationally. If computational predictions are to relied upon, it crucial that accuracy these methods high. Here we report results from first large-scale community-based critical assessment (CAFA) experiment. Fifty-four representing state art for prediction were evaluated on a target set 866 proteins 11 organisms. Two findings stand...

10.1038/nmeth.2340 article EN cc-by-nc-sa Nature Methods 2013-01-27

10.1007/978-1-60327-241-4_13 article EN Methods in molecular biology 2009-10-30

Motivation: Despite advances in high-throughput methods for discovering protein–protein interactions, the interaction networks of even well-studied model organisms are sketchy at best, highlighting continued need computational to help direct experimentalists search novel interactions.

10.1093/bioinformatics/bti1016 article EN Bioinformatics 2005-06-01

Abstract Alternative splicing and alternative polyadenylation (APA) of pre-mRNAs greatly contribute to transcriptome diversity, coding capacity a genome gene regulatory mechanisms in eukaryotes. Second-generation sequencing technologies have been extensively used analyse transcriptomes. However, major limitation short-read data is that it difficult accurately predict full-length splice isoforms. Here we sequenced the sorghum using Pacific Biosciences single-molecule real-time long-read...

10.1038/ncomms11706 article EN cc-by Nature Communications 2016-06-24
Yuxiang Jiang Tal Oron Wyatt T. Clark Asma Bankapur Daniel D’Andrea and 95 more Rosalba Lepore Christopher S. Funk Indika Kahanda Karin Verspoor Asa Ben‐Hur Da Chen Emily Koo Duncan Penfold-Brown Dennis Shasha Noah Youngs Richard Bonneau Alexandra J. Lin Sayed Mohammad Ebrahim Sahraeian Pier Luigi Martelli Giuseppe Profiti Rita Casadio Renzhi Cao Zhaolong Zhong Jianlin Cheng Adrian Altenhoff Nives Škunca Christophe Dessimoz Tunca Doğan Kai Hakala Suwisa Kaewphan Farrokh Mehryary Tapio Salakoski Filip Ginter Hai Fang Ben Smithers Matt E. Oates Julian Gough Petri Törönen Patrik Koskinen Liisa Holm Ching-Tai Chen Wen−Lian Hsu Kevin Bryson Domenico Cozzetto Federico Minneci David T. Jones Samuel Chapman Dukka Bkc Ishita Khan Daisuke Kihara Dan Ofer Nadav Rappoport Amos Stern Elena Cibrián–Uhalte Paul Denny Rebecca E. Foulger Reija Hieta Duncan Legge Ruth C. Lovering Michele Magrane Anna N. Melidoni Prudence Mutowo Klemens Pichler Aleksandra Shypitsyna Biao Li Pooya Zakeri Sarah ElShal Léon-Charles Tranchevent Sayoni Das Natalie L. Dawson David Lee Jonathan Lees Ian Sillitoe Prajwal Bhat Tamás Nepusz Alfonso E. Romero Rajkumar Sasidharan Haixuan Yang Alberto Paccanaro Jesse Gillis Adriana E. Sedeño-Cortés Paul Pavlidis Shou Feng Juan Miguel Cejuela Tatyana Goldberg Tobias Hamp Lothar Richter Asaf Salamov Toni Gabaldón Marina Marcet‐Houben Fran Supek Qingtian Gong Wei Ning Yuanpeng Zhou Weidong Tian Marco Falda Paolo Fontana Enrico Lavezzo Stefano Toppo Carlo Ferrari Manuel Giollo

A major bottleneck in our understanding of the molecular underpinnings life is assignment function to proteins. While experiments provide most reliable annotation proteins, their relatively low throughput and restricted purview have led an increasing role for computational prediction. However, assessing methods protein prediction tracking progress field remain challenging.We conducted second critical assessment functional (CAFA), a timed challenge assess that automatically assign function....

10.1186/s13059-016-1037-6 article EN cc-by Genome biology 2016-09-07

Metabolomic data are frequently acquired using chromatographically coupled mass spectrometry (MS) platforms. For such datasets, the first step in analysis relies on feature detection, where a is defined by and retention time. While typically derived from single compound, spectrum of signals more more-accurate representation spectrometric signal for given metabolite. Here, we report novel grouping method that operates an unsupervised manner to group MS into spectra without relying...

10.1021/ac501530d article EN Analytical Chemistry 2014-06-13

Deep learning architectures have recently demonstrated their power in predicting DNA- and RNA-binding specificity. Existing methods fall into three classes: Some are based on convolutional neural networks (CNNs), others use recurrent (RNNs) rely hybrid combining CNNs RNNs. However, existing studies the relative merit of various remains unclear.In this study we present a systematic exploration deep for For purpose, deepRAM, an end-to-end tool that provides implementation wide selection...

10.1093/bioinformatics/btz339 article EN Bioinformatics 2019-05-14

Abstract Microbial breakdown of organic matter is one the most important processes on Earth, yet controls decomposition are poorly understood. Here we track 36 terrestrial human cadavers in three locations and show that a phylogenetically distinct, interdomain microbial network assembles during despite selection effects location, climate season. We generated metagenome-assembled genome library from cadaver-associated soils integrated it with metabolomics data to identify links between...

10.1038/s41564-023-01580-y article EN cc-by Nature Microbiology 2024-02-12

We carried out a large-scale screen to identify interactions between integral membrane proteins of Saccharomyces cerevisiae by using modified split-ubiquitin technique. Among 705 annotated as membrane, we identified 1,985 putative involving 536 proteins. To ascribe confidence levels the interactions, used support vector machine algorithm classify based on assay results and protein data derived from literature. Previously computationally supported were train machine, which 131 highest...

10.1073/pnas.0505482102 article EN Proceedings of the National Academy of Sciences 2005-08-10

Understanding the genetic basis of HIV-1 drug resistance is essential to developing new antiretroviral drugs and optimizing use existing drugs. This understanding, however, hampered by large numbers mutation patterns associated with cross-resistance within each class. We used five statistical learning methods (decision trees, neural networks, support vector regression, least-squares least angle regression) relate protease reverse transcriptase mutations in vitro susceptibility 16 Learning...

10.1073/pnas.0607274103 article EN Proceedings of the National Academy of Sciences 2006-10-26

Abstract The protein-protein interaction networks of even well-studied model organisms are sketchy at best, highlighting the continued need for computational methods to help direct experimentalists in search novel interactions. This has prompted development a number predicting interactions based on various sources data and methodologies. common method choosing negative examples training predictor is annotations cellular localization, observation that pairs proteins have different...

10.1186/1471-2105-7-s1-s2 article EN cc-by BMC Bioinformatics 2006-03-01

Clustering is one of the most commonly used tools in analysis gene expression data (1,2). The usage grouping genes based on premise that coexpression a result coregulation. It often as preliminary step extracting networks and inference function (3,4). experiments can be to discover novel phenotypic aspects cells tissues (3,5,6), including sensitivity drugs (7), also detect artifacts experimental conditions (8). its applications biology are presented greater detail Chapter 13 (see ref. 9)....

10.1385/1-59259-364-x:159 article EN Humana Press eBooks 2003-11-15

Remote homology detection is the problem of detecting in cases low sequence similarity. It a hard computational with no approach that works well all cases.We present method for remote based on presence discrete motifs. The motif content pair sequences used to define similarity as kernel Support Vector Machine (SVM) classifier. We test two tasks: prediction previously unseen SCOP family and an enzyme class given other enzymes have similar function substrates. find it performs significantly...

10.1093/bioinformatics/btg1002 article EN Bioinformatics 2003-07-03

We propose a method for predicting splice graphs that enhances curated gene models using evidence from RNA-Seq and EST alignments. Results obtained experiments in Arabidopsis thaliana show predictions made by our SpliceGrapher are more consistent with current than TAU Cufflinks. Furthermore, analysis of plant human data indicates the machine learning approach used is useful discriminating between real spurious sites, can improve reliability detection alternative splicing. available download...

10.1186/gb-2012-13-1-r4 article EN cc-by Genome biology 2012-01-31

Prions are important disease agents and epigenetic regulatory elements. Prion formation involves the structural conversion of proteins from a soluble form into an insoluble amyloid form. In many cases, this is driven by glutamine/asparagine (Q/N)-rich prion-forming domain. However, our understanding sequence requirements for prion propagation Q/N-rich domains has been insufficient accurate propensity prediction or domain design. By focusing exclusively on amino acid composition, we have...

10.1073/pnas.1119366109 article EN Proceedings of the National Academy of Sciences 2012-04-02

We present a novel partner-specific protein-protein interaction site prediction method called PAIRpred. Unlike most existing machine learning binding methods, PAIRpred uses information from both proteins in protein complex to predict pairs of interacting residues the two proteins. captures sequence and structure about residue through pairwise kernels that are used for training support vector classifier. As result, presents more detailed model binding, offers state art accuracy predicting...

10.1002/prot.24479 article EN Proteins Structure Function and Bioinformatics 2013-11-16

Drought is a major limiting factor of crop yields. In response to drought, plants reprogram their gene expression, which ultimately regulates multitude biochemical and physiological processes. The timing this reprogramming the nature drought-regulated genes in different genotypes are thought confer differential tolerance drought stress. Sorghum highly drought-tolerant has been increasingly used as model cereal identify that tolerance. Also, there considerable natural variation resistance...

10.3390/ijms21030772 article EN International Journal of Molecular Sciences 2020-01-24

Abiotic stresses affect plant physiology, development, growth, and alter pre-mRNA splicing. Western poplar is a model woody tree potential bioenergy feedstock. To investigate the extent of stress-regulated alternative splicing, we conducted an in-depth survey leaf, root, stem xylem transcriptomes under drought, salt, or temperature stress. Analysis approximately one billion genome-aligned RNA-Seq reads from tissue- stress-specific libraries revealed over fifteen millions novel splice...

10.3389/fpls.2018.00005 article EN cc-by Frontiers in Plant Science 2018-02-12

Plant SR45 and its metazoan ortholog RNPS1 are serine/arginine-rich (SR)-like RNA binding proteins that function in splicing/postsplicing events regulate diverse processes eukaryotes. Interactions of with both RNAs crucial for regulating processing. However, vivo targets currently unclear. Using immunoprecipitation followed by high-throughput sequencing, we identified over 4000 Arabidopsis thaliana directly or indirectly associate SR45, designated as SR45-associated (SARs). Comprehensive...

10.1105/tpc.15.00641 article EN The Plant Cell 2015-11-24

Summary Comprehensive, automated software testing requires an oracle to check whether the output produced by a test case matches expected behaviour of programme. But challenges in creating suitable oracles limit ability perform some programmes, and especially scientific software. Metamorphic is method for automating process programmes without oracles. This technique operates checking programme behaves according properties called metamorphic relations . A relation describes change when input...

10.1002/stvr.1594 article EN Software Testing Verification and Reliability 2015-11-16
Coming Soon ...