NFDI4DS | UHH-SEMS - Publication Details

David Belanger

ORCID: 0000-0001-9673-1630

Publications

Citations

Views

---

Saved

---

About

Contact & Profiles

A5103250966

Research Areas

Topic Modeling
Natural Language Processing Techniques
Catalytic Alkyne Reactions
Machine Learning in Bioinformatics
Machine Learning and Algorithms
Machine Learning in Materials Science
Microtubule and mitosis dynamics
Domain Adaptation and Few-Shot Learning
Cancer-related Molecular Pathways
Handwritten Text Recognition Techniques
Generative Adversarial Networks and Image Synthesis
Genomics and Phylogenetic Studies
Semantic Web and Ontologies
Synthetic Organic Chemistry Methods
Advanced Image Processing Techniques
Machine Learning and Data Classification
Algorithms and Data Compression
Data Quality and Management
Advanced Multi-Objective Optimization Algorithms
Cyclopropane Reaction Mechanisms
Asymmetric Synthesis and Catalysis
Mass Spectrometry Techniques and Applications
Asymmetric Hydrogenation and Catalysis
Catalytic C–H Functionalization Methods
Protein Structure and Dynamics

Google (United States)
2016-2021

Ghent University
2021

University of Massachusetts Amherst
2014-2020

Stevens Institute of Technology
2019-2020

Novartis (United States)
2018-2019

Buckingham Browne & Nichols
2012

Merck & Co., Inc., Rahway, NJ, USA (United States)
2010-2011

RTX (United States)
2010-2011

Chevron (Netherlands)
2006

Montana State University
1998-2003

Fast and Accurate Entity Recognition with Iterated Dilated Convolutions

OPENALEX - Publications

Emma Strubell Patrick Verga David Belanger Andrew McCallum

Today when many practitioners run basic NLP on the entire web and large-volume traffic, faster methods are paramount to saving time energy costs. Recent advances in GPU hardware have led emergence of bi-directional LSTMs as a standard method for obtaining per-token vector representations serving input labeling tasks such NER (often followed by prediction linear-chain CRF). Though expressive accurate, these models fail fully exploit parallelism, limiting their computational efficiency. This...

10.18653/v1/d17-1283 article EN cc-by Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing 2017-01-01

Sequential regulatory activity prediction across chromosomes with convolutional neural networks

OPENALEX - Publications

David R. Kelley Yakir Reshef Maxwell L. Bileschi David Belanger Cory Y. McLean and 1 more

Models for predicting phenotypic outcomes from genotypes have important applications to understanding genomic function and improving human health. Here, we develop a machine-learning system predict cell-type–specific epigenetic transcriptional profiles in large mammalian genomes DNA sequence alone. By use of convolutional neural networks, this identifies promoters distal regulatory elements synthesizes their content make effective gene expression predictions. We show that model predictions...

10.1101/gr.227819.117 article EN cc-by-nc Genome Research 2018-03-27

Rethinking Attention with Performers

OPENALEX - Publications

Krzysztof Choromański Valerii Likhosherstov D. Dohan Xingyou Song Andreea Gane and 8 more

We introduce Performers, Transformer architectures which can estimate regular (softmax) full-rank-attention Transformers with provable accuracy, but using only linear (as opposed to quadratic) space and time complexity, without relying on any priors such as sparsity or low-rankness. To approximate softmax attention-kernels, Performers use a novel Fast Attention Via positive Orthogonal Random features approach (FAVOR+), may be of independent interest for scalable kernel methods. FAVOR+ also...

10.48550/arxiv.2009.14794 preprint EN other-oa arXiv (Cornell University) 2020-01-01

Chains of Reasoning over Entities, Relations, and Text using Recurrent Neural Networks

OPENALEX - Publications

Rajarshi Das Arvind Neelakantan David Belanger Andrew McCallum

Rajarshi Das, Arvind Neelakantan, David Belanger, Andrew McCallum. Proceedings of the 15th Conference European Chapter Association for Computational Linguistics: Volume 1, Long Papers. 2017.

10.18653/v1/e17-1013 article EN cc-by 2017-01-01

Ask the GRU

OPENALEX - Publications

Trapit Bansal David Belanger Andrew McCallum

In a variety of application domains the content to be recommended users is associated with text. This includes research papers, movies plot summaries, news articles, blog posts, etc. Recommendation approaches based on latent factor models can extended naturally leverage text by employing an explicit mapping from factors. enables recommendations for new, unseen content, and may generalize better, since factors all items are produced compactly-parametrized model. Previous work has used topic...

10.1145/2959100.2959180 preprint EN 2016-09-01

Rapid Prediction of Electron–Ionization Mass Spectrometry Using Neural Networks

OPENALEX - Publications

Jennifer N. Wei David Belanger Ryan P. Adams D. Sculley

When confronted with a substance of unknown identity, researchers often perform mass spectrometry on the sample and compare observed spectrum to library previously collected spectra identify molecule. While popular, this approach will fail molecules that are not in existing library. In response, we propose improve library's coverage by augmenting it synthetic predicted from candidate using machine learning. We contribute lightweight neural network model quickly predicts for small molecules,...

10.1021/acscentsci.9b00085 article EN publisher-specific-oa ACS Central Science 2019-03-19

Multilingual Relation Extraction using Compositional Universal Schema

OPENALEX - Publications

Patrick Verga David Belanger Emma Strubell Benjamin Roth Andrew McCallum

Patrick Verga, David Belanger, Emma Strubell, Benjamin Roth, Andrew McCallum. Proceedings of the 2016 Conference North American Chapter Association for Computational Linguistics: Human Language Technologies. 2016.

10.18653/v1/n16-1103 article EN cc-by Proceedings of the 2022 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies 2016-01-01

Learning Latent Permutations with Gumbel-Sinkhorn Networks

OPENALEX - Publications

Gonzalo E. Mena David Belanger Scott W. Linderman Jasper Snoek

Permutations and matchings are core building blocks in a variety of latent variable models, as they allow us to align, canonicalize, sort data. Learning such models is difficult, however, because exact marginalization over these combinatorial objects intractable. In response, this paper introduces collection new methods for end-to-end learning that approximate discrete maximum-weight matching using the continuous Sinkhorn operator. iteration attractive it functions simple, easy-to-implement...

10.48550/arxiv.1802.08665 preprint EN other-oa arXiv (Cornell University) 2018-01-01

Boundless: Generative Adversarial Networks for Image Extension

OPENALEX - Publications

Dilip Krishnan Piotr Teterwak Aaron Sarna Aaron Maschinot Ce Liu and 2 more

Image extension models have broad applications in image editing, computational photography and computer graphics. While inpainting has been extensively studied the literature, it is challenging to directly apply state-of-the-art methods as they tend generate blurry or repetitive pixels with inconsistent semantics. We introduce semantic conditioning discriminator of a generative adversarial network (GAN), achieve strong results on coherent semantics visually pleasing colors textures. also...

10.1109/iccv.2019.01062 article EN 2021 IEEE/CVF International Conference on Computer Vision (ICCV) 2019-10-01

Using Deep Learning to Annotate the Protein Universe

OPENALEX - Publications

Maxwell L. Bileschi David Belanger Drew Bryant Theo Sanderson Brandon Carter and 3 more

Abstract Understanding the relationship between amino acid sequence and protein function is a long-standing problem in molecular biology with far-reaching scientific implications. Despite six decades of progress, state-of-the-art techniques cannot annotate 1/3 microbial sequences, hampering our ability to exploit sequences collected from diverse organisms. In this paper, we explore an alternative methodology based on deep learning that learns unaligned their functional annotations across all...

10.1101/626507 preprint EN bioRxiv (Cold Spring Harbor Laboratory) 2019-05-03

Masked Language Modeling for Proteins via Linearly Scalable Long-Context Transformers

OPENALEX - Publications

Krzysztof Choromański Valerii Likhosherstov D. Dohan Xingyou Song Andreea Gane and 6 more

Transformer models have achieved state-of-the-art results across a diverse range of domains. However, concern over the cost training attention mechanism to learn complex dependencies between distant inputs continues grow. In response, solutions that exploit structure and sparsity learned matrix blossomed. real-world applications involve long sequences, such as biological sequence analysis, may fall short meeting these assumptions, precluding exploration models. To address this challenge, we...

10.48550/arxiv.2006.03555 preprint EN other-oa arXiv (Cornell University) 2020-01-01

ProteInfer: deep networks for protein functional inference

OPENALEX - Publications

Theo Sanderson Maxwell L. Bileschi David Belanger Lucy J. Colwell

Predicting the function of a protein from its amino acid sequence is long-standing challenge in bioinformatics. Traditional approaches use alignment to compare query either thousands models families or large databases individual sequences. Here we instead employ deep convolutional neural networks directly predict variety functions – EC numbers and GO terms an unaligned sequence. This approach provides precise predictions which complement alignment-based methods, computational efficiency...

10.1101/2021.09.20.461077 preprint EN cc-by bioRxiv (Cold Spring Harbor Laboratory) 2021-09-23

Thermal promotion of the cobalt catalyzed intramolecular Pauson-Khand reaction — An alternative experimental protocol for cyclopentenone synthesis

OPENALEX - Publications

David Belanger Donogh J. R. O’Mahony Tom Livinghouse

10.1016/s0040-4039(98)01693-1 article EN Tetrahedron Letters 1998-10-01

Hexacarbonyldicobalt-alkyne complexes as convenient Co2(CO)8 surrogates in the catalytic Pauson-Khand reaction

OPENALEX - Publications

David Belanger Tom Livinghouse

10.1016/s0040-4039(98)01694-3 article EN Tetrahedron Letters 1998-10-01

Sequential regulatory activity prediction across chromosomes with convolutional neural networks

OPENALEX - Publications

David R. Kelley Yakir Reshef Maxwell L. Bileschi David Belanger Cory Y. McLean and 1 more

Abstract Models for predicting phenotypic outcomes from genotypes have important applications to understanding genomic function and improving human health. Here, we develop a machine-learning system predict cell type-specific epigenetic transcriptional profiles in large mammalian genomes DNA sequence alone. Using convolutional neural networks, this identifies promoters distal regulatory elements synthesizes their content make effective gene expression predictions. We show that model...

10.1101/161851 preprint EN cc-by-nc-nd bioRxiv (Cold Spring Harbor Laboratory) 2017-07-10

(Alkylthio)alkynes as Addends in the Co(0) Catalyzed Intramolecular Pauson-Khand Reaction. Substituent Driven Enhancements of Annulation Efficiency and Stereoselectivity

OPENALEX - Publications

Brian L. Pagenkopf David Belanger Donogh J. R. O’Mahony Tom Livinghouse

Compared to terminal alkynes, (methylthio)alkynes are generally superior substrates for the thermally promoted, Co2(CO)8 catalyzed Pauson-Khand reaction of enynes and allenynes, providing enones in higher yields with enhanced diastereoselectivity. Improvements yield dependent upon use 2,2,2-trifluoroethanol as co-solvent an apparent preference endo selectivity (ethoxy)alkynes also disclosed.

10.1055/s-2000-6301 article EN Synthesis 2000-01-01

On the Counterion Dependence of the Rhodium(I)-Catalysed [4 + 2] Cycloaddition - A Remarkable Accelerating Effect of the Hexafluoroantimonate Anion

OPENALEX - Publications

Donogh J. R. O’Mahony David Belanger Tom Livinghouse

The choice of electronic environment about the metal atom is crucial to observed selectivity in Rh(I)-catalysed [4 + 2] cycloaddition. An account influence counterion on rate, diastereo-, enantio- and product described.

10.1055/s-1998-1664 article EN Synlett 1998-04-01

Discovery of imidazo[1,2-a]pyrazine-based Aurora kinase inhibitors

OPENALEX - Publications

David Belanger Patrick J. Curran Alan Hruza Johannes Voigt Zhaoyang Meng and 4 more

10.1016/j.bmcl.2010.07.008 article EN Bioorganic & Medicinal Chemistry Letters 2010-07-09

Discovery of novel imidazo[1,2-a]pyrazin-8-amines as Brk/PTK6 inhibitors

OPENALEX - Publications

Hongbo Zeng David Belanger Patrick J. Curran Gerald W. Shipps Hua Miao and 4 more

10.1016/j.bmcl.2011.07.101 article EN Bioorganic & Medicinal Chemistry Letters 2011-08-05

Is Transfer Learning Necessary for Protein Landscape Prediction?

OPENALEX - Publications

Amir Shanehsazzadeh David Belanger David Dohan

Recently, there has been great interest in learning how to best represent proteins, specifically with fixed-length embeddings. Deep become a popular tool for protein representation as model's hidden layers produce potentially useful vector TAPE introduced number of benchmark tasks and showed that semi-supervised learning, via pretraining language models on large corpus, improved performance downstream tasks. Two the (fluorescence prediction stability prediction) involve fitness landscapes....

10.48550/arxiv.2011.03443 preprint EN other-oa arXiv (Cornell University) 2020-01-01

Coming Soon ...