NFDI4DS | UHH-SEMS - Publication Details

Anders Andreassen

ORCID: 0000-0003-3504-3919

About

Contact & Profiles

OpenAlex author ID: A5052645158

Research Areas

Particle physics theoretical and experimental studies
Topic Modeling
Natural Language Processing Techniques
Particle Detector Development and Performance
High-Energy Particle Collisions Research
Adversarial Robustness in Machine Learning
Computational Physics and Python Applications
Quantum Chromodynamics and Particle Interactions
Quantum Mechanics and Applications
Cosmology and Gravitation Theories
Machine Learning and Data Classification
Gaussian Processes and Bayesian Inference
Genomics and Phylogenetic Studies
Cold Atom Physics and Bose-Einstein Condensates
Black Holes and Theoretical Physics
Multimodal Machine Learning Applications
Domain Adaptation and Few-Shot Learning
Computer Graphics and Visualization Techniques
Infectious Diseases and Mycology
Anomaly Detection Techniques and Applications
Ferroelectric and Negative Capacitance Devices
Neural Networks and Applications
Markov Chains and Monte Carlo Methods
Relativity and Gravitational Theory
Big Data and Business Intelligence

Google (United States)
2019-2021

Lawrence Berkeley National Laboratory
2019-2020

University of California, Berkeley
2019-2020

Harvard University
2014-2019

Harvard University Press
2016

Beyond the Imitation Game: Quantifying and extrapolating the capabilities of language models

OPENALEX - Publications

Aarohi Srivastava Abhinav Rastogi Abhishek S. Rao Abu Awal Shoeb Abubakar Abid and 95 more

Language models demonstrate both quantitative improvement and new qualitative capabilities with increasing scale. Despite their potentially transformative impact, these new capabilities are as yet poorly characterized. In order to inform future research, prepare for disruptive new model capabilities, and ameliorate socially harmful effects, it is vital that we understand the present and near-future capabilities and limitations of language models. To address this challenge, we introduce the Beyond the Imitation Game benchmark (BIG-bench). BIG-bench...

10.48550/arxiv.2206.04615 preprint EN other-oa arXiv (Cornell University) 2022-01-01

Gemini: A Family of Highly Capable Multimodal Models

Gemini Team Rohan Anil Sebastian Borgeaud Jean-Baptiste Alayrac Jiahui Yu and 95 more

This report introduces a new family of multimodal models, Gemini, that exhibit remarkable capabilities across image, audio, video, and text understanding. The Gemini family consists of Ultra, Pro, and Nano sizes, suitable for applications ranging from complex reasoning tasks to on-device memory-constrained use-cases. Evaluation on a broad range of benchmarks shows that our most-capable Gemini Ultra model advances the state of the art in 30 of 32 of these benchmarks - notably being the first model to achieve human-expert performance on the well-studied exam...

10.48550/arxiv.2312.11805 preprint EN cc-by arXiv (Cornell University) 2023-01-01

Solving Quantitative Reasoning Problems with Language Models

Aitor Lewkowycz Anders Andreassen D. Dohan Ethan Dyer Henryk Michalewski and 9 more

Language models have achieved remarkable performance on a wide range of tasks that require natural language understanding. Nevertheless, state-of-the-art models have generally struggled with tasks that require quantitative reasoning, such as solving mathematics, science, and engineering problems at the college level. To help close this gap, we introduce Minerva, a large language model pretrained on general natural language data and further trained on technical content. The model achieves state-of-the-art performance on technical benchmarks without the use of external tools. We also evaluate our model on over two hundred...

10.48550/arxiv.2206.14858 preprint EN cc-by arXiv (Cornell University) 2022-01-01

JUNIPR: a framework for unsupervised machine learning in particle physics

Anders Andreassen Ilya Feige Christopher Frye Matthew D. Schwartz

In applications of machine learning to particle physics, a persistent challenge is how to go beyond discrimination to learn about the underlying physics. To this end, a powerful tool would be a framework for unsupervised learning, where the machine learns the intricate high-dimensional contours of the data upon which it is trained, without reference to pre-established labels. In order to approach such a complex task, an unsupervised network must be structured intelligently, based on a qualitative understanding of the data. In this paper, we scaffold the neural network's...

10.1140/epjc/s10052-019-6607-9 article EN cc-by The European Physical Journal C 2019-02-01

Show Your Work: Scratchpads for Intermediate Computation with Language Models

Maxwell Nye Anders Andreassen Guy Gur-Ari Henryk Michalewski Jacob Austin and 7 more

Large pre-trained language models perform remarkably well on tasks that can be done "in one pass", such as generating realistic text or synthesizing computer programs. However, they struggle with tasks that require unbounded multi-step computation, such as adding integers or executing programs. Surprisingly, we find that these same models are able to perform complex multi-step computations -- even in the few-shot regime -- when asked to perform the operation "step by step", showing the results of intermediate computations. In particular, we train transformers to perform multi-step computations by asking them to emit...

10.48550/arxiv.2112.00114 preprint EN other-oa arXiv (Cornell University) 2021-01-01

OmniFold: A Method to Simultaneously Unfold All Observables

Anders Andreassen Patrick Komiske Eric Metodiev Benjamin Nachman Jesse Thaler

Collider data must be corrected for detector effects ("unfolded") to be compared with many theoretical calculations and measurements from other experiments. Unfolding is traditionally done for individual, binned observables without including all information relevant for characterizing the detector response. We introduce OmniFold, an unfolding method that iteratively reweights a simulated dataset, using machine learning to capitalize on all available information. Our approach is unbinned, works for arbitrarily...

10.1103/physrevlett.124.182001 article EN cc-by Physical Review Letters 2020-05-07

Simulation assisted likelihood-free anomaly detection

Anders Andreassen Benjamin Nachman David Shih

Given the lack of evidence for new particle discoveries at the Large Hadron Collider (LHC), it is critical to broaden the search program. A variety of model-independent searches have been proposed, adding sensitivity to unexpected signals. There are generally two types of such searches: those that rely heavily on simulations and those that are entirely based on (unlabeled) data. This paper introduces a hybrid method that makes the best of both approaches. For potential signals that are resonant in one known feature, this new method first learns...

10.1103/physrevd.101.095004 article EN cc-by Physical Review D 2020-05-06

Scale-invariant instantons and the complete lifetime of the standard model

Anders Andreassen William Frost Matthew D. Schwartz

In a classically scale-invariant quantum field theory, tunneling rates are infrared divergent due to the existence of instantons of any size. While one expects such divergences to be resolved by quantum effects, it has been unclear how higher-loop corrections can resolve a problem appearing already at one loop. With a careful power counting, we uncover a series of loop contributions that dominate over the one-loop result and sum all the necessary terms. We also clarify previously incomplete treatments of related issues...

10.1103/physrevd.97.056006 article EN cc-by Physical Review D 2018-03-12

Consistent Use of the Standard Model Effective Potential

Anders Andreassen William Frost Matthew D. Schwartz

The stability of the standard model is determined by the true minimum of the effective Higgs potential. We show that the potential at its minimum, when computed by the traditional method, is strongly dependent on the gauge parameter. It moreover depends on the scale where the potential is calculated. We provide a consistent method for determining absolute stability independent of both gauge and calculation scale, order by order in perturbation theory. This leads to revised stability bounds $m_h^{\text{pole}} > (129.4 \pm 2.3)\ \mathrm{GeV}$...

10.1103/physrevlett.113.241801 article EN publisher-specific-oa Physical Review Letters 2014-12-08

Precision decay rate calculations in quantum field theory

Anders Andreassen David Farhi William Frost Matthew D. Schwartz

Tunneling in quantum field theory is worth understanding properly, not least because it controls the long-term fate of our Universe. There are, however, a number of features of tunneling rate calculations which lack a desirable transparency, such as the necessity of analytic continuation, the appropriateness of using an effective instead of classical potential, and the sensitivity to short-distance physics. This paper attempts to review in pedagogical detail the physical origin of tunneling and its connection to the path integral. Both the traditional...

10.1103/physrevd.95.085011 article EN publisher-specific-oa Physical Review D 2017-04-13

Consistent use of effective potentials

Anders Andreassen William Frost Matthew D. Schwartz

It is well known that effective potentials can be gauge dependent while their values at extrema should be gauge invariant. Unfortunately, establishing this invariance in perturbation theory is not straightforward, since contributions from arbitrarily high-order loops can be of the same size. We show in massless scalar QED that an infinite class of loops can be summed (and must be summed) to give a gauge-invariant value for the potential at its minimum. In addition, we show that the exact potential depends on both the scale at which it is calculated and the normalization of the fields,...

10.1103/physrevd.91.016009 article EN publisher-specific-oa Physical Review D: Particles, Fields, Gravitation, and Cosmology 2015-01-28

Neural networks for full phase-space reweighting and parameter tuning

Anders Andreassen Benjamin Nachman

Precise scientific analysis in collider-based particle physics is possible because of complex simulations that connect fundamental theories to observable quantities. The significant computational cost of these programs limits the scope, precision, and accuracy of Standard Model measurements and searches for new phenomena. We therefore introduce Deep neural networks using Classification for Tuning and Reweighting (DCTR), a neural network-based approach to reweight and fit simulations using all kinematic and flavor information -- the full phase...

10.1103/physrevd.101.091901 article EN cc-by Physical Review D 2020-05-12

Exploring Length Generalization in Large Language Models

Cem Anil Yuhuai Wu Anders Andreassen Aitor Lewkowycz Vedant Misra and 5 more

The ability to extrapolate from short problem instances to longer ones is an important form of out-of-distribution generalization in reasoning tasks, and is crucial when learning from datasets where longer problem instances are rare. These include theorem proving, solving quantitative mathematics problems, and reading/summarizing novels. In this paper, we run careful empirical studies exploring the length generalization capabilities of transformer-based language models. We first establish that naively finetuning transformers on length generalization tasks shows...

10.48550/arxiv.2207.04901 preprint EN cc-by arXiv (Cornell University) 2022-01-01

Understanding the Failure Modes of Out-of-Distribution Generalization

Vaishnavh Nagarajan Anders Andreassen Behnam Neyshabur

Empirical studies suggest that machine learning models often rely on features, such as the background, that may be spuriously correlated with the label only during training time, resulting in poor accuracy during test-time. In this work, we identify the fundamental factors that give rise to this behavior, by explaining why models fail this way even in easy-to-learn tasks where one would expect these models to succeed. In particular, through a theoretical study of gradient-descent-trained linear classifiers on some easy-to-learn tasks, we uncover two...

10.48550/arxiv.2010.15775 preprint EN other-oa arXiv (Cornell University) 2020-01-01

Direct Approach to Quantum Tunneling

Anders Andreassen David Farhi William Frost Matthew D. Schwartz

The decay rates of quasistable states in quantum field theories are usually calculated using instanton methods. Standard derivations of these methods rely in a crucial way upon deformations and analytic continuations of the physical potential, and on the saddle point approximation. While the resulting procedure can be checked against other semi-classical approaches in some one-dimensional cases, it is challenging to trace the role of the relevant physical scales, and any intuitive handle on the precision of the approximations involved is at best obscure....

10.1103/physrevlett.117.231601 article EN publisher-specific-oa Physical Review Letters 2016-11-30

Binary JUNIPR: An Interpretable Probabilistic Model for Discrimination

Anders Andreassen Ilya Feige Christopher Frye Matthew D. Schwartz

JUNIPR is an approach to unsupervised learning in particle physics that scaffolds a probabilistic model for jets around their representation as binary trees. Separate JUNIPR models can be learned for different event or jet types, then compared and explored for physical insight. The relative probabilities can also be used for discrimination. In this Letter, we show how the training of the separate models can be refined in the context of classification to optimize discrimination power. We refer to this refined approach as binary JUNIPR. Binary JUNIPR achieves state-of-the-art performance...

10.1103/physrevlett.123.182001 article EN cc-by Physical Review Letters 2019-10-31

The Evolution of Out-of-Distribution Robustness Throughout Fine-Tuning

Anders Andreassen Yasaman Bahri Behnam Neyshabur Rebecca Roelofs

Although machine learning models typically experience a drop in performance on out-of-distribution data, accuracies on in- versus out-of-distribution data are widely observed to follow a single linear trend when evaluated across a testbed of models. Models that are more accurate on the out-of-distribution data relative to this baseline exhibit "effective robustness" and are exceedingly rare. Identifying such models, and understanding their properties, is key to improving out-of-distribution performance. We conduct a thorough empirical investigation of effective robustness during...

10.48550/arxiv.2106.15831 preprint EN cc-by arXiv (Cornell University) 2021-01-01

Parameter estimation using neural networks in the presence of detector effects

Anders Andreassen S.‐C. Hsu Benjamin Nachman Natchanon Suaysom Adi Suresh

Histogram-based template fits are the main technique used for estimating parameters of high energy physics Monte Carlo generators. Parametrized neural network reweighting can be used to extend this fitting procedure to many dimensions and does not require binning. If the fit is to be performed using reconstructed data, then expensive detector simulations must be used for training the neural networks. We introduce a new two-level fitting approach that only requires one dataset with detector simulation and then a set of additional generation-level datasets without...

10.1103/physrevd.103.036001 article EN cc-by Physical Review D 2021-02-01

Scaffolding Simulations with Deep Learning for High-dimensional Deconvolution

Anders Andreassen Patrick Komiske Eric Metodiev Benjamin Nachman Adi Suresh and 1 more

A common setting for scientific inference is the ability to sample from a high-fidelity forward model (simulation) without having an explicit probability density of the data. We propose a simulation-based maximum likelihood deconvolution approach in this setting called OmniFold. Deep learning enables this approach to be naturally unbinned and (variable-, and) high-dimensional. In contrast to model parameter estimation, the goal of deconvolution is to remove detector distortions in order to enable a variety of down-stream inference tasks. Our approach is the deep learning generalization of the common Richardson-Lucy...

10.48550/arxiv.2105.04448 preprint EN cc-by arXiv (Cornell University) 2021-01-01

Asymptotics of Wide Convolutional Neural Networks

Anders Andreassen Ethan Dyer

Wide neural networks have proven to be a rich class of architectures for both theory and practice. Motivated by the observation that finite width convolutional networks appear to outperform infinite width networks, we study scaling laws for wide CNNs and networks with skip connections. Following the approach of (Dyer & Gur-Ari, 2019), we present a simple diagrammatic recipe to derive the asymptotic width dependence for many quantities of interest. These scaling relationships provide a solvable description for the training dynamics of wide convolutional networks. We test these relations across...

10.48550/arxiv.2008.08675 preprint EN other-oa arXiv (Cornell University) 2020-01-01

Reducing the top quark mass uncertainty with jet grooming

Anders Andreassen Matthew D. Schwartz

The measurement of the top quark mass has large systematic uncertainties coming from the Monte Carlo simulations that are used to match theory and experiment. We explore how much that uncertainty can be reduced by using jet grooming procedures. Using the ATLAS A14 tunes of Pythia, we estimate the uncertainty from the choice of tuning parameters in what is meant by the Monte Carlo mass to be around 530 MeV without any corrections. This uncertainty can be reduced by 60% to 200 MeV by calibrating to the W mass and by 70% to 140 MeV by additionally applying soft-drop jet grooming (or to 170 MeV using trimming). At e+e− colliders, the associated uncertainty is around 110 MeV, reducing...

10.1007/jhep10(2017)151 article EN cc-by Journal of High Energy Physics 2017-10-01