- Statistical Methods and Inference
- Statistical Methods in Clinical Trials
- Machine Learning and Algorithms
- Advanced Bandit Algorithms Research
- Advanced Statistical Methods and Models
- Advanced Statistical Process Monitoring
- Statistical Methods and Bayesian Inference
- Bayesian Methods and Mixture Models
- Advanced Causal Inference Techniques
- Machine Learning and Data Classification
- Markov Chains and Monte Carlo Methods
- Bayesian Modeling and Causal Inference
- Stochastic Gradient Optimization Techniques
- Sparse and Compressive Sensing Techniques
- Gaussian Processes and Bayesian Inference
- Adversarial Robustness in Machine Learning
- Gene Expression and Cancer Classification
- Random Matrices and Applications
- Anomaly Detection Techniques and Applications
- Probability and Risk Models
- Imbalanced Data Classification Techniques
- Optimal Experimental Design Methods
- Distributed Sensor Networks and Detection Algorithms
- Sports Analytics and Performance
- Auction Theory and Applications
Carnegie Mellon University
2016-2025
Google (United States)
2023-2024
University of Waterloo
2021
University of California, Berkeley
2015-2020
University of Chicago
2019
Amazon (Germany)
2019
Berkeley College
2015
Nonparametric two-sample or homogeneity testing is a decision theoretic problem that involves identifying differences between two random variables without making parametric assumptions about their underlying distributions. The literature is old and rich, with a wide variety of statistics having been designed and analyzed, both for the unidimensional and the multivariate setting. In this short survey, we focus on test statistics that involve the Wasserstein distance. Using an entropic smoothing of the Wasserstein distance, we connect these to very...
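As an illustration of the kind of test surveyed here, below is a minimal permutation test using the univariate Wasserstein distance as the statistic. This is only a sketch: it uses SciPy's plain `wasserstein_distance`, not the entropically smoothed variants the survey connects to other tests.

```python
import numpy as np
from scipy.stats import wasserstein_distance

def wasserstein_perm_test(x, y, n_perm=1000, seed=0):
    """Two-sample test with the 1-D Wasserstein distance as the statistic,
    calibrated by a permutation null (no parametric assumptions)."""
    rng = np.random.default_rng(seed)
    observed = wasserstein_distance(x, y)
    pooled = np.concatenate([x, y])
    n = len(x)
    null_stats = np.empty(n_perm)
    for i in range(n_perm):
        perm = rng.permutation(pooled)
        null_stats[i] = wasserstein_distance(perm[:n], perm[n:])
    # p-value: fraction of permuted statistics at least as large as observed
    return (1 + np.sum(null_stats >= observed)) / (n_perm + 1)

# Example: N(0,1) vs N(0.5,1)
rng = np.random.default_rng(1)
print(wasserstein_perm_test(rng.normal(0, 1, 200), rng.normal(0.5, 1, 200)))
```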
Story understanding involves many perceptual and cognitive subprocesses, from perceiving individual words, to parsing sentences, to understanding the relationships among story characters. We present an integrated computational model of reading that incorporates these and additional subprocesses, simultaneously discovering their fMRI signatures. Our model predicts the fMRI activity associated with arbitrary text passages, well enough to distinguish which of two story segments is being read with 74% accuracy. This approach is the first to simultaneously track diverse subprocesses...
The Kaczmarz and Gauss--Seidel methods both solve a linear system $\boldsymbol{X}\boldsymbol{\beta} = \boldsymbol{y}$ by iteratively refining the solution estimate. Recent interest in these methods has been sparked by a proof of Strohmer and Vershynin, which shows that the randomized Kaczmarz method converges linearly in expectation to the solution. Leventhal and Lewis then proved a similar result for the randomized Gauss--Seidel algorithm. However, the behavior of both methods depends heavily on whether the system is underdetermined or overdetermined, and on whether it is consistent or not. Here we provide a unified...
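For concreteness, here is a minimal sketch of the Strohmer--Vershynin randomized Kaczmarz iteration referenced above: sample a row with probability proportional to its squared norm, then project the iterate onto that row's hyperplane. The convergence theory is in the cited works; this only shows the mechanics.

```python
import numpy as np

def randomized_kaczmarz(X, y, n_iter=5000, seed=0):
    """Randomized Kaczmarz: sample row i with probability ~ ||x_i||^2,
    then project beta onto the hyperplane x_i^T beta = y_i."""
    rng = np.random.default_rng(seed)
    m, n = X.shape
    row_norms2 = np.einsum('ij,ij->i', X, X)
    probs = row_norms2 / row_norms2.sum()
    beta = np.zeros(n)
    for _ in range(n_iter):
        i = rng.choice(m, p=probs)
        beta += (y[i] - X[i] @ beta) / row_norms2[i] * X[i]
    return beta

# Overdetermined consistent system: converges to the unique solution
rng = np.random.default_rng(2)
X = rng.normal(size=(100, 10))
beta_true = rng.normal(size=10)
print(np.linalg.norm(randomized_kaczmarz(X, X @ beta_true) - beta_true))
```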
This paper introduces the jackknife+, a novel method for constructing predictive confidence intervals. Whereas the jackknife outputs an interval centered at the predicted response of a test point, with width determined by the quantiles of the leave-one-out residuals, the jackknife+ also uses the leave-one-out predictions at the test point to account for the variability in the fitted regression function. Assuming exchangeable training samples, we prove that this crucial modification permits rigorous coverage guarantees regardless of the distribution of the data...
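A compact sketch of the jackknife+ interval as described, assuming a generic scikit-learn-style regressor; `LinearRegression` is just a placeholder choice of base model.

```python
import numpy as np
from sklearn.linear_model import LinearRegression

def jackknife_plus(X, y, x_test, alpha=0.1, model_cls=LinearRegression):
    """Jackknife+ prediction interval: uses leave-one-out residuals AND
    leave-one-out predictions at the test point, rather than one interval
    centered at the full-model prediction."""
    n = len(y)
    lo, hi = [], []
    for i in range(n):
        mask = np.arange(n) != i
        model = model_cls().fit(X[mask], y[mask])
        resid = abs(y[i] - model.predict(X[i:i+1])[0])   # leave-one-out residual
        pred = model.predict(x_test.reshape(1, -1))[0]   # leave-one-out prediction at x_test
        lo.append(pred - resid)
        hi.append(pred + resid)
    k = int(np.ceil((1 - alpha) * (n + 1)))              # assumes k <= n, i.e. n not too small
    return np.sort(lo)[n - k], np.sort(hi)[k - 1]
```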
Conformal prediction is a popular, modern technique for providing valid predictive inference for arbitrary machine learning models. Its validity relies on the assumptions of exchangeability of the data, and symmetry of the given model fitting algorithm as a function of the data. However, these assumptions are often violated when predictive models are deployed in practice. For example, if the data distribution drifts over time, then the data points are no longer exchangeable; moreover, in such settings, we might want to use a nonsymmetric algorithm that treats recent observations...
We derive confidence intervals (CIs) and confidence sequences (CSs) for the classical problem of estimating a bounded mean. Our approach generalizes and improves on the celebrated Chernoff method, yielding the best closed-form "empirical-Bernstein" CSs and CIs (converging exactly to the oracle Bernstein width) as well as non-closed-form "betting" CSs and CIs. Our method combines new composite nonnegative (super)martingales with Ville's maximal inequality, with strong connections to testing by betting and the method of mixtures. We also show how these ideas...
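The following is a simplified betting-style confidence sequence in the spirit of this paper, with an ad hoc truncated bet size. The paper's tuned constructions are tighter, so treat this purely as a sketch of the capital-process mechanism: for each candidate mean m, wealth grows when the data contradict m, and Ville's inequality makes the exclusion rule valid uniformly over time.

```python
import numpy as np

def hedged_betting_cs(xs, alpha=0.05, grid_size=1000):
    """Betting confidence sequence for the mean of [0,1]-bounded data (sketch).
    For each candidate mean m, track capital betting 'up' and 'down'; m stays
    in the set while the hedged capital is below 1/alpha."""
    m = np.linspace(0.001, 0.999, grid_size)   # candidate means
    cap_up = np.ones(grid_size)                # wealth betting that mean > m
    cap_dn = np.ones(grid_size)                # wealth betting that mean < m
    mu, ssq, t = 0.5, 0.25, 0                  # running mean / variance estimates
    for x in xs:
        t += 1
        lam = np.sqrt(2 * np.log(2 / alpha) / (ssq * t))       # predictable bet size
        cap_up *= 1 + np.minimum(lam, 0.5 / m) * (x - m)       # truncation keeps wealth >= 0
        cap_dn *= 1 - np.minimum(lam, 0.5 / (1 - m)) * (x - m)
        mu += (x - mu) / (t + 1)               # update estimates AFTER betting (predictability)
        ssq += ((x - mu) ** 2 - ssq) / (t + 1)
    keep = 0.5 * (cap_up + cap_dn) < 1 / alpha
    if not keep.any():
        return float('nan'), float('nan')      # numerically empty; rare for honest data
    return m[keep].min(), m[keep].max()

print(hedged_betting_cs(np.random.default_rng(0).beta(2, 5, 500)))
```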
This paper is about two related decision theoretic problems: nonparametric two-sample testing and independence testing. There is a belief that recently proposed solutions, based on kernels and distances between pairs of points, behave well in high-dimensional settings. We identify different sources of misconception that give rise to the above belief. Specifically, we differentiate the hardness of estimating the test statistics from the hardness of testing whether these statistics are zero or not, and we explicitly discuss a notion of "fair" alternative hypotheses...
Significance: Most statistical methods rely on certain mathematical conditions, known as regularity assumptions, to ensure their validity. Without these conditions, quantities like P values and confidence intervals might not be valid. In this paper we give a surprisingly simple method for producing statistical significance statements without any regularity conditions. The resulting hypothesis tests can be used for any parametric model and for several nonparametric models.
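This appears to describe universal inference via the split likelihood ratio test; assuming so, here is a toy sketch for a Gaussian point null. The key fact is that the split likelihood ratio U has expectation at most one under the null, so 1/U is a valid p-value by Markov's inequality, with no regularity conditions.

```python
import numpy as np
from scipy.stats import norm

def split_lrt_pvalue_mu0(x, mu0=0.0, seed=0):
    """Split LRT sketch for H0: mu = mu0 in a N(mu, 1) model: estimate mu on
    one half of the data, evaluate the likelihood ratio on the other half."""
    rng = np.random.default_rng(seed)
    idx = rng.permutation(len(x))
    d1, d0 = x[idx[: len(x) // 2]], x[idx[len(x) // 2:]]
    mu_hat = d1.mean()                           # any estimator on half 1
    # log likelihood ratio evaluated on the held-out half 0
    log_u = norm.logpdf(d0, mu_hat, 1).sum() - norm.logpdf(d0, mu0, 1).sum()
    return min(1.0, np.exp(-log_u))              # valid p-value: reject if <= alpha

x = np.random.default_rng(3).normal(0.4, 1, 100)
print(split_lrt_pvalue_mu0(x))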
We propose a method to optimize the representation and distinguishability of samples from two probability distributions, by maximizing the estimated power of a statistical test based on the maximum mean discrepancy (MMD). This optimized MMD is applied to the setting of unsupervised learning with generative adversarial networks (GANs), in which a model attempts to generate realistic samples and a discriminator attempts to tell these apart from data samples. In this context, the MMD may be used in two roles: first, as a discriminator, either directly on the samples or on their features...
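For reference, the unbiased estimator of the squared MMD with a Gaussian kernel, the statistic whose test power the paper proposes to optimize (e.g. over the bandwidth or learned features). `X` and `Y` are assumed to be (n, d) and (m, d) arrays.

```python
import numpy as np

def mmd2_unbiased(X, Y, bandwidth=1.0):
    """Unbiased estimator of squared MMD with a Gaussian kernel."""
    def gram(A, B):
        sq = (A**2).sum(1)[:, None] + (B**2).sum(1)[None, :] - 2 * A @ B.T
        return np.exp(-sq / (2 * bandwidth**2))
    Kxx, Kyy, Kxy = gram(X, X), gram(Y, Y), gram(X, Y)
    n, m = len(X), len(Y)
    np.fill_diagonal(Kxx, 0)   # drop i == j terms for unbiasedness
    np.fill_diagonal(Kyy, 0)
    return Kxx.sum() / (n * (n - 1)) + Kyy.sum() / (m * (m - 1)) - 2 * Kxy.mean()
```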
We consider the problem of distribution-free predictive inference, with the goal of producing coverage guarantees that hold conditionally rather than marginally. Existing methods such as conformal prediction offer marginal coverage guarantees, where coverage holds on average over all possible test points, but this is not sufficient for many practical applications in which we would like to know that our predictions are valid for a given individual, not merely on average over a population. On the other hand, exact conditional inference is known to be...
A confidence sequence is a sequence of confidence intervals that is uniformly valid over an unbounded time horizon. Our work develops confidence sequences whose widths go to zero, with nonasymptotic coverage guarantees under nonparametric conditions. We draw connections between the Cram\'er-Chernoff method for exponential concentration, the law of the iterated logarithm (LIL), and the sequential probability ratio test: our confidence sequences are time-uniform extensions of the first; they provide tight, nonasymptotic characterizations of the second; and they generalize the third to nonparametric settings, including...
E-values have gained attention as potential alternatives to p-values as measures of uncertainty, significance and evidence. In brief, e-values are realized by random variables with expectation at most one under the null; examples include betting scores, (point null) Bayes factors, likelihood ratios and stopped supermartingales. We design a natural analogue of the Benjamini-Hochberg (BH) procedure for false discovery rate (FDR) control that utilizes e-values, called the e-BH procedure, and compare it...
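The e-BH procedure itself is short enough to state in code: sort the e-values in decreasing order, find the largest k whose k-th largest e-value is at least n/(alpha*k), and reject the hypotheses with the k largest e-values. A minimal sketch:

```python
import numpy as np

def e_bh(e_values, alpha=0.1):
    """e-BH: reject the k largest e-values, for the largest k with
    e_(k) >= n / (alpha * k). FDR control holds under arbitrary dependence."""
    e = np.asarray(e_values, dtype=float)
    n = len(e)
    order = np.argsort(-e)                  # indices sorted by decreasing e-value
    ks = np.arange(1, n + 1)
    ok = e[order] >= n / (alpha * ks)
    if not ok.any():
        return np.array([], dtype=int)
    k = ks[ok].max()                        # largest k passing the threshold
    return np.sort(order[:k])               # indices of rejected hypotheses

print(e_bh([50, 30, 1.2, 0.8, 20], alpha=0.1))   # rejects indices 0, 1, 4
```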
Safe anytime-valid inference (SAVI) provides measures of statistical evidence and certainty (e-processes for testing and confidence sequences for estimation) that remain valid at all stopping times, accommodating continuous monitoring and analysis of accumulating data and optional stopping or continuation for any reason. These measures crucially rely on test martingales, which are nonnegative martingales starting at one. Since a test martingale is the wealth process of a player in a betting game, SAVI centrally employs game-theoretic intuition...
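A toy illustration of the test-martingale idea, hypothetical choices of null and alternative: for H0 "the coin has bias 0.5", the running likelihood ratio is a nonnegative martingale with initial wealth one under H0, so rejecting whenever wealth reaches 1/alpha is valid at any stopping time.

```python
import numpy as np

def coin_test_martingale(flips, p_alt=0.7, p_null=0.5):
    """Wealth process M_t = running likelihood ratio (betting on p_alt).
    Under H0 (bias = p_null) each factor has mean 1, so M_t is a test
    martingale; by Ville's inequality P(sup_t M_t >= 1/alpha) <= alpha."""
    flips = np.asarray(flips)
    lr = np.where(flips == 1, p_alt / p_null, (1 - p_alt) / (1 - p_null))
    return np.cumprod(lr)   # monitor continuously, stop whenever you like

wealth = coin_test_martingale(np.random.default_rng(4).binomial(1, 0.7, 100))
print(wealth[-1], (wealth >= 20).any())   # threshold 1/alpha = 20 for alpha = 0.05
```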
We develop a class of exponential bounds for the probability that a martingale sequence crosses a time-dependent linear threshold. Our key insight is that it is both natural and fruitful to formulate exponential concentration inequalities in this way. We illustrate this point by presenting a single assumption and theorem that together unify and strengthen many tail bounds for martingales, including classical inequalities (1960–80) by Bernstein, Bennett, Hoeffding, and Freedman; contemporary inequalities (1980–2000) by Shorack and Wellner, Pinelis, Blackwell, van de Geer, and de la Peña; and several...
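One standard sub-Gaussian instance of the line-crossing formulation (stated here for illustration, not the paper's most general result): suppose $\exp(\lambda S_t - \lambda^2 V_t / 2)$ is a nonnegative supermartingale for each fixed $\lambda > 0$, where $S_t$ is the martingale and $V_t$ an accompanying variance process. Choosing $\lambda = 2b$ and applying Ville's maximal inequality gives, for any $a, b > 0$,

$$\mathbb{P}\left(\exists\, t \ge 1 : S_t \ge a + b\,V_t\right) \;\le\; \exp(-2ab),$$

since on that event $\lambda S_t - \lambda^2 V_t/2 \ge 2ab$. Fixed-time Hoeffding-type bounds are recovered by optimizing $a$ and $b$ for a single target time.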
We extend conformal prediction methodology beyond the case of exchangeable data. In particular, we show that a weighted version of conformal prediction can be used to compute distribution-free prediction intervals for problems in which the test and training covariate distributions differ, but the likelihood ratio between these two distributions is known, or can in practice be estimated accurately with access to a large set of unlabeled data (test covariate points). Our weighted extension also applies more generally, to settings in which the data satisfies a certain notion of weighted exchangeability. We discuss other...
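A sketch of the weighted quantile at the heart of this extension, assuming a split-conformal setup with precomputed nonconformity scores and known (or estimated) likelihood-ratio weights:

```python
import numpy as np

def weighted_conformal_quantile(scores_cal, w_cal, w_test, alpha=0.1):
    """Weighted split conformal under covariate shift: reweight calibration
    scores by likelihood ratios w(x) = dP_test/dP_train and return the
    weighted (1 - alpha) quantile that defines the prediction set."""
    w = np.concatenate([w_cal, [w_test]])
    p = w / w.sum()                              # normalized weights
    s = np.concatenate([scores_cal, [np.inf]])   # test point contributes +inf
    order = np.argsort(s)
    cum = np.cumsum(p[order])
    # smallest score whose cumulative weight reaches 1 - alpha
    return s[order][np.searchsorted(cum, 1 - alpha)]

# Prediction set at x_test: {y : score(x_test, y) <= returned quantile}
```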
When data analysts train a classifier and check if its accuracy is significantly different from chance, they are implicitly performing a two-sample test. We investigate the statistical properties of this flexible approach in the high-dimensional setting. We prove two results that hold for all classifiers in any dimension: if the classifier's true error remains $\epsilon$-better than chance for some $\epsilon > 0$ as $d, n \to \infty$, then (a) the permutation-based test is consistent (has power approaching one), and (b) a computationally...
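The classifier two-sample test in its simplest form, a sketch using a logistic-regression base classifier (any classifier could be substituted):

```python
import numpy as np
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import cross_val_score

def classifier_two_sample_test(X1, X2, n_perm=200, seed=0):
    """Label the two samples 0/1, estimate classification accuracy by
    cross-validation, and calibrate against a permutation null (shuffled
    labels). Small p-values indicate the distributions differ."""
    rng = np.random.default_rng(seed)
    X = np.vstack([X1, X2])
    y = np.concatenate([np.zeros(len(X1)), np.ones(len(X2))])
    def cv_acc(labels):
        clf = LogisticRegression(max_iter=1000)
        return cross_val_score(clf, X, labels, cv=5).mean()
    observed = cv_acc(y)
    null = np.array([cv_acc(rng.permutation(y)) for _ in range(n_perm)])
    return (1 + np.sum(null >= observed)) / (n_perm + 1)
```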
We propose confidence sequences (sequences of intervals which are valid uniformly over time) for quantiles of any distribution over a complete, fully-ordered set, based on a stream of i.i.d. observations. We give methods both for tracking a fixed quantile and for tracking all quantiles simultaneously. Specifically, we provide explicit expressions with small constants for intervals whose widths shrink at the fastest possible $\sqrt{t^{-1} \log\log t}$ rate, along with a non-asymptotic concentration inequality for the empirical distribution function that holds uniformly over time at the same rate. The latter...
This paper presents a fast and robust algorithm for trend filtering, a recently developed nonparametric regression tool. It has been shown that, for estimating functions whose derivatives are of bounded variation, trend filtering achieves the minimax optimal error rate, while other popular methods like smoothing splines and kernels do not. Standing in the way of more widespread practical adoption, however, is a lack of scalable and numerically stable algorithms for fitting trend filtering estimates. We present a highly efficient, specialized ADMM routine...
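To show what the optimization problem looks like, here is a plain textbook ADMM for linear trend filtering, minimizing $\frac{1}{2}\|y - \beta\|^2 + \lambda \|D\beta\|_1$ with $D$ the second-difference operator. This is not the paper's specialized routine (which is faster and more stable); it is only a small demo of the splitting.

```python
import numpy as np

def trend_filter_admm(y, lam=10.0, rho=1.0, n_iter=200):
    """Generic ADMM sketch for linear (piecewise-linear) trend filtering."""
    n = len(y)
    D = np.diff(np.eye(n), n=2, axis=0)          # (n-2) x n second differences
    A_inv = np.linalg.inv(np.eye(n) + rho * D.T @ D)   # fine for a small demo
    z = np.zeros(n - 2)
    u = np.zeros(n - 2)
    for _ in range(n_iter):
        beta = A_inv @ (y + rho * D.T @ (z - u))             # quadratic subproblem
        Db = D @ beta
        z = np.sign(Db + u) * np.maximum(np.abs(Db + u) - lam / rho, 0)  # soft-threshold
        u += Db - z                                           # dual update
    return beta
```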
In many practical applications of multiple testing, there are natural ways to partition the hypotheses into groups using the structural, spatial or temporal relatedness of the hypotheses, and this prior knowledge is not used in the classical Benjamini–Hochberg procedure for controlling the false discovery rate (FDR). When one can define (possibly several) such partitions, it may be desirable to control the group FDR simultaneously for all partitions (as special cases, the ‘finest’ partition divides the n hypotheses into n groups of one hypothesis each,...
The Kaczmarz and Gauss--Seidel methods aim to solve an $m \times n$ linear system $X{\beta} = {y}$ by iteratively refining the solution estimate; the former uses random rows of $X$ to update ${\beta}$ given the corresponding equations, while the latter uses random columns of $X$ to update the corresponding coordinates in ${\beta}$. Recent work analyzed these algorithms in a parallel comparison for overcomplete and undercomplete systems, showing convergence to the ordinary least squares (OLS) solution and the minimum Euclidean norm solution, respectively. This paper considers the natural...
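Complementing the row-based Kaczmarz sketch above, here is the column-based randomized Gauss--Seidel update: sample a column, then exactly minimize the residual over that single coordinate. Again a mechanics-only sketch; the comparative convergence theory is the subject of the paper.

```python
import numpy as np

def randomized_gauss_seidel(X, y, n_iter=5000, seed=0):
    """Randomized Gauss-Seidel: sample column j with probability ~ ||X_j||^2,
    update coordinate beta_j by exact minimization of ||y - X beta||^2.
    For overdetermined systems this converges to the least-squares solution."""
    rng = np.random.default_rng(seed)
    m, n = X.shape
    col_norms2 = np.einsum('ij,ij->j', X, X)
    probs = col_norms2 / col_norms2.sum()
    beta = np.zeros(n)
    r = y.astype(float).copy()         # maintain residual r = y - X beta
    for _ in range(n_iter):
        j = rng.choice(n, p=probs)
        delta = X[:, j] @ r / col_norms2[j]
        beta[j] += delta
        r -= delta * X[:, j]
    return beta
```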
There is a significant literature on methods for incorporating prior knowledge into multiple testing procedures so as to improve their power and precision. Some common forms of prior knowledge include (a) beliefs about which hypotheses are null, modeled by nonuniform weights; (b) differing importances of hypotheses, modeled by differing penalties for false discoveries; (c) multiple arbitrary partitions of the hypotheses into (possibly overlapping) groups; and (d) knowledge of independence, positive dependence or arbitrary dependence between hypotheses or groups, suggesting the use of more aggressive or conservative...
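As one simple instance of form (a), the classical weighted Benjamini-Hochberg procedure applies ordinary BH to the reweighted p-values $p_i / w_i$; this is a standard baseline, not the unified p-filter framework developed in the paper itself.

```python
import numpy as np

def weighted_bh(pvals, weights, alpha=0.1):
    """Weighted BH: hypotheses believed non-null get weight > 1. Assumes the
    weights average to one; returns indices of rejected hypotheses."""
    p = np.asarray(pvals) / np.asarray(weights)
    n = len(p)
    order = np.argsort(p)
    ok = p[order] <= alpha * np.arange(1, n + 1) / n   # BH step-up condition
    if not ok.any():
        return np.array([], dtype=int)
    k = np.max(np.nonzero(ok)[0]) + 1                  # largest passing index
    return np.sort(order[:k])
```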