Arnak S. Dalalyan

ORCID: 0000-0003-0054-3227
Publications
Citations
Views
---
Saved
---
About
Contact & Profiles
Research Areas
  • Statistical Methods and Inference
  • Advanced Statistical Methods and Models
  • Sparse and Compressive Sensing Techniques
  • Markov Chains and Monte Carlo Methods
  • Bayesian Methods and Mixture Models
  • Stochastic processes and financial applications
  • Machine Learning and Algorithms
  • Advanced Neuroimaging Techniques and Applications
  • Financial Risk and Volatility Modeling
  • Statistical Methods and Bayesian Inference
  • Advanced Image and Video Retrieval Techniques
  • Advanced Statistical Process Monitoring
  • Advanced Bandit Algorithms Research
  • Control Systems and Identification
  • Robotics and Sensor-Based Localization
  • Advanced MRI Techniques and Applications
  • Gaussian Processes and Bayesian Inference
  • Risk and Portfolio Optimization
  • Distributed Sensor Networks and Detection Algorithms
  • Numerical methods in inverse problems
  • Image and Signal Denoising Methods
  • Advanced Vision and Imaging
  • Stochastic processes and statistical mechanics
  • Generative Adversarial Networks and Image Synthesis
  • Medical Imaging Techniques and Applications

École Nationale de la Statistique et de l'Administration Économique
2012-2024

Centre de Recherche en Économie et Statistique
2012-2023

Centre for Research in Engineering Surface Technology
2012-2023

Center for Responsible Travel
2022-2023

Groupe des Écoles Nationales d'Économie et Statistique
2023

École Nationale d'Administration
2019-2020

Université Paris Nanterre
2019

Université Paris-Saclay
2017-2018

ParisTech
2014-2017

Paris-Est Sup
2016

Summary Sampling from various kinds of distribution is an issue paramount importance in statistics since it often the key ingredient for constructing estimators, test procedures or confidence intervals. In many situations, exact sampling a given impossible computationally expensive and, therefore, one needs to resort approximate strategies. However, there no well-developed theory providing meaningful non-asymptotic guarantees procedures, especially high dimensional problems. The paper makes...

10.1111/rssb.12183 article EN Journal of the Royal Statistical Society Series B (Statistical Methodology) 2016-04-23

10.1016/j.spa.2019.02.016 article EN publisher-specific-oa Stochastic Processes and their Applications 2019-03-05

Although the Lasso has been extensively studied, relationship between its prediction performance and correlations of covariates is not fully understood. In this paper, we give new insights into in context multiple linear regression. We show, particular, that incorporation a simple correlation measure tuning parameter can lead to nearly optimal even for highly correlated covariates. However, also reveal moderately covariates, be mediocre irrespective choice parameter. finally show our results...

10.3150/15-bej756 article EN other-oa Bernoulli 2016-09-27

In recent years, overcomplete dictionaries combined with sparse learning techniques became extremely popular in computer vision. While their usefulness is undeniable, the improvement they provide specific tasks of vision still poorly understood. The aim present work to demonstrate that for task image denoising, nearly state-of-the-art results can be achieved using orthogonal only, provided are learned directly from noisy image. To this end, we introduce three patchbased denoising algorithms...

10.5244/c.25.25 preprint EN 2011-01-01

10.1016/j.jcss.2011.12.023 article EN Journal of Computer and System Sciences 2012-01-18

Langevin diffusion processes and their discretizations are often used for sampling from a target density. The most convenient framework assessing the quality of such scheme corresponds to smooth strongly log-concave densities defined on $\mathbb{R}^{p}$. present work focuses this studies behavior Monte Carlo algorithm based kinetic diffusion. We first prove geometric mixing property with rate that is optimal in terms its dependence condition number. then use result obtaining improved...

10.3150/19-bej1178 article EN Bernoulli 2020-04-27

We address the issue of variable selection in regression model with very high ambient dimension, that is, when number variables is large. The main focus on situation where relevant variables, called intrinsic much smaller than dimension $d$. Without assuming any parametric form underlying function, we get tight conditions making it possible to consistently estimate set variables. These relate and sample size. procedure provably consistent under these based comparing quadratic functionals...

10.1214/12-aos1046 article EN other-oa The Annals of Statistics 2012-10-01

In this paper, we revisit the recently established theoretical guarantees for convergence of Langevin Monte Carlo algorithm sampling from a smooth and (strongly) log-concave density. We improve existing results when is measured in Wasserstein distance provide further insights on very tight relations between, one hand, and, other gradient descent optimization. Finally, also establish version that based noisy evaluations gradient.

10.48550/arxiv.1704.04752 preprint EN other-oa arXiv (Cornell University) 2017-01-01

We consider the problem of combining a (possibly uncountably infinite) set affine estimators in nonparametric regression model with heteroscedastic Gaussian noise. Focusing on exponentially weighted aggregate, we prove PAC-Bayesian type inequality that leads to sharp oracle inequalities discrete but also continuous settings. The framework is general enough cover combinations various procedures such as least square regression, kernel ridge shrinking and many other used literature statistical...

10.1214/12-aos1038 article EN other-oa The Annals of Statistics 2012-08-01

We consider the problem of aggregating elements a possibly infinite dictionary for building decision procedure that aims at minimizing given criterion. Along with dictionary, an independent identically distributed training sample is available, on which performance can be tested. In fairly general set-up, we establish oracle inequality Mirror Averaging aggregate any prior distribution. By choosing appropriate prior, apply this in context prediction under sparsity assumption problems...

10.3150/11-bej361 article EN other-oa Bernoulli 2012-06-28

The statistical problem of estimating the effective dimension-reduction (EDR) subspace in multi-index regression model with deterministic design and additive noise is considered. A new procedure for recovering directions EDR proposed. Under mild assumptions, $\sqrt n$-consistency proposed proved (up to a logarithmic factor) case when structural dimension not larger than 4. empirical behavior algorithm studied through numerical simulations.

10.48550/arxiv.math/0701887 preprint EN other-oa arXiv (Cornell University) 2007-01-01

We consider the problem of testing a particular type composite null hypothesis under nonparametric multivariate regression model. For given quadratic functional $Q$, states that function $f$ satisfies constraint $Q[f]=0$, while alternative corresponds to functions for which $Q[f]$ is bounded away from zero. On one hand, we provide minimax rates and exact separation constants, along with sharp-optimal procedure, diagonal nonnegative functionals. smoothness classes ellipsoidal form check our...

10.1214/13-ejs766 article EN cc-by Electronic Journal of Statistics 2013-01-01

10.1007/s00440-004-0416-1 article EN Probability Theory and Related Fields 2005-05-03

In this paper, we consider the problem of estimating covariation two diffusion processes when observations are subject to non-synchronicity. Building on recent papers \cite{Hay-Yos03, Hay-Yos04}, derive second-order asymptotic expansions for distribution Hayashi-Yoshida estimator in a fairly general setup including random sampling schemes and non-anticipative drifts. The key steps leading our results decomposition estimator's Gaussian set-up, stochastic itself an accurate evaluation...

10.1214/10-aihp383 article EN Annales de l Institut Henri Poincaré Probabilités et Statistiques 2011-06-23

In this paper we revisit the risk bounds of lasso estimator in context transductive and semi-supervised learning. other terms, setting under consideration is that regression with random design partial labeling. The main goal to obtain user-friendly on off-sample prediction risk. To end, simple bounded response variable (high-dimensional) covariates considered. We propose some new adaptations these settings establish oracle inequalities both expectation deviation. These results provide...

10.1214/18-ejs1457 article EN cc-by Electronic Journal of Statistics 2018-01-01

The problem of matching two sets features appears in various tasks computer vision and can be often formalized as a permutation estimation. We address this from statistical point view provide theoretical analysis the accuracy several natural estimators. To end, minimax rate separation is investigated its expression obtained function sample size, noise level dimension. consider cases homoscedastic heteroscedastic establish, each case, tight upper bounds on distance These are shown to...

10.48550/arxiv.1310.4661 preprint EN other-oa arXiv (Cornell University) 2013-01-01
Coming Soon ...