Matías Salibián‐Barrera

ORCID: 0000-0003-1873-4611
Publications
Citations
Views
---
Saved
---
About
Contact & Profiles
Research Areas
  • Advanced Statistical Methods and Models
  • Statistical Methods and Inference
  • Advanced Statistical Process Monitoring
  • Fault Detection and Control Systems
  • Bayesian Methods and Mixture Models
  • Spectroscopy and Chemometric Analyses
  • Control Systems and Identification
  • Statistical Distribution Estimation and Applications
  • Advanced Clustering Algorithms Research
  • Fuzzy Systems and Optimization
  • Neural Networks and Applications
  • Face and Expression Recognition
  • Statistical Methods and Bayesian Inference
  • Geochemistry and Geologic Mapping
  • Optimal Experimental Design Methods
  • Probabilistic and Robust Engineering Design
  • Radiative Heat Transfer Studies
  • Rough Sets and Fuzzy Logic
  • Gene expression and cancer classification
  • Transportation Systems and Logistics
  • Sparse and Compressive Sensing Techniques
  • Radar Systems and Signal Processing
  • Genetic and Environmental Crop Studies
  • Advanced Image Processing Techniques
  • Privacy-Preserving Technologies in Data

University of British Columbia
2011-2023

Statistics Canada
2018-2023

McGill University
2022-2023

University Medical Center Freiburg
2022

Universidad Torcuato Di Tella
2019

Natural Sciences and Engineering Research Council
2006-2011

Simon Fraser University
2011

Decision Sciences (United States)
2010

University of Buenos Aires
2006-2008

Ghent University
2006-2008

AbstractEquivariant high-breakdown point regression estimates are computationally expensive, and the corresponding algorithms become unfeasible for moderately large number of regressors. One important advance to improve computational speed one such estimator is fast-LTS algorithm. This article proposes an analogous algorithm computing S-estimates. The new algorithm, that we call "fast-S", also based on a "local improvement" step resampling initial candidates. allows substantial reduction...

10.1198/106186006x113629 article EN Journal of Computational and Graphical Statistics 2006-05-15

We introduce a new computer-intensive method to estimate the distribution of robust regression estimates. The basic idea behind our is bootstrap reweighted representation To obtain that asymptotically correct, we include auxiliary scale in Our computationally simple because for each sample only have solve linear system equations. weights use are decreasing functions absolute value residuals and hence outlying observations receive small weights. This results resistant presence outliers data....

10.1214/aos/1021379865 article EN The Annals of Statistics 2002-04-01

We consider robust principal components analysis (PCA) based on multivariate MM estimators. first study the robustness and efficiency of these estimators, particularly in terms eigenvalues eigenvectors. then focus inference procedures a fast bootstrap for This method is an alternative to approach asymptotic distribution estimators can also be used assess stability components. A formal consistency proof given, its finite-sample performance investigated through simulations. illustrate use PCA...

10.1198/016214506000000096 article EN Journal of the American Statistical Association 2006-09-01

10.1007/s10260-007-0048-6 article EN Statistical Methods & Applications 2007-02-22

Witten and Tibshirani (2010) proposed an algorithim to simultaneously find clusters select clustering variables, called sparse K-means (SK-means). SK-means is particularly useful when the dataset has a large fraction of noise variables (that is, without information separate clusters). works very well on clean complete data but cannot handle outliers nor missing data. To remedy these problems we introduce new robust algorithm implemented in R package RSKC. We demonstrate use our four...

10.18637/jss.v072.i05 article EN cc-by Journal of Statistical Software 2016-01-01

Abstract We study the empirical size and power of some recently proposed tests for multivariate normality (MVN) compare them with existing proposals that performed best in previously published studies. show Royston's [Royston, J.P., 1983b, Some techniques assessing based on Shapiro-Wilk W. Applied Statistics, 32, 121–133.] extension to Shapiro Wilk [Shapiro, S.S., Wilk, M.B., 1965, An analysis variance test (complete samples). Biometrika, 52, 591–611.] is unable achieve nominal significance...

10.1080/10629360600878449 article EN Journal of Statistical Computation and Simulation 2007-11-22

Principal component analysis is a widely used technique that provides an optimal lower-dimensional approximation to multivariate or functional datasets. These approximations can be very useful in identifying potential outliers among high-dimensional observations. In this article, we propose new class of estimators for principal components based on robust scale estimators. For fixed dimension q, robustly estimate the q-dimensional linear space best prediction data, sense minimizing sum...

10.1080/01621459.2014.946991 article EN Journal of the American Statistical Association 2014-08-05

10.1016/j.csda.2008.05.007 article EN Computational Statistics & Data Analysis 2008-05-21

Yohai and Zamar's τ-estimators of regression have excellent statistical properties but are nevertheless rarely used in practice because a lack available software the general impression that difficult to approximate. We will show, however, computational difficulties approximating similar nature those more popular S-estimators. The main goal this article is compare an algorithm for based on random resampling with some alternative heuristic search algorithms. show former not only simpler, when...

10.1198/106186008x343785 article EN Journal of Computational and Graphical Statistics 2008-09-01

Abstract We are interested in a class of unsupervised methods to detect possible disease outbreaks, that is, rapid increases the number cases particular deviate from pattern observed past. The motivating application for this article deals with detecting outbreaks using generalized additive models (GAMs) model weekly counts certain infectious diseases. can use distance between predicted and specific week determine whether an important departure has occurred. Unfortunately, approach may not...

10.1198/jasa.2011.tm09654 article EN Journal of the American Statistical Association 2011-06-01

In large-scale quantitative proteomic studies, scientists measure the abundance of thousands proteins from human proteome in search novel biomarkers for a given disease. Penalized regression estimators can be used to identify potential among large set molecular features measured. Yet, performance and statistical properties these depend on loss penalty functions define them. Motivated by real plasma study, we propose new class penalized robust based elastic net penalty, which tuned keep...

10.1214/19-aoas1269 article EN The Annals of Applied Statistics 2019-11-28

10.1016/j.csda.2020.107065 article EN Computational Statistics & Data Analysis 2020-08-18

This paper is about S-estimation for penalized regression splines.Penalized splines are one of the currently most used methods smoothing noisy data.The estimation method fitting such a spline model mostly based on least squares methods, which known to be sensitive outlying observations.In real world applications, outliers quite commonly observed.There several robust taking observations into account.We define and study S-estimators models.Hereby we replace by suitable method.By keeping...

10.1198/jcgs.2010.08149 article EN Journal of Computational and Graphical Statistics 2010-01-01

Summary Preferential sampling in geostatistics occurs when the locations at which observations are made may depend on spatial process that underlines correlation structure of measurements. We show previously proposed Monte Carlo estimates for likelihood function not be approximating desired function. Furthermore, we argue that, preferential moderate complexity, alternative and widely available numerical methods to approximate produce better results than methods. illustrate our findings...

10.1111/rssc.12286 article EN Journal of the Royal Statistical Society Series C (Applied Statistics) 2018-04-27

10.1016/j.spl.2006.01.008 article EN Statistics & Probability Letters 2006-02-21

In many situations where the interest lies in identifying clusters one might expect that not all available variables carry information about these groups. Furthermore, data quality (e.g. outliers or missing entries) present a serious and sometimes hard-to-assess problem for large complex datasets. this paper we show small proportion of atypical observations have adverse effects on solutions found by sparse clustering algorithm Witten Tibshirani (2010). We propose robustification their...

10.48550/arxiv.1201.6082 preprint EN other-oa arXiv (Cornell University) 2012-01-01

10.1016/j.aeue.2016.07.001 article EN AEU - International Journal of Electronics and Communications 2016-07-14
Coming Soon ...