NFDI4DS | UHH-SEMS - Publication Details

Antonio Punzo

ORCID: 0000-0001-7742-1821

Publications

Citations

Views

---

Saved

---

About

Contact & Profiles

A5009939605

Research Areas

Bayesian Methods and Mixture Models
Statistical Methods and Bayesian Inference
Statistical Methods and Inference
Advanced Statistical Methods and Models
Statistical Distribution Estimation and Applications
Financial Risk and Volatility Modeling
Advanced Clustering Algorithms Research
Complex Systems and Time Series Analysis
Advanced Statistical Modeling Techniques
Insurance, Mortality, Demography, Risk Management
Psychometric Methodologies and Testing
Sensory Analysis and Statistical Methods
Neural Networks and Applications
Data Management and Algorithms
Fault Detection and Control Systems
Urban, Neighborhood, and Segregation Studies
Integrated Energy Systems Optimization
Renewable Energy and Sustainability
Forecasting Techniques and Applications
Housing Market and Economics
Market Dynamics and Volatility
Insurance and Financial Risk Management
Statistical Methods in Clinical Trials
VLSI and Analog Circuit Testing
Algorithms and Data Compression

University of Catania
2016-2025

Heidelberg (Poland)
2018-2022

University of Hohenheim
2017-2022

University of Bologna
2020-2021

University of Milano-Bicocca
2018-2019

McMaster University
2018

Albany Research Institute
2018

University of Southampton
2015

Libera Università Maria SS. Assunta
2015

Statistical Research (United States)
2015

Parsimonious mixtures of multivariate contaminated normal distributions

OPENALEX - Publications

Antonio Punzo Paul D. McNicholas

A mixture of multivariate contaminated normal distributions is developed for model-based clustering. In addition to the parameters classical mixture, our has, each cluster, a parameter controlling proportion mild outliers and one specifying degree contamination. Crucially, these do not have be specified priori, adding flexibility approach. Parsimony introduced via eigen-decomposition component covariance matrices, sufficient conditions identifiability all members resulting family are...

10.1002/bimj.201500144 article EN Biometrical Journal 2016-08-11

Model-based clustering via linear cluster-weighted models

OPENALEX - Publications

Salvatore Ingrassia Simona C. Minotti Antonio Punzo

10.1016/j.csda.2013.02.012 article EN Computational Statistics & Data Analysis 2013-02-19

Robust Clustering in Regression Analysis via the Contaminated Gaussian Cluster-Weighted Model

OPENALEX - Publications

Antonio Punzo Paul D. McNicholas

10.1007/s00357-017-9234-x article EN Journal of Classification 2017-06-20

The Generalized Linear Mixed Cluster-Weighted Model

OPENALEX - Publications

Salvatore Ingrassia Antonio Punzo Giorgio Vittadini Simona C. Minotti

10.1007/s00357-015-9175-1 article EN Journal of Classification 2015-04-01

Compound unimodal distributions for insurance losses

OPENALEX - Publications

Antonio Punzo Luca Bagnato Antonello Maruotti

10.1016/j.insmatheco.2017.10.007 article EN Insurance Mathematics and Economics 2017-11-01

Multivariate Response and Parsimony for Gaussian Cluster-Weighted Models

OPENALEX - Publications

Utkarsh J. Dang Antonio Punzo Paul D. McNicholas Salvatore Ingrassia Ryan P. Browne

10.1007/s00357-017-9221-2 article EN Journal of Classification 2017-03-15

Finite mixtures of unimodal beta and gamma densities and the $$k$$ -bumps algorithm

OPENALEX - Publications

Luca Bagnato Antonio Punzo

10.1007/s00180-012-0367-4 article EN Computational Statistics 2012-10-06

A new look at the inverse Gaussian distribution with applications to insurance and economic data

OPENALEX - Publications

Antonio Punzo

Insurance and economic data are often positive, we need to take into account this peculiarity in choosing a statistical model for their distribution. An example is the inverse Gaussian (IG), which one of most famous considered distributions with positive support. With aim increasing use IG distribution on insurance data, propose convenient mode-based parameterization yielding reparametrized (rIG) distribution; it allows/simplifies various branches statistics, give some examples. In...

10.1080/02664763.2018.1542668 article EN Journal of Applied Statistics 2018-11-03

Cluster-weighted $$t$$ t -factor analyzers for robust model-based clustering and dimension reduction

OPENALEX - Publications

Sanjeena Subedi Antonio Punzo Salvatore Ingrassia Paul D. McNicholas

10.1007/s10260-015-0298-7 article EN Statistical Methods & Applications 2015-02-28

The multivariate leptokurtic‐normal distribution and its application in model‐based clustering

OPENALEX - Publications

Luca Bagnato Antonio Punzo Maria Grazia Zoia

Abstract This article proposes the elliptical multivariate leptokurtic‐normal (MLN) distribution to fit data with excess kurtosis. The MLN is a Gram–Charlier expansion of normal (MN) and has closed‐form representation characterized by one additional parameter denoting It obtained from MN distribution, reshaping its generating variate associated orthogonal polynomials. strength this approach for obtaining lies in general applicability as it can be applied any law get suitable data. Maximum...

10.1002/cjs.11308 article EN Canadian Journal of Statistics 2016-11-23

Fitting insurance and economic data with outliers: a flexible approach based on finite mixtures of contaminated gamma distributions

OPENALEX - Publications

Antonio Punzo Angelo Mazza Antonello Maruotti

Insurance and economic data are frequently characterized by positivity, skewness, leptokurtosis, multi-modality; although many parametric models have been used in the literature, often these peculiarities call for more flexible approaches. Here, we propose a finite mixture of contaminated gamma distributions that provides better characterization data. It is placed between non-parametric density estimation strikes balance alternatives, as large class densities can be implemented. We adopt...

10.1080/02664763.2018.1428288 article EN Journal of Applied Statistics 2018-01-28

flexCWM: A Flexible Framework for Cluster-Weighted Models

OPENALEX - Publications

Angelo Mazza Antonio Punzo Salvatore Ingrassia

Cluster-weighted models (CWMs) are mixtures of regression with random covariates. However, besides having recently become rather popular in statistics and data mining, there is still a lack support for CWMs within the most statistical suites. In this paper, we introduce flexCWM, an R package specifically conceived fitting CWMs. The supports modeling conditioned response variable by means common distributions exponential family t distribution. Covariates allowed to be mixed-type parsimonious...

10.18637/jss.v086.i02 article EN cc-by Journal of Statistical Software 2018-01-01

On the Number of Components for Matrix‐Variate Mixtures: A Comparison Among Information Criteria

OPENALEX - Publications

Salvatore D. Tomarchio Antonio Punzo

Summary This study explores the crucial task of determining optimal number components in mixture models, known as order, when considering matrix‐variate data. Despite growing interest this data type among practitioners and researchers, effectiveness information criteria selecting order remains largely unexplored branch literature. Although Bayesian criterion (BIC) is commonly utilised, its only marginally tested context, several other potentially valuable exist. An extensive simulation...

10.1111/insr.12607 article EN cc-by International Statistical Review 2025-01-08

A contaminated regression model for count health data

OPENALEX - Publications

A. F. Otto Johan Ferreira Salvatore D. Tomarchio Andriëtte Bekker Antonio Punzo

In medical and health research, investigators are often interested in countable quantities such as hospital length of stay (e.g., days) or the number doctor visits. Poisson regression is commonly used to model count data, but this approach can’t accommodate overdispersion—when variance exceeds mean. To address issue, negative binomial (NB) distribution (NB-D) and, by extension, NB provide a well-documented alternative. However, real-data applications present additional challenges that must...

10.1177/09622802241307613 article EN Statistical Methods in Medical Research 2025-01-19

An EM algorithm for fitting matrix-variate normal distributions on interval-censored and missing data

OPENALEX - Publications

Víctor H. Lachos Salvatore D. Tomarchio Antonio Punzo Salvatore Ingrassia

10.1007/s11222-025-10575-0 article EN cc-by Statistics and Computing 2025-01-28

Finite mixtures of multivariate skew tail-inflated normal distributions

OPENALEX - Publications

Salvatore D. Tomarchio Luca Bagnato Antonio Punzo

10.1080/00949655.2025.2479063 article EN Journal of Statistical Computation and Simulation 2025-03-13

Clustering and classification via cluster-weighted factor analyzers

OPENALEX - Publications

Sanjeena Subedi Antonio Punzo Salvatore Ingrassia Paul D. McNicholas

10.1007/s11634-013-0124-8 article EN Advances in Data Analysis and Classification 2013-03-01

Flexible mixture modelling with the polynomial Gaussian cluster-weighted model

OPENALEX - Publications

Antonio Punzo

In the context of mixture models with random covariates, this article presents polynomial Gaussian cluster-weighted model (CWM). It extends linear CWM, for bivariate data, in a twofold way. First, it allows possible nonlinear dependencies components by considering regression. Second, is not restricted to be used model-based clustering only being contextualized most general classification framework. Maximum likelihood parameter estimates are derived using EM algorithm and selection carried...

10.1177/1471082x13503455 article EN Statistical Modelling 2014-05-14

Clustering Multivariate Longitudinal Observations: The Contaminated Gaussian Hidden Markov Model

OPENALEX - Publications

Antonio Punzo Antonello Maruotti

The Gaussian hidden Markov model (HMM) is widely considered for the analysis of heterogenous continuous multivariate longitudinal data. To robustify this approach with respect to possible elliptical heavy-tailed departures from normality, due presence outliers, spurious points, or noise (collectively referred as bad points herein), contaminated HMM here introduced. distribution represents an generalization and allows automatic detection in same natural way observations are typically assigned...

10.1080/10618600.2015.1089776 article EN Journal of Computational and Graphical Statistics 2015-09-30

KernSmoothIRT: AnRPackage for Kernel Smoothing in Item Response Theory

OPENALEX - Publications

Angelo Mazza Antonio Punzo Brian E. McGuire

Item response theory (IRT) models are a class of statistical used to describe the behaviors individuals set items having certain number options. They adopted by researchers in social science, particularly analysis performance or attitudinal data, psychology, education, medicine, marketing and other fields where aim is measure latent constructs. Most IRT analyses use parametric that rely on assumptions often not satisfied. In such cases, nonparametric approach might be preferable;...

10.18637/jss.v058.i06 article EN cc-by Journal of Statistical Software 2014-01-01

ContaminatedMixt: An R Package for Fitting Parsimonious Mixtures of Multivariate Contaminated Normal Distributions

OPENALEX - Publications

Antonio Punzo Angelo Mazza Paul D. McNicholas

We introduce the R package ContaminatedMixt, conceived to disseminate use of mixtures multivariate contaminated normal distributions as a tool for robust clustering and classification under common assumption elliptically contoured groups. Thirteen variants model are also implemented parsimony. The expectationconditional maximization algorithm is adopted obtain maximum likelihood parameter estimates, likelihood-based selection criteria used select number Parallel computation can be on...

10.18637/jss.v085.i10 article EN cc-by Journal of Statistical Software 2018-01-01

Dichotomous unimodal compound models: application to the distribution of insurance losses

OPENALEX - Publications

Salvatore D. Tomarchio Antonio Punzo

A correct modelization of the insurance losses distribution is crucial in industry. This generally highly positively skewed, unimodal hump-shaped, and with a heavy right tail. Compound models are profitable way to accommodate situations which some probability masses shifted tails distribution. Therefore, this work, general approach compound hump-shaped distributions mixing dichotomous introduced. 2-parameter distribution, defined on positive support, considered reparametrized respect mode...

10.1080/02664763.2020.1789076 article EN Journal of Applied Statistics 2020-07-07

Mixtures of multivariate contaminated normal regression models

OPENALEX - Publications

Angelo Mazza Antonio Punzo

10.1007/s00362-017-0964-y article EN Statistical Papers 2017-11-13

Model-based time-varying clustering of multivariate longitudinal data with covariates and outliers

OPENALEX - Publications

Antonello Maruotti Antonio Punzo

10.1016/j.csda.2016.05.024 article EN Computational Statistics & Data Analysis 2016-06-07

Coming Soon ...