Project Details
Description
A common problem in several disciplines is the missing information. When applying statistical methods to database with multivariate information, it is requires complete information, for instance, GGE models, additive main effects with multiplicative interaction models -AMMIand principal components, so, the value imputation is relevant for any area of knowledge. The literature and the associated statistical software provide several alternatives of imputation, such as the parametric regression, the propensity score method, or the Markov Chain Monte Carlo (MCMC) method (Zhang 2003, Yuan 2011). However, these methodologies require that certain assumptions are met. The assumption in all three methods is that the missing data depend on observed variables, which means that there is a missing at random mechanism (MAR), as defined by Little and Rubin (2002). Also, parametric regression andMCMCdepend on the assumption of multivariate normality. There are other missing value imputation methods that have no structural or distributional assumptions like those using the SVD, one of these methods is the Krzanowski algorithm described below. However, it is not known that a generalization of this method has been developed using regularised singular value decomposition. This project aims to propose a generalisation of the algorithm by means of different regularisation alternatives found in the literature. To achieve this objective, simulation and computational techniques will be explored for the calculation of regularisations, search of real data for a practical application and statistics to evaluate the imputation uncertainty. As an impact of this research will present new statistical methodologies that solve the problem of missing value without distributional or structural assumptions.
Layman's description
A common problem in several disciplines is the missing information. When applying statistical methods to database with multivariate information, it is requires complete information, for instance, GGE models, additive main effects with multiplicative interaction models -AMMIand principal components, so, the value imputation is relevant for any area of knowledge. The literature and the associated statistical software provide several alternatives of imputation, such as the parametric regression, the propensity score method, or the Markov Chain Monte Carlo (MCMC) method (Zhang 2003, Yuan 2011). However, these methodologies require that certain assumptions are met. The assumption in all three methods is that the missing data depend on observed variables, which means that there is a missing at random mechanism (MAR), as defined by Little and Rubin (2002). Also, parametric regression andMCMCdepend on the assumption of multivariate normality. There are other missing value imputation methods that have no structural or distributional assumptions like those using the SVD, one of these methods is the Krzanowski algorithm described below. However, it is not known that a generalization of this method has been developed using regularised singular value decomposition. This project aims to propose a generalisation of the algorithm by means of different regularisation alternatives found in the literature. To achieve this objective, simulation and computational techniques will be explored for the calculation of regularisations, search of real data for a practical application and statistics to evaluate the imputation uncertainty. As an impact of this research will present new statistical methodologies that solve the problem of missing value without distributional or structural assumptions.
Key findings
missing values, singular value decomposition, cross-validation, genotype-by-environment trials
| Status | Finished |
|---|---|
| Effective start/end date | 12/02/21 → 12/08/22 |
Collaborative partners
- Universidad de La Sabana (lead)
- Universidad Javeriana (Executor)
Project Status
- Succesfully closed
Relation Academy- enterprises
- No
Training for research
- Yes
Interdisciplinary
- Yes
Collaborative project between research groups
- Yes
Project with potential for technological development susceptible to intellectual property protection.
- No
Degree work - Master's or Ph
- None
Area of knowledge (OECD)
- MATHEMATICS, STATISTICS AND RELATED
Rol Sabana
- Co- Executor
-
Missing-Value Imputation Using The Robust Singular-Value Decomposition: Proposals And Numerical Evaluation
Duarte Vogel, D. D. (Ponente), Arciniegas Alarcon, S. (Ponente), Krzanowski, W. (Ponente) & García Peña, M. (Ponente), 30 Oct 2021.Research output: Contribution to conference › Poster › peer-review
-
Techniques for Robust Imputation in Incomplete Two-Way Tables
Arciniegas Alarcon, S. (First Author), Rengifo Gutierrez, C. (Third Author), Krzanowski, W. (Fourth Autor) & García Peña, M. (Second Author), 4 Sep 2021, In: Applied System Innovation. p. 1-12 11 p.Research output: Contribution to journal › Article › peer-review
3 Scopus citations