A Bootstrap Procedure in the Context of Correspondence Analysis: Numerical Approach to Measure the Stability of Axes

VALENCIA GARCÍA, OLGA; ÁLVAREZ ESTEBAN, RAMÓN

doi:10.25115/EEA.V39I9.6996

A Bootstrap Procedure in the Context of Correspondence AnalysisNumerical Approach to Measure the Stability of Axes

VALENCIA GARCÍA, OLGA ¹
ÁLVAREZ ESTEBAN, RAMÓN

1 Departamento de Economía Aplicada, Área de Métodos Cuantitativos para la Economía y la Empresa, UNIVERSIDAD DE BURGOS, ESPAÑA.

Revista:

Estudios de economía aplicada

ISSN: 1133-3197, 1697-5731

Año de publicación: 2012

Título del ejemplar: Mercado de trabajo y crisis económica

Volumen: 30

Número: 1

Páginas: 357

Tipo: Artículo

DOI: 10.25115/EEA.V39I9.6996 DIALNET GOOGLE SCHOLAR Dialnet editor

Otras publicaciones en: Estudios de economía aplicada

Resumen

Los procedimientos Bootstrap se utilizan habitualmente para evaluar la estabilidad de los métodos basados en ejes principales. En nuestra investigación, la atención se centra en el caso particular del Análisis de Correspondencias. Dado que en el Análisis de Correspondencias clásico las métricas están determinadas por las frecuencias marginales de las tablas y, por tanto, variarían a través de las tablas replicadas, sugerimos aquí un procedimiento Bootstrap espe-cífico en el que se lleva a cabo un Análisis de Correspondencias generalizado de las tablas replicadas imponiendo como métricas las procedentes de la tabla original. La contribución de nuestro trabajo es proporcionar un procedi-miento numérico para cuantificar la estabilidad de los ejes principales que es adecuado para este contexto. Propone-mos una medida acotada de la estabilidad de los ejes. Además, teniendo en cuenta que cualquier umbral fijo consi-derado para etiquetar un eje como estable o inestable sería arbitrario, nuestro procedimiento se basa en la compara-ción de datos reales y datos permutados aleatoriamente para determinar umbrales de estabilidad. Se presentan los resultados computacionales obtenidos en varios conjuntos de datos.

Referencias bibliográficas

ABASCAL, E. y GRANDE, I. (2005). Análisis de encuestas. Madrid: ESIC. ABDI, H. (2007). “RV coefficient and congruence coefficient”. En Salkind N.J. (ed.): Encyclopedia of Measurement and Statistics (pp. 849-853). Thousand Oaks (CA): Sage.
AL-IBRAHIM, A.H. y AL-KANDARI, N.M. (2008). “Stability of principal components”. Computational Statistics, 23 (1), pp.153-171.
ÁLVAREZ, R.; VALENCIA, O. y BÉCUE-BERTAUT, M. (2010). “Assessing the Stability of Supplementary Elements on Principal Axes Maps Through Bootstrap Resampling. Contribution to Interpretation in Textual Analysis”. En Skiadas,C.H. (ed.): Advances in Data Analysis. Statistics for Industry and Technology (pp. 3-11). Boston: Springer/ Birkhäuser.
BENASSENI, J. (1993). “Perturbational aspects in correspondance analysis”. Computional Statistics & Data Analysis, 15, pp. 393-410.
BENZECRI, J.P. (1973). L’analyse des données. Tome 2: L´Analyse des Correspondances. Paris: Dunod.
BERAN, R.B. y SRIVASTAVA, M.S. (1985). “Bootstrap tests and confidence regions for functions of a covariance matrix”. Annals of Statistics, 13, pp. 95– 115.
CHATEAU F. y LEBART, L. (1996). “Assessing sample variability in the visualization techniques related to principal component analysis: bootstrap and alternative simulation methods”. En Prats, A. (ed.): XII Symposium on Computational Statistics COMPSTAT96 (pp. 205-210). Heidelberg; Physica Verlag.
DAUDIN, J.; BUBY, C. y TRECOURT, P. (1988). “Stability of principal component analysis studied by the bootstrap method”. Statistics, 19 (2), pp. 241- 258.
DEL HOYO, J.; LLORENTE, G. y RIVERO, C. (2011). “Consumo de electricidad y producto interior bruto. Relación dinámica y estabilidad”. Estudios de Economía Aplicada, 29(2), pp. 473-492.
DIACONIS, P. y EFRON, B. (1983). “Computer-intensive methods in statistics”. Scientific American, 248, pp. 96-108.
EFRON, B. (1979). “Bootstrap methods: Another look at the Jackknife”. Annals of Statistics, 7, pp. 1-26.
EFRON, B. y TIBSHIRANI, R. (1993). An introduction to the bootstrap. New York: Chapman & Hall.
ESCOFIER, B. (1984). Analyse factorielle en référence à un modèle. Application à l'analyse de tableaux d'échange. En RR-0337, INRIA00076220, Rennes, 1984.
ESCOFIER, B. (1987). Notice d'utilisation du programme d'analyse factorielle en référence à un modèle. En RR-0616, INRIA 00075938, Rennes, 1987.
ESCOFIER, B. (2003). Analyse des correspondances: Recherches au cœur de l'analyse des données. Rennes: Presses universitaires de Rennes.
ESCOFIER, B. y LE ROUX, B. (1972). “Etude de trois problèmes de stabilité en analyse factorielle”. En Publications de l'Institut de Statistique de Paris, 21 Fasc. (3-4) (parution 1973).
ESCOFIER, B. y PAGÈS, J. (1983). “Méthode pour l’analyse de plusieurs groupes de variables. Application à la caractérisation de vins rouges du Val de Loire”. Révue de Statistique Appliquée, 31(2), pp. 4-59.
ESCOFIER, B. y PAGÈS, J. (1992). Análisis factoriales simples y múltiples. Objetivos, métodos e interpretación. Bilbao: Servicio Editorial Universidad del País Vasco.
ESCOFIER, B. y PAGÈS, J. (1998). Analyse factorielles simples et multiples Paris: Dunod.
FISHER, R. A. (1940). “The precision of discriminant functions” en Annals of Eugenics, 10, pp.422-429. Permanent link to this item: http://hdl.handle.net/2440/15240
GIFI, A. (1981). Nonlinear multivariate analysis. Chichester: Wiley.
GILULA, Z. y HABERMAN, S.J. (1986). “Canonical analysis of contingency tables by maximum likelihood”. Journal of the American statistical association, 81, pp. 780-788.
GOWER, J.C. (1971). “Statistical methods for comparing different multivariate analyses of the same data”. En Hodson, J.R.; Kendall, D.G. y Tautu, P. (eds.): Mathematics in the Archaeological and Historical Sciences (pp. 138-149). Edinburgh: University Press.
GOWER, J.C. (1975). “Procrustes Analysis”. Psichometrika, 40(1), pp. 33-51.
GOWER J.C. y DIKSTERHUIS, G.B. (2004). Procrustes problems. Oxford, New York: Oxford University Press.
GREENACRE, M. (1984). Theory and applications of correspondence analysis. London: Academic Press.
HOLMES, S. (2008). “Multivariate data analysis: The French way”. En Nolan, D. y Speed, T. (eds.): Probability and Statistics: Essays in Honor of David A. Freedman (pp.219-233). Beachwood, Ohio, USA: Institute of Mathematical Statistics.
JACKSON, D.A. (1993). “Stopping rules in principal component analysis. A comparison of heuristical and statistical approaches”. Ecology, 74 (8), pp. 2204-2214.
KRZANOWSKI, W.J. (1999). Principles of multivariate analysis. A user’s perspective. Belfast: Oxford Statistical Science Series.
LAMBERT, Z.V.; WILDT, A.R. y DURAND, R.M. (1990). “Assessing sampling variation relative to number-factors criteria”. Educational and Psychological Measurement, 50, pp. 33-48.
LEBART, L. (1976). “The significance of eigenvalues issued form correspondence analysis”. Proceedings in Computational Statistics. COMPSTAT (pp. 38-45). Vienna: Physica Verlag.
LEBART, L. (2004a). “Validité des visualisations de données textuelles. Le poids des mots”. En Purnelle, G.; Fairon, C. y Dister, A. (eds.): Proceedings of the 7th International Conference on Textual Data Statistical Analysis, JADT 2004 (pp. 708-715). Belgique: Presses Universitaires de Louvain.
LEBART, L. (2004b). “Validation techniques in text mining (with applications to the processing of open-ended questions)”. En Sirmakessis. S. (ed): Text mining and its applications (pp. 169-178). Berlin, Heidelberg: Springer.
LEBART, L. (2006). “Validation techniques in multiple correspondence analysis”. En Greenacre, M.J. y Blasius, J. (eds.): Multiple Correspondence Analysis and Related Methods (pp.179-195). London: Chapman and Hall.
LEBART, L.; MORINEAU, A. y PIRON, M. (2000). Statistique exploratoire multidimensionnelle (3ª ed.). Paris: Dunod.
LEBART, L.; MORINEAU, A. y WARWICK, K. (1984). Multivariate descriptive statistical analysis. New York: Wiley.
LINTING, M.; MEULMAN, J.J.; GROENEN, P.J.F. y VAN DER KOOJJ, A.J. (2007). “Stability of Nonlinear Principal Components Analysis: An empirical study using the balanced bootstrap”. Psychological Methods, 12(3), pp. 359-379.
MARKUS, M. (1994). Bootstrap confidence regions in nonlinear multivariate analysis. Leiden: DSWO Press.
MIGUEL, J.A. y OLAVE, P. (2002). “Avances recientes en métodos Bootstrap para procesos ARCH. Una aplicación en el mercado español de valores”. Estudios de Economía Aplicada, 20(2), pp. 487-498.
MILAN, L. y WHITTAKER, J. (1995). “Application of the parametric bootstrap to models that incorporate a singular value decomposition”. Applied Statistics, 44(1), pp. 31-49.
O’NEILL, M.E. (1978). “Asymptotic distributions of the canonical correlations from contingency tables”. Australian Journal of Statistics, 20(1), pp. 75-82.
PERES-NETO, P.R.; JACKSON, D, A. y SOMERS, K.M. (2005). “How many principal components? Stopping rules for determining the number of non- trivial axes revisited”. Computational Statistics and Data Analysis, 49(4), pp. 974-997.
REICZIGEL, J. (1996). “Bootstrap tests in correspondence analysis”. Applied Stochastic Models and Data Analysis, 12, pp. 107-117.
RINGROSE, T.J. (1992). “Bootstrapping and correspondence analysis in Archaeology”. Journal of Archaeological Science, 19, pp. 615-629.
RODRIGUEZ, O.M. (2005). “El crédito commercial en las pymes canarias desde una perspectiva multivariante”. Estudios de Economía Aplicada, 23 (3), pp. 773-816.
SCHÖNEMANN, P.H. y CARROL, R.M. (1970). “Fitting one matrix to another under choice of a central dilation and a rigid motion”. Psychometrika, 35(2), pp. 245-255.
STAUFFER, D.F.; GARTON, E.O. y STEINHORST, R.K. (1985). “A comparison of principal components from real and random data”. Ecology, 66(6), pp. 1693-1698.
TAN, Q.; BRUSGAARD, K.; KRUSE, T.A.; OAKELEY, E.; HEMMINGS, B.; BECK-NIELSEN, H.; HANSEN, L. y GASTER. M. (2004) “Correspondence analysis of microarray time-course data in case-control design”. Journal of Biomedical Informatics, 37, pp. 358-365.
TEN BERGE, J.M.F. (2006). “The rigid orthogonal procrustes rotation problem”. Psychometrika, 71(1), pp. 201-205.
TEN BERGE, J.M.F. y BEKKER, P.A. (1993). “The isotropic scaling problem in Procrustes Analys”. Computational. Statististics & Data Analysis, 16, pp. 201-204.
TENENHAUS, M. y YOUNG, F.W. (1985). “An analysis and synthesis of multiple correspondence analysis, optimal scaling, dual scaling, homogeneity analysis and other methods for quantifying categorical multivariate data”. Psychometrika, 50(1), pp. 91-119.
TOCHER, F. (1908). “Pigmentation survey of school children in Scotland”. Biometrika, 6, pp. 30-235.
VALENCIA, O. (2006). Estudio de la estabilidad de los métodos factoriales mediante procedimientos de remuestreo: Aplicación al Análisis de Correspondencias de tablas léxicas. León: Universidad de León, Secretariado de Publicaciones.

Fuente de los datos: Dialnet