Bicocca Open Archive

Improving the power of hypothesis tests in sparse contingency tables

Nicolussi F.; Cazzaro M.; Rudas T.

2024

Abstract

Sparse data are common in the analysis of contingency tables, particularly when the sample size is small relative to the number of cells. Most analyses of this kind are interpreted in an exploratory manner, and even when tests are performed, little attention is paid to statistical power. This paper proposes a method we call the redundant procedure, which is based on the union–intersection principle and increases test power by focusing on specific components of the hypothesis. The method is particularly helpful when the hypothesis to be tested can be expressed as the intersection of simpler models, at least some of which pertain to smaller marginal tables. Working on these marginals means working on tables that are naturally denser. One advantage of the method is that it applies directly to (chain) graphical models. We illustrate the proposal through simulations and suggest strategies to increase the power of tests in sparse tables. Finally, we demonstrate an application to the EU-SILC dataset.
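
The remark that marginal tables are naturally denser can be illustrated with a small sketch. The counts below are invented for illustration and are not taken from the paper, and the helper names `marginal_ab` and `summarize` are hypothetical. The sketch only shows that collapsing a three-way table over one variable keeps the sample size while reducing the number of cells; it does not implement the redundant procedure itself.

```python
# Hypothetical 3 x 3 x 4 table of counts (illustrative numbers, not from the paper).
# counts[i][j][k] is the number of observations with A=i, B=j, C=k.
counts = [
    [[1, 0, 0, 2], [0, 1, 0, 0], [0, 0, 3, 0]],
    [[0, 2, 0, 0], [1, 0, 0, 1], [0, 0, 0, 2]],
    [[0, 0, 1, 0], [2, 0, 0, 0], [0, 1, 0, 1]],
]


def marginal_ab(table):
    """Collapse the table over the third variable, giving the A x B marginal."""
    return [[sum(cell) for cell in row] for row in table]


def summarize(cells):
    """Return (number of cells, mean count per cell, share of empty cells)."""
    cells = list(cells)
    n_cells = len(cells)
    mean_count = sum(cells) / n_cells
    empty_share = sum(c == 0 for c in cells) / n_cells
    return n_cells, mean_count, empty_share


# Flatten the full table and its A x B marginal into lists of cell counts.
full_cells = [c for row in counts for cell in row for c in cell]
ab_cells = [c for row in marginal_ab(counts) for c in row]

full_summary = summarize(full_cells)  # 36 cells share the 18 observations
ab_summary = summarize(ab_cells)      # 9 cells share the same 18 observations

print("full table:", full_summary)
print("A x B marginal:", ab_summary)
```

In this toy table the same 18 observations are spread over 36 cells in the full table but over only 9 cells in the marginal, so the mean count per cell rises from 0.5 to 2.0 and no marginal cell is empty.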

Subtype: Journal article - Scientific article
Keywords: Categorical variables; Graphical model; MC simulation; Redundant test; Union intersection principle
Content language: English
Ahead-of-print date or first online publication date: 3 August 2023
Publication date: 2024
Journal: STATISTICAL PAPERS
Volume: 65
Issue: 3
First page: 1841
Last page: 1867
Article DOI: https://dx.doi.org/10.1007/s00362-023-01473-6
Fulltext: open
Citation: Nicolussi, F., Cazzaro, M., Rudas, T. (2024). Improving the power of hypothesis tests in sparse contingency tables. STATISTICAL PAPERS, 65(3), 1841-1867 [10.1007/s00362-023-01473-6].
Appears in types: 01 - Journal article

Files in this item:

File	Access	Attachment type	License	Size	Format
10281-450999_VoR.pdf	Open access	Publisher’s Version (Version of Record, VoR)	Creative Commons	598.69 kB	Adobe PDF

Documents in IRIS are protected by copyright and all rights are reserved, unless otherwise indicated.

Use this identifier to cite or link to this document: https://hdl.handle.net/10281/450999
