Exact statistical tests for heterogeneity of frequencies based on extreme values

Chih-Chieh Wu, Roger C. Grimson, Sanjay Shete

Research output: Contribution to journal › Article

2 Citations (Scopus)

Abstract

Sophisticated statistical analyses of incidence frequencies are often required for various epidemiologic and biomedical applications. Among the most commonly applied methods is the Pearson's χ² test, which is structured to detect nonspecific anomalous patterns of frequencies and is useful for testing the significance of incidence heterogeneity. However, the Pearson's χ² test is not efficient for assessing whether the frequency in a particular cell (or class) can be attributed to chance alone. We recently developed statistical tests for detecting temporal anomalies of disease cases based on maximum and minimum frequencies; these tests are designed to test the significance of a particular high or low frequency. The purpose of this article is to demonstrate the merits of these tests in epidemiologic and biomedical studies. We show that our proposed methods are more sensitive and powerful for testing extreme cell counts than is the Pearson's χ² test. This feature could provide important and valuable information in epidemiologic or biomedical studies. We elucidated and illustrated the differences in sensitivity among our tests and the Pearson's χ² test by analyzing a data set of Langerhans cell histiocytosis cases and its hypothetical sets. We also computed and compared the statistical power of these methods using various sets of cell numbers and alternative frequencies. The investigation of statistical sensitivity and power presented in this work will provide investigators with useful guidelines for selecting the appropriate tests for their studies.
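
The contrast drawn in the abstract, an omnibus test of heterogeneity versus a test aimed at one extreme cell, can be illustrated with a minimal sketch. The article's tests are exact; the code below is not the authors' method and instead approximates both null distributions by Monte Carlo simulation under the assumption that every cell is equally likely. The function names and the counts are hypothetical (they are not the Langerhans cell histiocytosis data).

```python
import random


def pearson_chi2_stat(counts):
    """Pearson's chi-square statistic against equal expected frequencies."""
    n = sum(counts)
    expected = n / len(counts)
    return sum((c - expected) ** 2 / expected for c in counts)


def simulate_counts(n, k, rng):
    """Drop n cases into k equally likely cells and return the cell counts."""
    counts = [0] * k
    for _ in range(n):
        counts[rng.randrange(k)] += 1
    return counts


def monte_carlo_p_values(observed, n_sim=5000, seed=0):
    """Estimate p-values for the chi-square statistic and the maximum count.

    Assumption: the null hypothesis is a uniform multinomial with the same
    total and the same number of cells as the observed table.
    """
    rng = random.Random(seed)
    n, k = sum(observed), len(observed)
    obs_chi2 = pearson_chi2_stat(observed)
    obs_max = max(observed)
    hits_chi2 = hits_max = 0
    for _ in range(n_sim):
        sim = simulate_counts(n, k, rng)
        # Small tolerance guards against floating-point ties.
        if pearson_chi2_stat(sim) >= obs_chi2 - 1e-12:
            hits_chi2 += 1
        if max(sim) >= obs_max:
            hits_max += 1
    # The add-one correction keeps both estimates strictly positive.
    return {
        "chi2": (hits_chi2 + 1) / (n_sim + 1),
        "max": (hits_max + 1) / (n_sim + 1),
    }


# Hypothetical monthly case counts with one unusually high cell.
hypothetical_counts = [3, 2, 4, 3, 2, 3, 2, 3, 4, 2, 3, 9]
p_values = monte_carlo_p_values(hypothetical_counts)
print("chi-square p ~", round(p_values["chi2"], 4))
print("maximum-count p ~", round(p_values["max"], 4))
```

Comparing the two estimated p-values for tables with a single extreme cell gives a rough feel for the difference in sensitivity the abstract describes; a minimum-count version would replace `max` with `min` and reverse the inequality.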

Original language: English
Pages (from-to): 612-623
Number of pages: 12
Journal: Communications in Statistics: Simulation and Computation
Volume: 39
Issue number: 3
DOI: 10.1080/03610910903528335
Publication status: Published - 2010 Mar 1

Fingerprint

Exact Test
Statistical tests
Extreme Values
Statistical test
Testing
Cell
Incidence
Statistical Power
Biomedical Applications
Anomalous
Anomaly
Low Frequency
Count
Extremes
Alternatives

All Science Journal Classification (ASJC) codes

  • Statistics and Probability
  • Modelling and Simulation

Cite this

@article{591204c431264022bcbb8e9fafb84624,
title = "Exact statistical tests for heterogeneity of frequencies based on extreme values",
abstract = "Sophisticated statistical analyses of incidence frequencies are often required for various epidemiologic and biomedical applications. Among the most commonly applied methods is the Pearson's $\chi^2$ test, which is structured to detect nonspecific anomalous patterns of frequencies and is useful for testing the significance of incidence heterogeneity. However, the Pearson's $\chi^2$ test is not efficient for assessing whether the frequency in a particular cell (or class) can be attributed to chance alone. We recently developed statistical tests for detecting temporal anomalies of disease cases based on maximum and minimum frequencies; these tests are designed to test the significance of a particular high or low frequency. The purpose of this article is to demonstrate the merits of these tests in epidemiologic and biomedical studies. We show that our proposed methods are more sensitive and powerful for testing extreme cell counts than is the Pearson's $\chi^2$ test. This feature could provide important and valuable information in epidemiologic or biomedical studies. We elucidated and illustrated the differences in sensitivity among our tests and the Pearson's $\chi^2$ test by analyzing a data set of Langerhans cell histiocytosis cases and its hypothetical sets. We also computed and compared the statistical power of these methods using various sets of cell numbers and alternative frequencies. The investigation of statistical sensitivity and power presented in this work will provide investigators with useful guidelines for selecting the appropriate tests for their studies.",
author = "Chih-Chieh Wu and Grimson, {Roger C.} and Sanjay Shete",
year = "2010",
month = "3",
day = "1",
doi = "10.1080/03610910903528335",
language = "English",
volume = "39",
pages = "612--623",
journal = "Communications in Statistics Part B: Simulation and Computation",
issn = "0361-0918",
publisher = "Taylor and Francis Ltd.",
number = "3",

}

Exact statistical tests for heterogeneity of frequencies based on extreme values. / Wu, Chih-Chieh; Grimson, Roger C.; Shete, Sanjay.

In: Communications in Statistics: Simulation and Computation, Vol. 39, No. 3, 01.03.2010, p. 612-623.


TY - JOUR

T1 - Exact statistical tests for heterogeneity of frequencies based on extreme values

AU - Wu, Chih-Chieh

AU - Grimson, Roger C.

AU - Shete, Sanjay

PY - 2010/3/1

Y1 - 2010/3/1

N2 - Sophisticated statistical analyses of incidence frequencies are often required for various epidemiologic and biomedical applications. Among the most commonly applied methods is the Pearson's χ² test, which is structured to detect nonspecific anomalous patterns of frequencies and is useful for testing the significance of incidence heterogeneity. However, the Pearson's χ² test is not efficient for assessing whether the frequency in a particular cell (or class) can be attributed to chance alone. We recently developed statistical tests for detecting temporal anomalies of disease cases based on maximum and minimum frequencies; these tests are designed to test the significance of a particular high or low frequency. The purpose of this article is to demonstrate the merits of these tests in epidemiologic and biomedical studies. We show that our proposed methods are more sensitive and powerful for testing extreme cell counts than is the Pearson's χ² test. This feature could provide important and valuable information in epidemiologic or biomedical studies. We elucidated and illustrated the differences in sensitivity among our tests and the Pearson's χ² test by analyzing a data set of Langerhans cell histiocytosis cases and its hypothetical sets. We also computed and compared the statistical power of these methods using various sets of cell numbers and alternative frequencies. The investigation of statistical sensitivity and power presented in this work will provide investigators with useful guidelines for selecting the appropriate tests for their studies.

AB - Sophisticated statistical analyses of incidence frequencies are often required for various epidemiologic and biomedical applications. Among the most commonly applied methods is the Pearson's χ² test, which is structured to detect nonspecific anomalous patterns of frequencies and is useful for testing the significance of incidence heterogeneity. However, the Pearson's χ² test is not efficient for assessing whether the frequency in a particular cell (or class) can be attributed to chance alone. We recently developed statistical tests for detecting temporal anomalies of disease cases based on maximum and minimum frequencies; these tests are designed to test the significance of a particular high or low frequency. The purpose of this article is to demonstrate the merits of these tests in epidemiologic and biomedical studies. We show that our proposed methods are more sensitive and powerful for testing extreme cell counts than is the Pearson's χ² test. This feature could provide important and valuable information in epidemiologic or biomedical studies. We elucidated and illustrated the differences in sensitivity among our tests and the Pearson's χ² test by analyzing a data set of Langerhans cell histiocytosis cases and its hypothetical sets. We also computed and compared the statistical power of these methods using various sets of cell numbers and alternative frequencies. The investigation of statistical sensitivity and power presented in this work will provide investigators with useful guidelines for selecting the appropriate tests for their studies.

UR - http://www.scopus.com/inward/record.url?scp=77349126328&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=77349126328&partnerID=8YFLogxK

U2 - 10.1080/03610910903528335

DO - 10.1080/03610910903528335

M3 - Article

VL - 39

SP - 612

EP - 623

JO - Communications in Statistics Part B: Simulation and Computation

JF - Communications in Statistics Part B: Simulation and Computation

SN - 0361-0918

IS - 3

ER -