Tukey's range test






From Wikipedia, the free encyclopedia
 


Tukey's range test, also known as Tukey's test, Tukey method, Tukey's honest significance test, or Tukey's HSD (honestly significant difference) test,[1] is a single-step multiple comparison procedure and statistical test. It can be used to correctly interpret the statistical significance of the difference between means that have been selected for comparison because of their extreme values.

The method was initially developed and introduced by John Tukey for use in analysis of variance (ANOVA), and has usually been taught only in connection with ANOVA. However, the studentized range distribution used to determine the significance of the differences considered in Tukey's test has much broader application. It is useful for researchers who have searched their collected data for remarkable differences between groups, but who then cannot validly assess the significance of the stand-out difference they found using the standard distributions of conventional statistical tests, which assume that the data being compared were selected at random. Stand-out data are by definition not selected at random: they are chosen specifically because they are extreme. They therefore need a different, stricter interpretation, which is provided by the likely frequency and size of the studentized range. The modern practice of "data mining" is an example where it is used.
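The effect of hand-picking the most extreme pair can be illustrated with a small simulation. The following is a minimal sketch, not part of Tukey's procedure: it assumes five groups of ten standard-normal observations with no true differences, and it applies an ordinary two-sample t criterion (the hard-coded value 2.0141 is the two-sided 5% critical value of Student's t with 45 degrees of freedom) to the largest and smallest group means. The function name and all parameter values are illustrative assumptions.

```python
import math
import random

def extreme_pair_false_positive_rate(k=5, n=10, trials=2000, t_crit=2.0141, seed=0):
    """Share of null datasets where the hand-picked extreme pair of group
    means exceeds an ordinary (unadjusted) two-sample t critical value."""
    rng = random.Random(seed)
    hits = 0
    for _ in range(trials):
        # All groups come from the same population, so every "finding" is false.
        groups = [[rng.gauss(0.0, 1.0) for _ in range(n)] for _ in range(k)]
        means = [sum(g) / n for g in groups]
        # Pooled within-group variance (equal group sizes, N - k degrees of freedom).
        pooled_var = sum(
            sum((x - m) ** 2 for x in g) for g, m in zip(groups, means)
        ) / (k * (n - 1))
        # Unadjusted t statistic for the largest versus the smallest mean.
        t_stat = (max(means) - min(means)) / math.sqrt(pooled_var * 2 / n)
        if t_stat > t_crit:
            hits += 1
    return hits / trials

rate = extreme_pair_false_positive_rate()
print(f"false-positive rate for the extreme pair: {rate:.3f}")
```

Under these assumptions the printed rate should come out well above the nominal 5%, which is the gap the studentized range distribution is meant to close.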

Development

The test is named after John Tukey.[2] It compares all possible pairs of means and is based on a studentized range distribution (q), which is similar to the distribution of t from the t-test (see below).[3][full citation needed]

Tukey's test compares the mean of every treatment to the mean of every other treatment; that is, it applies simultaneously to the set of all pairwise comparisons

$$\mu_i - \mu_j$$

and identifies any difference between two means that is greater than the expected standard error. The confidence coefficient for the set, when all sample sizes are equal, is exactly 1 − α for any 0 ≤ α ≤ 1. For unequal sample sizes, the confidence coefficient is greater than 1 − α. In other words, the Tukey method is conservative when there are unequal sample sizes.
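As a small illustration of what "all pairwise comparisons" means in practice, the sketch below enumerates the unordered pairs for a set of treatment labels. The labels and the helper name are hypothetical; for k treatments there are k(k − 1)/2 such pairs.

```python
from itertools import combinations

def pairwise_comparisons(labels):
    """Return every unordered pair of treatment labels."""
    return list(combinations(labels, 2))

pairs = pairwise_comparisons(["A", "B", "C", "D"])
print(len(pairs), pairs)
```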

This test is often followed by the Compact Letter Display (CLD) statistical procedure to render the output of this test more transparent to non-statistician audiences.

Assumptions

  1. The observations being tested are independent within and among the groups.[citation needed]
  2. The subgroups associated with each mean in the test are normally distributed.[citation needed]
  3. There is equal within-subgroup variance across the subgroups associated with each mean in the test (homogeneity of variance).[citation needed]

The test statistic

Tukey's test is based on a formula very similar to that of the t-test. In fact, Tukey's test is essentially a t-test, except that it corrects for family-wise error rate.

The formula for Tukey's test is

$$q_s = \frac{\left| Y_A - Y_B \right|}{SE},$$

where YA and YB are the two means being compared, and SE is the standard error of a group mean, SE = S/√n, computed from the pooled within-group variance S² and the common group size n. The value qs is the sample's test statistic. (The notation |x| means the absolute value of x: the magnitude of x with the sign set to +, regardless of the original sign of x.)

This qs test statistic can then be compared to a q value for the chosen significance level α from a table of the studentized range distribution. If the qs value is larger than the critical value qα obtained from the distribution, the two means are said to be significantly different at level α.[3]
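The comparison described above can be sketched in a few lines. This is an illustrative sketch only: it assumes equal group sizes, takes SE as the square root of the pooled within-group variance divided by the group size (the convention used by standard studentized range tables), and uses a critical value read from a published table (about 4.34 for α = 0.05, k = 3 and 6 degrees of freedom). The data and the function name are made up for the example.

```python
import math
from itertools import combinations

def tukey_q_statistics(groups):
    """Return {(a, b): q_s} for equally sized groups given as {label: values}."""
    sizes = {len(values) for values in groups.values()}
    if len(sizes) != 1:
        raise ValueError("this sketch assumes equal group sizes")
    n = sizes.pop()
    k = len(groups)
    means = {name: sum(values) / n for name, values in groups.items()}
    # Pooled within-group variance with N - k = k * (n - 1) degrees of freedom.
    pooled_var = sum(
        sum((x - means[name]) ** 2 for x in values)
        for name, values in groups.items()
    ) / (k * (n - 1))
    se = math.sqrt(pooled_var / n)  # standard error of one group mean
    return {(a, b): abs(means[a] - means[b]) / se for a, b in combinations(groups, 2)}

data = {"A": [4.0, 5.0, 6.0], "B": [5.0, 6.0, 7.0], "C": [9.0, 10.0, 11.0]}
q_crit = 4.34  # tabulated critical value for alpha = 0.05, k = 3, df = 6

q_values = tukey_q_statistics(data)
for pair, q_s in q_values.items():
    verdict = "significant" if q_s > q_crit else "not significant"
    print(pair, round(q_s, 3), verdict)
```

With these made-up data, the pairs involving group C exceed the critical value while the pair (A, B) does not.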

Since the null hypothesis for Tukey's test states that all means being compared are from the same population (i.e. μ1 = μ2 = μ3 = ... = μk), the sample means should be normally distributed (according to the central limit theorem) with the same model standard deviation σ, which is estimated by the pooled standard error for all the samples; its calculation is discussed in the following sections. This gives rise to the normality assumption of Tukey's test.

The studentized range (q) distribution

The Tukey method uses the studentized range distribution. Suppose that we take a sample of size n from each of k populations with the same normal distribution N(μ, σ²), that ȳ_min is the smallest of these sample means and ȳ_max is the largest, and that S² is the pooled sample variance from these samples. Then the following random variable has a studentized range distribution:

$$q = \frac{\bar{y}_{\max} - \bar{y}_{\min}}{S / \sqrt{n}}$$

The critical value qα discussed below is derived from this definition of q and depends on three factors:

the Type I error rate α, or the probability of rejecting a true null hypothesis;
the number k of sub-populations being compared;
the number of degrees of freedom of the pooled variance estimate (df = N − k, where N is the total number of observations).

The distribution of q has been tabulated and appears in many textbooks on statistics. In some tables the distribution has been tabulated without the √2 factor, that is, for q/√2 rather than q. To determine which convention a table uses, compute the critical value for k = 2 and compare it to the two-sided critical value of Student's t-distribution with the same degrees of freedom and the same α: for k = 2 the studentized range q = (ȳ_max − ȳ_min)/(S/√n) equals √2·|t|, so the tabulated value matches either the t value or √2 times it. In addition, R offers a cumulative distribution function (ptukey) and a quantile function (qtukey) for q.
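When neither a table nor a routine such as R's qtukey is at hand, a critical value can be approximated by simulation. The sketch below is a rough Monte Carlo estimate, not a replacement for a proper quantile function: it assumes the convention q = range of the sample means divided by S/√n, draws k groups of n standard-normal observations many times, and returns the empirical (1 − α) quantile of the simulated q values. The function name, trial count and seed are arbitrary choices.

```python
import math
import random

def simulate_q_critical(k, n, alpha=0.05, trials=20000, seed=1):
    """Monte Carlo estimate of the upper-alpha critical value of the
    studentized range for k groups of n observations (df = k * (n - 1))."""
    rng = random.Random(seed)
    q_values = []
    for _ in range(trials):
        groups = [[rng.gauss(0.0, 1.0) for _ in range(n)] for _ in range(k)]
        means = [sum(g) / n for g in groups]
        # Pooled sample variance across the k groups.
        pooled_var = sum(
            sum((x - m) ** 2 for x in g) for g, m in zip(groups, means)
        ) / (k * (n - 1))
        q_values.append((max(means) - min(means)) / math.sqrt(pooled_var / n))
    q_values.sort()
    index = min(int((1 - alpha) * trials), trials - 1)
    return q_values[index]

q_est = simulate_q_critical(k=3, n=5)
print(f"simulated critical value for k = 3, df = 12: {q_est:.2f}")
```

Published tables give about 3.77 for α = 0.05, k = 3 and 12 degrees of freedom, so the simulated value should land close to that figure, up to Monte Carlo noise.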

Confidence limits

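The Tukey confidence limits for all pairwise comparisons with confidence coefficient of at least 1 − α are

$$\bar{y}_{i\bullet} - \bar{y}_{j\bullet} \pm \frac{q_{\alpha;k;N-k}}{\sqrt{2}}\,\hat{\sigma}_\varepsilon \sqrt{\frac{2}{n}}, \qquad i, j = 1, \ldots, k, \quad i \neq j,$$

where q_{α;k;N−k} is the critical value of the studentized range distribution for k groups and N − k degrees of freedom, σ̂_ε is the estimated standard deviation, and n is the common group size.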

Notice that the point estimator and the estimated variance are the same as those for a single pairwise comparison. The only difference between the confidence limits for simultaneous comparisons and those for a single comparison is the multiple of the estimated standard deviation.
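For equal group sizes, the half-width of a Tukey interval is (q/√2)·σ̂·√(2/n), where q is the studentized range critical value, σ̂ the estimated standard deviation of the whole design and n the common group size. A minimal sketch follows; the function name and the numbers are hypothetical, and the critical value is assumed to come from a table that uses the convention q = range/(S/√n).

```python
import math

def tukey_interval(mean_i, mean_j, q_crit, sigma_hat, n):
    """Simultaneous confidence limits for mean_i - mean_j with equal group size n."""
    half_width = (q_crit / math.sqrt(2)) * sigma_hat * math.sqrt(2 / n)
    diff = mean_i - mean_j
    return diff - half_width, diff + half_width

# Illustrative values: q_crit is the tabulated value for alpha = 0.05, k = 3, df = 6.
lower, upper = tukey_interval(10.0, 5.0, q_crit=4.34, sigma_hat=1.0, n=3)
print(f"({lower:.3f}, {upper:.3f})")
```

An interval that excludes zero corresponds to a pair of means declared significantly different at the chosen level.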

Also note that the sample sizes must be equal when using the studentized range approach. The estimate σ̂_ε is the standard deviation of the entire design, not just that of the two groups being compared. It is nevertheless possible to work with unequal sample sizes. In this case, one has to calculate the estimated standard deviation for each pairwise comparison, as formalized by Clyde Kramer in 1956; the procedure for unequal sample sizes is therefore sometimes referred to as the Tukey–Kramer method, which is as follows:

$$\bar{y}_{i\bullet} - \bar{y}_{j\bullet} \pm \frac{q_{\alpha;k;N-k}}{\sqrt{2}}\,\hat{\sigma}_\varepsilon \sqrt{\frac{1}{n_i} + \frac{1}{n_j}},$$

where n_i and n_j are the sizes of groups i and j respectively. The degrees of freedom for the whole design are also applied.
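The Tukey–Kramer adjustment replaces √(2/n) with √(1/n_i + 1/n_j) in the half-width of each interval. The sketch below shows that substitution only; the function name and inputs are hypothetical, and for simplicity the same illustrative critical value is reused in both calls, whereas in a real analysis it would be looked up for the design's own k and N − k.

```python
import math

def tukey_kramer_interval(mean_i, mean_j, n_i, n_j, q_crit, sigma_hat):
    """Simultaneous confidence limits for mean_i - mean_j with group sizes n_i, n_j."""
    half_width = (q_crit / math.sqrt(2)) * sigma_hat * math.sqrt(1 / n_i + 1 / n_j)
    diff = mean_i - mean_j
    return diff - half_width, diff + half_width

# Same illustrative q_crit in both calls, purely to isolate the effect of group sizes.
balanced = tukey_kramer_interval(10.0, 5.0, 3, 3, q_crit=4.34, sigma_hat=1.0)
unbalanced = tukey_kramer_interval(10.0, 5.0, 2, 4, q_crit=4.34, sigma_hat=1.0)
print(balanced, unbalanced)
```

With equal group sizes the expression reduces to the equal-sample-size formula; splitting the same six observations as 2 and 4 instead of 3 and 3 widens the interval.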

Comparing ANOVA and Tukey–Kramer tests

Both ANOVA and Tukey–Kramer tests are based on the same assumptions. However, when applied to the hypothesis that k group means are equal (i.e. μ1 = μ2 = ... = μk), the two tests may lead to logical contradictions when k > 2, even if the assumptions do hold.

It is possible to generate a set of pseudorandom samples of strictly positive measure such that the hypothesis μ1 = μ2 is rejected at a given significance level while the hypothesis μ1 = μ2 = μ3 is not rejected even at a less stringent level.[4]

References

  1. Lowry, Richard. "One-way ANOVA – independent samples". Vassar.edu. Archived from the original on 17 October 2008. Retrieved 4 December 2008. Also occasionally described as "honestly", see e.g. Morrison, S.; Sosnoff, J.J.; Heffernan, K.S.; Jae, S.Y.; Fernhall, B. (2013). "Aging, hypertension and physiological tremor: The contribution of the cardioballistic impulse to tremorgenesis in older adults". Journal of the Neurological Sciences. 326 (1–2): 68–74. doi:10.1016/j.jns.2013.01.016. PMID 23385002.
  2. Tukey, John (1949). "Comparing individual means in the Analysis of Variance". Biometrics. 5 (2): 99–114. doi:10.2307/3001913. JSTOR 3001913. PMID 18151955.
  3. Linton, L.R.; Harder, L.D. (2007). Lecture notes (Report). Biology 315: Quantitative biology. Calgary, AB: University of Calgary.
  4. Gurvich, V.; Naumova, M. (2021). "Logical contradictions in the one-way ANOVA and Tukey–Kramer multiple comparisons tests with more than two groups of observations". Symmetry. 13 (8): 1387. arXiv:2104.07552. Bibcode:2021Symm...13.1387G. doi:10.3390/sym13081387.
    Retrieved from "https://en.wikipedia.org/w/index.php?title=Tukey%27s_range_test&oldid=1219537666"

    This page was last edited on 18 April 2024, at 10:31 (UTC).
