Jump to content
 







Main menu
   


Navigation  



Main page
Contents
Current events
Random article
About Wikipedia
Contact us
Donate
 




Contribute  



Help
Learn to edit
Community portal
Recent changes
Upload file
 








Search  

































Create account

Log in
 









Create account
 Log in
 




Pages for logged out editors learn more  



Contributions
Talk
 



















Contents

   



(Top)
 


1 Introduction  





2 Finite case  





3 Infinite case  





4 See also  





5 References  














Šidák correction for t-test







Add links
 









Article
Talk
 

















Read
Edit
View history
 








Tools
   


Actions  



Read
Edit
View history
 




General  



What links here
Related changes
Upload file
Special pages
Permanent link
Page information
Cite this page
Get shortened URL
Download QR code
Wikidata item
 




Print/export  



Download as PDF
Printable version
 
















Appearance
   

 






From Wikipedia, the free encyclopedia
 


One of the application of Student's t-test is to test the location of one sequence of independent and identically distributed random variables. If we want to test the locations of multiple sequences of such variables, Šidák correction should be applied in order to calibrate the level of the Student's t-test. Moreover, if we want to test the locations of nearly infinitely many sequences of variables, then Šidák correction should be used, but with caution. More specifically, the validity of Šidák correction depends on how fast the number of sequences goes to infinity.

Introduction[edit]

Suppose we are interested in m different hypotheses, , and would like to check if all of them are true. Now the hypothesis test scheme becomes

: all of are true;
: at least one of is false.

Let be the level of this test (the type-I error), that is, the probability that we falsely reject when it is true.

We aim to design a test with certain level .

Suppose when testing each hypothesis , the test statistic we use is .

If these 's are independent, then a test for can be developed by the following procedure, known as Šidák correction.

Step 1, we test each of m null hypotheses at level .
Step 2, if any of these m null hypotheses is rejected, we reject .

Finite case[edit]

For finitely many t-tests, suppose where for each i, are independently and identically distributed, for each j are independent but not necessarily identically distributed, and has finite fourth moment.

Our goal is to design a test for with level α. This test can be based on the t-statistic of each sequences, that is,

where:

Using Šidák correction, we reject if any of the t-tests based on the t-statistics above reject at level More specifically, we reject when

where

The test defined above has asymptotic level α, because

Infinite case[edit]

In some cases, the number of sequences, , increase as the data size of each sequences, , increase. In particular, suppose . If this is true, then we will need to test a null including infinitely many hypotheses, that is

To design a test, Šidák correction may be applied, as in the case of finitely many t-test. However, when , the Šidák correction for t-test may not achieve the level we want, that is, the true level of the test may not converges to the nominal level as n goes to infinity. This result is related to high-dimensional statistics and is proven by Fan, Hall & Yao (2007).[1] Specifically, if we want the true level of the test converges to the nominal level , then we need a restraint on how fast . Indeed,

The results above are based on Central Limit Theorem. According to Central Limit Theorem, each of our t-statistics possesses asymptotic standard normal distribution, and so the difference between the distribution of each and the standard normal distribution is asymptotically negligible. The question is, if we aggregate all the differences between the distribution of each and the standard normal distribution, is this aggregation of differences still asymptotically ignorable?

When we have finitely many , the answer is yes. But when we have infinitely many , the answer some time becomes no. This is because in the latter case we are summing up infinitely many infinitesimal terms. If the number of the terms goes to infinity too fast, that is, too fast, then the sum may not be zero, the distribution of the t-statistics can not be approximated by the standard normal distribution, the true level does not converges to the nominal level , and then the Šidák correction fails.

See also[edit]

References[edit]

  1. ^ Fan, Jianqing; Hall, Peter; Yao, Qiwei (2007). "To How Many Simultaneous Hypothesis Tests Can Normal, Student's t or Bootstrap Calibration Be Applied". Journal of the American Statistical Association. 102 (480): 1282–1288. arXiv:math/0701003. doi:10.1198/016214507000000969. S2CID 8622675.

Retrieved from "https://en.wikipedia.org/w/index.php?title=Šidák_correction_for_t-test&oldid=1149093047"

Category: 
Multiple comparisons
Hidden categories: 
Articles with short description
Short description matches Wikidata
 



This page was last edited on 10 April 2023, at 04:31 (UTC).

Text is available under the Creative Commons Attribution-ShareAlike License 4.0; additional terms may apply. By using this site, you agree to the Terms of Use and Privacy Policy. Wikipedia® is a registered trademark of the Wikimedia Foundation, Inc., a non-profit organization.



Privacy policy

About Wikipedia

Disclaimers

Contact Wikipedia

Code of Conduct

Developers

Statistics

Cookie statement

Mobile view



Wikimedia Foundation
Powered by MediaWiki