Jump to content
 







Main menu
   


Navigation  



Main page
Contents
Current events
Random article
About Wikipedia
Contact us
Donate
 




Contribute  



Help
Learn to edit
Community portal
Recent changes
Upload file
 








Search  

































Create account

Log in
 









Create account
 Log in
 




Pages for logged out editors learn more  



Contributions
Talk
 



















Contents

   



(Top)
 


1 History  





2 In medicine  





3 Test calibration  





4 Ambiguity  





5 See also  





6 References  





7 External links  














Gold standard (test)






العربية
Català
Deutsch
Español
Français
Galego

Italiano
עברית
Nederlands
Português
Русский

Türkçe
 

Edit links
 









Article
Talk
 

















Read
Edit
View history
 








Tools
   


Actions  



Read
Edit
View history
 




General  



What links here
Related changes
Upload file
Special pages
Permanent link
Page information
Cite this page
Get shortened URL
Download QR code
Wikidata item
 




Print/export  



Download as PDF
Printable version
 
















Appearance
   

 






From Wikipedia, the free encyclopedia
 


Inmedicine and medical statistics, the gold standard, criterion standard,[1]orreference standard[2] is the diagnostic testorbenchmark that is the best available under reasonable conditions.[3] It is the test against which new tests are compared to gauge their validity, and it is used to evaluate the efficacy of treatments.[1]

The meanings may differ between practical medicine and the statistical ideal because, in medicine with some conditions, only an autopsy guarantees diagnostic certainty, thus the gold standard test would be the best one that keeps the patient alive instead of the autopsy. In these cases, even so-called "gold standard" tests require follow-up to confirm or refute the diagnosis.[4]

History[edit]

The term 'gold standard' in its current sense in medical research was coined by Rudd in 1979, in reference to the monetary gold standard.[5]

In medicine[edit]

"Gold standard" can refer to the criteria by which scientific evidence is evaluated. For example, in resuscitation research, the "gold standard" test of a medication or procedure is whether or not it leads to an increase in the number of neurologically intact survivors that walk out of the hospital.[6] Other types of medical research might regard a significant decrease in 30-day mortality as the gold standard.[citation needed]

The AMA Style Guide has preferred the phrase criterion standard instead of "gold standard." Other journals have also issued mandates in their instructions for contributors. For instance, the Archives of Biological Medicine and Rehabilitation specifies this usage.[7] In practice, however, the uptake of this term by authors, as well as enforcement by editorial staff, is notably poor, at least for AMA journals.[8]

When the criterion is a whole clinical testing procedure it is usually referred to as clinical case definition. Differing case definitions can produce wildly different results when used as the basis for evalulating a given diagnostic method.[9]

A hypothetical ideal "gold standard" test has a sensitivity of 100% concerning the presence of the disease (it identifies all individuals with a well-defined disease process; it does not have any false-negative results) and a specificity of 100% (it does not falsely identify someone with a condition that does not have the condition; it does not have any false-positive results). In practice, there are no true gold standard tests.[10]

As new diagnostic methods become available, the "gold standard" test may change over time. For instance, for the diagnosis of aortic dissection, the gold standard test used to be the aortogram, which had a sensitivity as low as 83% and a specificity as low as 87%. Since the advancements of magnetic resonance imaging, the magnetic resonance angiogram (MRA) has become the new gold standard test for aortic dissection, with a sensitivity of 95% and a specificity of 92%.[citation needed] Before the widespread acceptance of any new test, the former test retains its status as the "gold standard".

Test calibration[edit]

Because tests can be incorrect (yielding a false-negative or a false-positive), results should be interpreted in the context of the history, physical findings, and other test results of the individual being tested. It is within this context that the sensitivity and specificity of the "gold standard" test is determined.[citation needed]

When the gold standard is not a perfect one, its sensitivity and specificity must be calibrated against more accurate tests or against the definition of the condition.[11] This calibration is especially important when a perfect test is available only by autopsy. It is important to emphasize that a test has to meet some interobserver agreement, to avoid some bias induced by the study itself.[12]

Calibration errors can lead to misdiagnosis.[13][dubiousdiscuss]

Ambiguity[edit]

Sometimes "gold standard test" refers to the best-performing test available. In these cases, there is no other criterion against which it can be compared and it is equivalent to a definition. When referring to this meaning, gold standard tests are normally not performed at all. This is because the gold standard test may be difficult to perform or may be impossible to perform on a living person (i.e. the test is performed as part of an autopsy or may take too long for the results of the test to be available to be clinically useful).

Other times, the "gold standard" does not refer to the best-performing test available, but the best available under reasonable conditions. For example, in this sense, an MRI is the gold standard for brain tumor diagnosis, though it is not as good as a biopsy. In this case, the sensitivity and specificity of the gold standard are not 100% and it is said to be an "imperfect gold standard" or "alloyed gold standard".[11]

The term ground truth refers to the underlying absolute state of information; the gold standard strives to represent the ground truth as closely as possible. While the gold standard is the best effort to obtain the truth, ground truth is typically collected by direct observations. Inmachine learning and information retrieval, "ground truth" is the preferred term even when classifications may be imperfect; the gold standard is assumed to be the ground truth.[citation needed]

Some authors use the term "golden standard". Claassen argues this usage is incorrect, as "golden standard" implies a level of perfection that is unattainable in medical science.[5]

See also[edit]

References[edit]

  • ^ Gold, R; Reichman, M; Greenberg, E; Ivanidze, J; Elias, E; Tsiouris, AJ; Comunale, JP; Johnson, CE; Sanelli, PC (September 2010). "Developing a new reference standard: is validation necessary?". Academic Radiology. 17 (9): 1079–82. doi:10.1016/j.acra.2010.05.021. PMC 2919497. PMID 20692619.
  • ^ Versi E (July 1992). ""Gold standard" is an appropriate term". BMJ. 305 (6846): 187. doi:10.1136/bmj.305.6846.187-b. PMC 1883235. PMID 1515860.
  • ^ Fardy, John M.; Barrett, Brendan J. (2015). "Evaluation of Diagnostic Tests". Clinical Epidemiology (PDF). Methods in Molecular Biology. Vol. 1281. pp. 289–300. doi:10.1007/978-1-4939-2428-8_17. ISBN 978-1-4939-2427-1. PMID 25694317.
  • ^ a b Claassen, JA (24 December 2005). "['Gold standard', not 'golden standard']". Nederlands Tijdschrift voor Geneeskunde. 149 (52): 2937. PMID 16402524.
  • ^ ACLS: Principles and Practice. p. 62. Dallas: American Heart Association, 2003. ISBN 0-87493-341-2.
  • ^ "Guide for Authors". Archives of biological Medicine and Rehabilitation. Elsevier.
  • ^ "Criterion Standard - AMA Style Insider". 21 June 2011. Retrieved 2021-05-18.
  • ^ Bachmann, Lucas M; Jüni, Peter; Reichenbach, Stephan; Ziswiler, Hans-Rudolf; Kessels, Alfons G; Vögelin, Esther (1 August 2005). "Consequences of different diagnostic 'gold standards' in test accuracy research: Carpal Tunnel Syndrome as an example". International Journal of Epidemiology. 34 (4): 953–955. doi:10.1093/ije/dyi105. PMID 15911545.
  • ^ Troy LM, Michels KB, Hunter DJ, Spiegelman D, Manson JE, Colditz GA, et al. (February 1996). "Self-reported birthweight and history of having been breastfed among younger women: an assessment of validity". International Journal of Epidemiology. 25 (1): 122–127. doi:10.1093/ije/25.1.122. PMID 8666479.
  • ^ a b Spiegelman D, Schneeweiss S, McDermott A (January 1997). "Measurement error correction for logistic regression models with an "alloyed gold standard"". American Journal of Epidemiology. 145 (2): 184–196. doi:10.1093/oxfordjournals.aje.a009089. PMID 9006315.
  • ^ Stein PD, Athanasoulis C, Alavi A, Greenspan RH, Hales CA, Saltzman HA, et al. (February 1992). "Complications and validity of pulmonary angiography in acute pulmonary embolism". Circulation. 85 (2): 462–468. doi:10.1161/01.CIR.85.2.462. PMID 1735144.
  • ^ Gallaher MP, Mobley LR, Klee GG, Schryver P (April 2004). The Impact of Calibration Error in Medical Decision Making (PDF) (Report). Washington (DC): National Institute of Standards and Technology.
  • External links[edit]


    Retrieved from "https://en.wikipedia.org/w/index.php?title=Gold_standard_(test)&oldid=1216621340"

    Categories: 
    Epidemiology
    Medical tests
    Hidden categories: 
    Articles with short description
    Short description matches Wikidata
    All articles with unsourced statements
    Articles with unsourced statements from May 2023
    Articles with unsourced statements from December 2023
    All accuracy disputes
    Articles with disputed statements from December 2023
    Articles to be expanded from December 2023
    Articles with unsourced statements from February 2016
     



    This page was last edited on 1 April 2024, at 02:42 (UTC).

    Text is available under the Creative Commons Attribution-ShareAlike License 4.0; additional terms may apply. By using this site, you agree to the Terms of Use and Privacy Policy. Wikipedia® is a registered trademark of the Wikimedia Foundation, Inc., a non-profit organization.



    Privacy policy

    About Wikipedia

    Disclaimers

    Contact Wikipedia

    Code of Conduct

    Developers

    Statistics

    Cookie statement

    Mobile view



    Wikimedia Foundation
    Powered by MediaWiki