Jump to content
 







Main menu
   


Navigation  



Main page
Contents
Current events
Random article
About Wikipedia
Contact us
Donate
 




Contribute  



Help
Learn to edit
Community portal
Recent changes
Upload file
 








Search  

































Create account

Log in
 









Create account
 Log in
 




Pages for logged out editors learn more  



Contributions
Talk
 



















Contents

   



(Top)
 


1 Motivation and derivation  



1.1  As a compound distribution  





1.2  As an urn model  







2 Moments and properties  



2.1  Factorial moments  







3 Point estimates  



3.1  Method of moments  





3.2  Maximum likelihood estimation  





3.3  Example: Sex ratio heterogeneity  







4 Role in Bayesian statistics  





5 Generating random variates  





6 Related distributions  





7 See also  





8 References  





9 External links  














Beta-binomial distribution






Català
Deutsch
Español
فارسی
Français
Italiano
עברית
Українська

 

Edit links
 









Article
Talk
 

















Read
Edit
View history
 








Tools
   


Actions  



Read
Edit
View history
 




General  



What links here
Related changes
Upload file
Special pages
Permanent link
Page information
Cite this page
Get shortened URL
Download QR code
Wikidata item
 




Print/export  



Download as PDF
Printable version
 
















Appearance
   

 






From Wikipedia, the free encyclopedia
 


Probability mass function
Probability mass function for the beta-binomial distribution
Cumulative distribution function
Cumulative probability distribution function for the beta-binomial distribution
Notation
Parameters nN0 — number of trials
(real)
(real)
Support x ∈ { 0, …, n }
PMF

where is the beta function
CDF

where 3F2(a;b;x) is the generalized hypergeometric function
Mean
Variance
Skewness
Excess kurtosis See text
MGF where is the hypergeometric function
CF
PGF

Inprobability theory and statistics, the beta-binomial distribution is a family of discrete probability distributions on a finite support of non-negative integers arising when the probability of success in each of a fixed or known number of Bernoulli trials is either unknown or random. The beta-binomial distribution is the binomial distribution in which the probability of success at each of n trials is not fixed but randomly drawn from a beta distribution. It is frequently used in Bayesian statistics, empirical Bayes methods and classical statistics to capture overdispersion in binomial type distributed data.

The beta-binomial is a one-dimensional version of the Dirichlet-multinomial distribution as the binomial and beta distributions are univariate versions of the multinomial and Dirichlet distributions respectively. The special case where α and β are integers is also known as the negative hypergeometric distribution.

Motivation and derivation[edit]

As a compound distribution[edit]

The Beta distribution is a conjugate distribution of the binomial distribution. This fact leads to an analytically tractable compound distribution where one can think of the parameter in the binomial distribution as being randomly drawn from a beta distribution. Suppose we were interested in predicting the number of heads, in future trials. This is given by

Using the properties of the beta function, this can alternatively be written

As an urn model[edit]

The beta-binomial distribution can also be motivated via an urn model for positive integer values of α and β, known as the Pólya urn model. Specifically, imagine an urn containing α red balls and β black balls, where random draws are made. If a red ball is observed, then two red balls are returned to the urn. Likewise, if a black ball is drawn, then two black balls are returned to the urn. If this is repeated n times, then the probability of observing x red balls follows a beta-binomial distribution with parameters n, α and β.

By contrast, if the random draws are with simple replacement (no balls over and above the observed ball are added to the urn), then the distribution follows a binomial distribution and if the random draws are made without replacement, the distribution follows a hypergeometric distribution.

Moments and properties[edit]

The first three raw moments are

and the kurtosisis

Letting we note, suggestively, that the mean can be written as

and the variance as

where . The parameter is known as the "intra class" or "intra cluster" correlation. It is this positive correlation which gives rise to overdispersion. Note that when , no information is available to distinguish between the beta and binomial variation, and the two models have equal variances.

Factorial moments[edit]

The r-th factorial moment of a Beta-binomial random variable Xis

.

Point estimates[edit]

Method of moments[edit]

The method of moments estimates can be gained by noting the first and second moments of the beta-binomial and setting those equal to the sample moments and . We find

These estimates can be non-sensically negative which is evidence that the data is either undispersed or underdispersed relative to the binomial distribution. In this case, the binomial distribution and the hypergeometric distribution are alternative candidates respectively.

Maximum likelihood estimation[edit]

While closed-form maximum likelihood estimates are impractical, given that the pdf consists of common functions (gamma function and/or Beta functions), they can be easily found via direct numerical optimization. Maximum likelihood estimates from empirical data can be computed using general methods for fitting multinomial Pólya distributions, methods for which are described in (Minka 2003). The R package VGAM through the function vglm, via maximum likelihood, facilitates the fitting of glm type models with responses distributed according to the beta-binomial distribution. There is no requirement that n is fixed throughout the observations.

Example: Sex ratio heterogeneity[edit]

The following data gives the number of male children among the first 12 children of family size 13 in 6115 families taken from hospital records in 19th century Saxony (Sokal and Rohlf, p. 59 from Lindsey). The 13th child is ignored to blunt the effect of families non-randomly stopping when a desired gender is reached.

Males 0 1 2 3 4 5 6 7 8 9 10 11 12
Families 3 24 104 286 670 1033 1343 1112 829 478 181 45 7

The first two sample moments are

and therefore the method of moments estimates are

The maximum likelihood estimates can be found numerically

and the maximized log-likelihood is

from which we find the AIC

The AIC for the competing binomial model is AIC = 25070.34 and thus we see that the beta-binomial model provides a superior fit to the data i.e. there is evidence for overdispersion. Trivers and Willard postulate a theoretical justification for heterogeneity in gender-proneness among mammalian offspring.

The superior fit is evident especially among the tails

Males 0 1 2 3 4 5 6 7 8 9 10 11 12
Observed Families 3 24 104 286 670 1033 1343 1112 829 478 181 45 7
Fitted Expected (Beta-Binomial) 2.3 22.6 104.8 310.9 655.7 1036.2 1257.9 1182.1 853.6 461.9 177.9 43.8 5.2
Fitted Expected (Binomial p = 0.519215) 0.9 12.1 71.8 258.5 628.1 1085.2 1367.3 1265.6 854.2 410.0 132.8 26.1 2.3

Role in Bayesian statistics[edit]

The beta-binomial distribution plays a prominent role in the Bayesian estimation of a Bernoulli success probability which we wish to estimate based on data. Let be a sampleofindependent and identically distributed Bernoulli random variables . Suppose, our knowledge of - in Bayesian fashion - is uncertain and is modeled by the prior distribution . If then through compounding, the prior predictive distributionof

.

After observing we note that the posterior distribution for

where is a normalizing constant. We recognize the posterior distribution as a .

Thus, again through compounding, we find that the posterior predictive distribution of a sum of a future sample of size of random variables is

.

Generating random variates[edit]

To draw a beta-binomial random variate simply draw and then draw .

Related distributions[edit]

See also[edit]

References[edit]

External links[edit]


Retrieved from "https://en.wikipedia.org/w/index.php?title=Beta-binomial_distribution&oldid=1217940839"

Categories: 
Discrete distributions
Compound probability distributions
Conjugate prior distributions
Hidden categories: 
Articles with short description
Short description matches Wikidata
 



This page was last edited on 8 April 2024, at 20:15 (UTC).

Text is available under the Creative Commons Attribution-ShareAlike License 4.0; additional terms may apply. By using this site, you agree to the Terms of Use and Privacy Policy. Wikipedia® is a registered trademark of the Wikimedia Foundation, Inc., a non-profit organization.



Privacy policy

About Wikipedia

Disclaimers

Contact Wikipedia

Code of Conduct

Developers

Statistics

Cookie statement

Mobile view



Wikimedia Foundation
Powered by MediaWiki