Difference between revisions of "Multinomial distribution"

Revision as of 14:32, 7 June 2020

polynomial distribution

2020 Mathematics Subject Classification: Primary: 60E99 [MSN][ZBL]

The joint distribution of random variables that is defined for any set of non-negative integers satisfying the condition , , , by the formula

(*)

where (, ) are the parameters of the distribution. A multinomial distribution is a multivariate discrete distribution, namely the distribution for the random vector with (this distribution is in essence -dimensional, since it is degenerate in the Euclidean space of dimensions). A multinomial distribution is a natural generalization of a binomial distribution and coincides with the latter for . The name of the distribution is given because the probability (*) is the general term in the expansion of the multinomial . The multinomial distribution appears in the following probability scheme. Each of the random variables is the number of occurrences of one of the mutually exclusive events , , in repeated independent trials. If in each trial the probability of event is , , then the probability (*) is equal to the probability that in trials the events will appear times, respectively. Each of the random variables has a binomial distribution with mathematical expectation and variance .

The random vector has mathematical expectation and covariance matrix , where

(the rank of the matrix is because ). The characteristic function of a multinomial distribution is

For , the distribution of the vector with normalized components

tends to a certain multivariate normal distribution, while the distribution of the sum

(which is used in mathematical statistics to construct the "chi-squared" test) tends to the "chi-squared" distribution with degrees of freedom.

References

[C]	H. Cramér, "Mathematical methods of statistics" , Princeton Univ. Press (1946) MR0016588 Zbl 0063.01014

Comments

References

[JK]	N.L. Johnson, S. Kotz, "Discrete distributions" , Wiley (1969) MR0268996 Zbl 0292.62009

How to Cite This Entry:
Multinomial distribution. Encyclopedia of Mathematics. URL: http://encyclopediaofmath.org/index.php?title=Multinomial_distribution&oldid=49316

This article was adapted from an original article by A.V. Prokhorov (originator), which appeared in Encyclopedia of Mathematics - ISBN 1402006098. See original article

@@ Line 1: / Line 1: @@
-<!--
-m0653301.png
-$#A+1 = 40 n = 0
-$#C+1 = 40 : ~/encyclopedia/old_files/data/M065/M.0605330 Multinomial distribution,
-Automatically converted into TeX, above some diagnostics.
-Please remove this comment and the {{TEX|auto}} line below,
-if TeX found to be correct.
--->
-{{TEX|auto}}
-{{TEX|done}}
 ''polynomial distribution''
@@ Line 17: / Line 5: @@
 [[Category:Distribution theory]]
-The joint distribution of random variables  $  X _ {1} \dots X _ {k} $
+The joint distribution of random variables <img align="absmiddle" border="0" src="https://www.encyclopediaofmath.org/legacyimages/m/m065/m065330/m0653301.png" /> that is defined for any set of non-negative integers <img align="absmiddle" border="0" src="https://www.encyclopediaofmath.org/legacyimages/m/m065/m065330/m0653302.png" /> satisfying the condition <img align="absmiddle" border="0" src="https://www.encyclopediaofmath.org/legacyimages/m/m065/m065330/m0653303.png" />, <img align="absmiddle" border="0" src="https://www.encyclopediaofmath.org/legacyimages/m/m065/m065330/m0653304.png" />, <img align="absmiddle" border="0" src="https://www.encyclopediaofmath.org/legacyimages/m/m065/m065330/m0653305.png" />, by the formula
-that is defined for any set of non-negative integers  $  n _ {1} \dots n _ {k} $
-satisfying the condition  $  n _ {1} + \dots + n _ {k} = n $,
-$  n _ {j} = 0 \dots n $,
-$  j = 1 \dots k $,
-by the formula
-$$ \tag{* }
+<table class="eq" style="width:100%;"> <tr><td valign="top" style="width:94%;text-align:center;"><img align="absmiddle" border="0" src="https://www.encyclopediaofmath.org/legacyimages/m/m065/m065330/m0653306.png" /></td> <td valign="top" style="width:5%;text-align:right;">(*)</td></tr></table>
-{\mathsf P} \{ X _ {1} = n _ {1} \dots X _ {k} = n _ {k} \}  = \
-n! over {n _ {1} ! \dots n _ {k} ! } p _ {1} ^ {n _ {1} } \dots p _ {k} ^ {n _ {k} } ,
-$$
-where  $  n, p _ {1} \dots p _ {k} $(
+where <img align="absmiddle" border="0" src="https://www.encyclopediaofmath.org/legacyimages/m/m065/m065330/m0653307.png" /> (<img align="absmiddle" border="0" src="https://www.encyclopediaofmath.org/legacyimages/m/m065/m065330/m0653308.png" />, <img align="absmiddle" border="0" src="https://www.encyclopediaofmath.org/legacyimages/m/m065/m065330/m0653309.png" />) are the parameters of the distribution. A multinomial distribution is a multivariate discrete distribution, namely the distribution for the random vector <img align="absmiddle" border="0" src="https://www.encyclopediaofmath.org/legacyimages/m/m065/m065330/m06533010.png" /> with <img align="absmiddle" border="0" src="https://www.encyclopediaofmath.org/legacyimages/m/m065/m065330/m06533011.png" /> (this distribution is in essence <img align="absmiddle" border="0" src="https://www.encyclopediaofmath.org/legacyimages/m/m065/m065330/m06533012.png" />-dimensional, since it is degenerate in the Euclidean space of <img align="absmiddle" border="0" src="https://www.encyclopediaofmath.org/legacyimages/m/m065/m065330/m06533013.png" /> dimensions). A multinomial distribution is a natural generalization of a [[Binomial distribution|binomial distribution]] and coincides with the latter for <img align="absmiddle" border="0" src="https://www.encyclopediaofmath.org/legacyimages/m/m065/m065330/m06533014.png" />. The name of the distribution is given because the probability (*) is the general term in the expansion of the multinomial <img align="absmiddle" border="0" src="https://www.encyclopediaofmath.org/legacyimages/m/m065/m065330/m06533015.png" />. The multinomial distribution appears in the following probability scheme. Each of the random variables <img align="absmiddle" border="0" src="https://www.encyclopediaofmath.org/legacyimages/m/m065/m065330/m06533016.png" /> is the number of occurrences of one of the mutually exclusive events <img align="absmiddle" border="0" src="https://www.encyclopediaofmath.org/legacyimages/m/m065/m065330/m06533017.png" />, <img align="absmiddle" border="0" src="https://www.encyclopediaofmath.org/legacyimages/m/m065/m065330/m06533018.png" />, in repeated independent trials. If in each trial the probability of event <img align="absmiddle" border="0" src="https://www.encyclopediaofmath.org/legacyimages/m/m065/m065330/m06533019.png" /> is <img align="absmiddle" border="0" src="https://www.encyclopediaofmath.org/legacyimages/m/m065/m065330/m06533020.png" />, <img align="absmiddle" border="0" src="https://www.encyclopediaofmath.org/legacyimages/m/m065/m065330/m06533021.png" />, then the probability (*) is equal to the probability that in <img align="absmiddle" border="0" src="https://www.encyclopediaofmath.org/legacyimages/m/m065/m065330/m06533022.png" /> trials the events <img align="absmiddle" border="0" src="https://www.encyclopediaofmath.org/legacyimages/m/m065/m065330/m06533023.png" /> will appear <img align="absmiddle" border="0" src="https://www.encyclopediaofmath.org/legacyimages/m/m065/m065330/m06533024.png" /> times, respectively. Each of the random variables <img align="absmiddle" border="0" src="https://www.encyclopediaofmath.org/legacyimages/m/m065/m065330/m06533025.png" /> has a binomial distribution with mathematical expectation <img align="absmiddle" border="0" src="https://www.encyclopediaofmath.org/legacyimages/m/m065/m065330/m06533026.png" /> and variance <img align="absmiddle" border="0" src="https://www.encyclopediaofmath.org/legacyimages/m/m065/m065330/m06533027.png" />.
-$  p _ {j} \geq  0 $,
-$  \sum p _ {j} = 1 $)
-are the parameters of the distribution. A multinomial distribution is a multivariate discrete distribution, namely the distribution for the random vector  $  ( X _ {1} \dots X _ {k} ) $
-with  $  X _ {1} + \dots + X _ {k} = n $(
-this distribution is in essence  $  ( k- 1) $-
-dimensional, since it is degenerate in the Euclidean space of  $  k $
-dimensions). A multinomial distribution is a natural generalization of a [[Binomial distribution|binomial distribution]] and coincides with the latter for  $  k = 2 $.
-The name of the distribution is given because the probability (*) is the general term in the expansion of the multinomial  $  ( p _ {1} + \dots + p _ {k} )  ^ {n} $.
-The multinomial distribution appears in the following probability scheme. Each of the random variables  $  X _ {i} $
-is the number of occurrences of one of the mutually exclusive events  $  A _ {j} $,
-$  j = 1 \dots k $,
-in repeated independent trials. If in each trial the probability of event  $  A _ {j} $
-is  $  p _ {j} $,
-$  j = 1 \dots k $,
-then the probability (*) is equal to the probability that in  $  n $
-trials the events  $  A _ {1} \dots A _ {k} $
-will appear  $  n _ {1} \dots n _ {k} $
-times, respectively. Each of the random variables  $  X _ {j} $
-has a binomial distribution with mathematical expectation  $  np _ {j} $
-and variance  $  np _ {j} ( 1- p _ {j} ) $.
-The random vector  $  ( X _ {1} \dots X _ {k} ) $
+The random vector <img align="absmiddle" border="0" src="https://www.encyclopediaofmath.org/legacyimages/m/m065/m065330/m06533028.png" /> has mathematical expectation <img align="absmiddle" border="0" src="https://www.encyclopediaofmath.org/legacyimages/m/m065/m065330/m06533029.png" /> and covariance matrix <img align="absmiddle" border="0" src="https://www.encyclopediaofmath.org/legacyimages/m/m065/m065330/m06533030.png" />, where
-has mathematical expectation  $  ( np _ {1} \dots np _ {k} ) $
-and covariance matrix  $  B = \| b _ {ij} \| $,
-where
-$$
+<table class="eq" style="width:100%;"> <tr><td valign="top" style="width:94%;text-align:center;"><img align="absmiddle" border="0" src="https://www.encyclopediaofmath.org/legacyimages/m/m065/m065330/m06533031.png" /></td> </tr></table>
-b _ {ij}  =  \left \{
-(the rank of the matrix  $  B $
+(the rank of the matrix <img align="absmiddle" border="0" src="https://www.encyclopediaofmath.org/legacyimages/m/m065/m065330/m06533032.png" /> is <img align="absmiddle" border="0" src="https://www.encyclopediaofmath.org/legacyimages/m/m065/m065330/m06533033.png" /> because <img align="absmiddle" border="0" src="https://www.encyclopediaofmath.org/legacyimages/m/m065/m065330/m06533034.png" />). The characteristic function of a multinomial distribution is
-is  $  k- 1 $
-because  $  \sum _ {i=} 1  ^ {k} n _ {i} = n $).
-The characteristic function of a multinomial distribution is
-$$
+<table class="eq" style="width:100%;"> <tr><td valign="top" style="width:94%;text-align:center;"><img align="absmiddle" border="0" src="https://www.encyclopediaofmath.org/legacyimages/m/m065/m065330/m06533035.png" /></td> </tr></table>
-f( t _ {1} \dots t _ {k} )  =  \left ( p _ {1} e ^ {it _ {1} } + \dots + p _ {k} e ^ {it _ {k} } \right )  ^ {n} .
-$$
-For  $  n \rightarrow \infty $,
+For <img align="absmiddle" border="0" src="https://www.encyclopediaofmath.org/legacyimages/m/m065/m065330/m06533036.png" />, the distribution of the vector <img align="absmiddle" border="0" src="https://www.encyclopediaofmath.org/legacyimages/m/m065/m065330/m06533037.png" /> with normalized components
-the distribution of the vector  $  ( Y _ {1} \dots Y _ {k} ) $
-with normalized components
-$$
+<table class="eq" style="width:100%;"> <tr><td valign="top" style="width:94%;text-align:center;"><img align="absmiddle" border="0" src="https://www.encyclopediaofmath.org/legacyimages/m/m065/m065330/m06533038.png" /></td> </tr></table>
-Y _ {i}  = \
-\frac{X _ {i} - np _ {i} }{\sqrt {np _ {i} ( 1- p _ {i} ) } }
-$$
 tends to a certain multivariate [[Normal distribution|normal distribution]], while the distribution of the sum
-$$
+<table class="eq" style="width:100%;"> <tr><td valign="top" style="width:94%;text-align:center;"><img align="absmiddle" border="0" src="https://www.encyclopediaofmath.org/legacyimages/m/m065/m065330/m06533039.png" /></td> </tr></table>
-\sum _ { i= } 1 ^ { k }  ( 1 - p _ {i} ) Y _ {i}  ^ {2}
-$$
-(which is used in mathematical statistics to construct the [[Chi-squared distribution| "chi-squared" test]]) tends to the [[Chi-squared test| "chi-squared" distribution]] with  $  k- 1 $
+(which is used in mathematical statistics to construct the [[Chi-squared distribution| "chi-squared" test]]) tends to the [[Chi-squared test| "chi-squared" distribution]] with <img align="absmiddle" border="0" src="https://www.encyclopediaofmath.org/legacyimages/m/m065/m065330/m06533040.png" /> degrees of freedom.
-degrees of freedom.
 ====References====
@@ Line 94: / Line 35: @@
 ====Comments====
 ====References====

Navigation

Tools

Namespaces

Variants

Views

Actions

Difference between revisions of "Multinomial distribution"

Revision as of 14:32, 7 June 2020

References

Comments

References