# U-statistic

A sum

Hoeffding's form for -statistics is [a1]:

The kernel of a -statistic, , is a symmetric real-valued function of variables. The random variables (cf. also Random variable) are independent identically distributed with common distribution function on a measurable space , . The number is called the degree of the -statistic. The number of terms in the sum is equal to

in the first sum and to

in the second sum. Also, .

Various statistics can be represented as -statistics or can be approximated by -statistics with a suitable choice of the kernel . For example, the sampling variance

can be obtained using the kernel . Here,

is the mean value of the sample. The von Mises functional, given by

where

is the empirical distribution, can be represented by a linear combination of -statistics [a2]. For the primitive kernel , the -statistic

is a symmetric polynomial statistic of the random variables , .

The starting point of the analysis of -statistics is the Hoeffding decomposition of -statistics, [a1]:

where , , are completely degenerate kernels: . The integer is called the rank of the -statistic. Here, by definition, is the mean value of the kernel and, also, . Therefore, an -statistic is an unbiased estimator of the functional .

The theory of -statistics, founded by W. Hoeffding in the seminal work [a1], published in 1948, was developed under the impact of the theory of sums of independent random variables. The law of large numbers, the central limit theorem, the law of the iterated logarithm, etc. were investigated in various works (see the references in [a3]). The asymptotic behaviour of -statistics can be reduced to the analysis of sums of independent identically distributed random variables. For a non-degenerate kernel with and there is weak convergence (as ; cf. also Convergence, types of)):

where is a random variable with standard Gaussian distribution with and . Here, .

For the limit distribution of -statistics depends essentially on the kernel. For a primitive completely degenerate kernel

with and , there is weak convergence (as ):

where is the Hermite polynomial of degree [a7] (cf. also Hermite polynomials).

-statistics with completely degenerate kernel, and , converge weakly (as ) to the Itô–Wiener stochastic integral [a3], [a5]:

-statistics can also be represented by the stochastic integral with respect to the permanent random measure, as follows, [a3],

where

The asymptotic analysis of -statistics is based on the martingale structure of -statistics and involves functional limit theorems, rate of convergence, almost sure convergence, asymptotic expansions, and probability of large deviations.

The contemporary development of the theory of -statistics contains various generalizations: -statistics with kernel taking values in a Hilbert or Banach space [a8], multi-sampling -statistics, bootstrap and truncated -statistics, weighted -statistics, etc. -statistics with kernel depending on are used in non-parametric density and regression estimation [a2], [a3], [a4], [a5], [a6].

#### References

 [a1] W. Hoeffding, "A class of statistics with asymptotically normal distribution" Ann. Math. Stat. , 19 (1948) pp. 293–325 [a2] A.J. Lee, "U-statistics. Theory and practice" , Statistics textbooks and monographs , 110 , M. Dekker (1990) [a3] V.S. Korolyuk, Yu.V. Borovskikh, "Theory of U-statistics" , Kluwer Acad. Publ. (1994) (In Russian) [a4] R.J. Serfling, "Approximation: theorems of mathematical statistics" , Wiley (1980) [a5] E.B. Dynkin, A. Mandelbaum, "Symmetric statistics, Poisson point process and multiple Wiener integrals" Ann. Stat. , 11 (1983) pp. 739–745 [a6] M. Denker, "Asymptotic theory in nonparametric statistics" , Vieweg (1985) [a7] V.S. Korolyuk, Yu.V. Borovskikh, "Random permanents" , VSP (1994) (In Russian) [a8] Yu.V. Borovskikh, "U-statistics in Banach space" , VSP (1995) (In Russian)
How to Cite This Entry:
U-statistic. V.S. Korolyuk (originator), Encyclopedia of Mathematics. URL: http://www.encyclopediaofmath.org/index.php?title=U-statistic&oldid=13765
This text originally appeared in Encyclopedia of Mathematics - ISBN 1402006098