Key points are not available for this paper at this time.
The purpose of this paper is to investigate the asymptotic normality of linear combinations of order statistics; that is, to find conditions under which a statistic of the form Sₙ = ⁿ₈=₁ c₈₍X₈₍ has a limiting normal distribution as n becomes infinite, where the c₈₍'s are constants and X₁₍, X₂₍, , X₍₍ are the observations of a sample of size n, ordered by increasing magnitude. Aside from the sample mean (the case where the weights c₈₍ are all equal to 1/n), the first proof of asymptotic normality within this class was by Smirnov in 1935 19, who considered the case that nonzero weight is attached to at most two percentiles. In 1946, Mosteller 13 extended this to the case of several percentiles, and coined the phrase "systematic statistic" to describe Sₙ. Since the publication in 1955 of a paper by Jung 11 concerned with finding optimal weights for Sₙ in certain estimation problems, interest in proving its asymptotic normality under more general conditions has grown. For example, Weiss in 21 proved that Sₙ has a limiting normal distribution when no weight is attached to the observations below the pth sample percentile and above the qth sample percentile, p < q, and the remaining observations are weighted according to a function J by c₈₍ = J (i/ (n + 1) ), where J is assumed to have a bounded derivative between p and q. Within the past few years, several notable attempts have been made to prove the asymptotic normality of Sₙ under more general conditions on the weights and underlying distribution. These attempts have employed three essentially different methods. In 1 Bickel used an invariance principle for order statistics to prove asymptotic normality when ₈<ₓ₍ c₈₍ converges to a function J (t) of bounded variation and the underlying distribution F has a continuous density, positive on the support of F. His method was quite successful in dealing with statistics which put no weight on observations below the pth and above the qth sample percentile, p < q, but in other cases he did not allow the more extreme observations to be weighted more than in the sample mean. More recently in 17, Shorack used a more high powered version of the same approach to obtain a stronger result, allowing much more weight on the extremes. Chernoff, Gastwirth, and Johns, in 3, employed a device of Renyi 14 and expressed Sₙ as a linear combination of independent, exponentially distributed random variables plus a remainder term; they then showed that the sum of independent variables has a limiting normal distribution and the remainder is asymptotically negligible in the sense that it tends to zero in probability as n tends to infinity. They proved asymptotic normality if the c₈₍'s give substantial weight to only a fixed number of percentiles and obey certain boundedness conditions elsewhere, and F has a density which is continuous outside a set of measure zero, bounded away from zero on the interior of the support of F, and smooth in the tails. A different approach is used by Govindarajulu in 7. He adopted a version due to LeCam of the method of Chernoff and Savage 4, which was used by Govindarajulu, LeCam, and Raghavachari in 8 to prove asymptotic normality of linear rank statistics, and for the case c₈₍ = J (i/ (n + 1) ) he expressed Sₙ as a linear combination of independent random variables plus a remainder term. Then if J is absolutely continuous and both it and its derivative satisfy certain boundedness conditions at zero and one, the remainder term tends to zero in probability and Sₙ has a limiting normal distribution. While his conditions on the weights are very restrictive (for example he does not allow substantial weight to be put on sample percentiles), his conditions on the underlying distribution are the weakest yet obtained, requiring only that the inverse of F does not grow very rapidly at zero or one. Recently in 12, Moore gave a short proof of asymptotic normality, also along the lines of Chernoff and Savage 4, which permits quite general F, but again at the expense of stringent conditions on J. In this investigation we use yet another method of attacking this problem. Using a procedure due to Hajek 9, who applied it to linear rank statistics, we will represent the statistic Sₙ as a linear combination of independent random variables, to which the usual central limit theory can be applied, plus a remainder term, and then, under quite general conditions on the weights, prove that the remainder converges to zero in mean square, rather than in the weaker sense of convergence in probability as in 3, 7, and 12. This is accomplished by first approximating the statistic Tₙ = ^n-bₙ₈=₁䂸 c₈₍X₈₍ by a sum of independent random variables, where bₙ is a sequence of integers tending to infinity slower than n but faster than n as n increases, and then finding conditions under which Tₙ approximates Sₙ in mean square. The result is essentially stronger than that of Bickel, in that much more weight is allowed on the extreme observations; however, a smoothness condition on the distribution similar to that of Chernoff, Gastwirth, and Johns is required. For statistics of the form Tₙ the result is essentially stronger than that of Chernoff, Gastwirth, and Johns, although slightly more restrictive conditions are required to prove mean square equivalence of Tₙ and Sₙ. We treat a much more general class of weights than does Govindarajulu or Moore, but our conditions on the underlying distribution are stronger than their conditions. In the course of the proof we derive asymptotic expressions for the covariances of the order statistics and the variance of Sₙ. For the case of the variance of a single order statistic X₈₍, the result is proved under weaker conditions on F and for a wider range of i than by Sen 16, Van Zwet 20, Bickel 1, or Blom 2, although the speed of convergence is slower than in 16, 20, or 2. The asymptotic expression for covariances is similarly improved over Blom 2. The first section of the paper describes the method to be used and states two previously known propositions which will be useful in the following sections. Section two contains the calculation of an approximation for an order statistic by a sum of independent random variables and an exact expression for the covariance of two such approximations. Asymptotic expressions for this covariance and the covariance of two order statistics are derived in section three, as well as an expression for the variance of Sₙ. Section four contains the proof of the asymptotic normality of Sₙ when the extremes are not included and no weight is allowed for i/ (n + 1) near a point where the derivative of the inverse of F misbehaves, and conditions are given under which these restrictions can be dropped. Finally, in section five we discuss the limitations of the method used and extend the results to the slightly more general class of statistics of the form ₈₍h (X₈₍), also considered in 3, 7, and 17.
Stephen M. Stigler (Sun,) studied this question.
Synapse has enriched 5 closely related papers on similar clinical questions. Consider them for comparative context: