Talk:Statistics theory

From Citizendium
Revision as of 18:09, 8 December 2007 by imported>Ragnar Schroder
This article is developing and not approved.
Definition: A branch of mathematics that specializes in enumeration, or counted, data and their relation to measured data.
Workgroup category: Mathematics. Talk archive: none. English language variant: American English.

Definition of a statistic

The modified sentence:

"More generally, a statistic can be any measure within a data sample. This would be some quantification of a random variable, or variables, of interest, such as a height, weight, polling results, test performance, and so on"

does not have the same meaning as the original

"More generally, a statistic can be any measurable function of the data samples, the latter being realizations of the random variables which are of interest such as the height of people, polling results, students' performance on a test, and so on."

In particular, a measure and a measurable function are not the same thing, and the new sentence obscures the definition of a statistic. The point is that mathematical statistics has a precise definition of a statistic, one based on measure-theoretic probability theory, and I have provided a reference for that definition. An intuitive definition, as given in the second paragraph of the article, is fine as a gentle introduction, but it should be complemented by a more rigorous mathematical definition.
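To make the distinction concrete, here is a sketch of the two notions side by side. The notation (the spaces, sigma-algebras, and the symbol T) is my own choice for illustration and is not taken from the article or from the cited reference.

```latex
% Setting (notation assumed for illustration):
% observations X_1, ..., X_n take values in a measurable space (\mathcal{X}, \mathcal{A}).

% A statistic is a measurable function of the sample:
\[
  T : (\mathcal{X}^n, \mathcal{A}^{\otimes n}) \to (\mathcal{T}, \mathcal{B}),
  \qquad
  T^{-1}(B) \in \mathcal{A}^{\otimes n} \quad \text{for every } B \in \mathcal{B},
\]
% and its value on the data is T(X_1, ..., X_n), e.g. the sample mean
\[
  T(x_1, \dots, x_n) = \frac{1}{n} \sum_{i=1}^{n} x_i .
\]

% A measure, by contrast, is a set function on the sigma-algebra:
\[
  \mu : \mathcal{A} \to [0, \infty],
  \qquad
  \mu(\emptyset) = 0,
  \qquad
  \mu\Bigl(\bigcup_{i=1}^{\infty} A_i\Bigr) = \sum_{i=1}^{\infty} \mu(A_i)
  \quad \text{for pairwise disjoint } A_i \in \mathcal{A}.
\]
```

So a statistic takes data points as its argument, whereas a measure takes sets as its argument; this is why "any measure within a data sample" says something different from "any measurable function of the data samples".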

I agree that my original sentence may not have been very readable, so to strike a compromise I combined the good parts of both sentences and produced what now appears in the article. Cheers, --Hendra I. Nurdin 17:25, 10 November 2007 (CST)

Outstanding edit! --Michael J. Formica 19:17, 10 November 2007 (CST)


"A data sample is regarded as instances of a random variable of interest..."
I think referring to "random variable" here narrows the focus a little too much.
Statistics is largely about extracting concise information from large piles of data. Sometimes the data set is best described without reference to a numerical random variable. For instance, the fact that the most common first name in this or that town is "Billy" is a perfectly good statistic, as is the fact that "I" is the most commonly used word in English.
Ragnar Schroder 18:09, 8 December 2007 (CST)
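As a small illustration of the point above, the most common value (the mode) of a non-numerical sample can be computed directly as a function of the data. This is only a sketch: the function name `sample_mode` and the list of names are made up for the example and do not come from the article.

```python
from collections import Counter


def sample_mode(sample):
    """Return (value, count) for the most frequent value in a non-empty sample.

    The sample may hold any hashable values, e.g. first names or words,
    so no numerical encoding of the data is required.
    """
    if not sample:
        raise ValueError("sample must be non-empty")
    # most_common(1) returns a one-element list of (value, count) pairs.
    value, count = Counter(sample).most_common(1)[0]
    return value, count


# Hypothetical data: first names recorded in some town.
first_names = ["Billy", "Anna", "Billy", "Carl", "Anna", "Billy"]
mode_name, mode_count = sample_mode(first_names)
print(mode_name, mode_count)
```

The mode here is still a function of the observed sample, so it fits the "function of the data" reading of a statistic even though the observations themselves are labels rather than numbers.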