statistics

sigma is the standard deviation of a population of size N.
S is the standard deviation of a sample of size n drawn from that population.

What is the estimated value of S^2?

If the population were infinitely large (size N = infinity), what would the estimated value of S^2 be?

  1. I meant "expected" value, not "estimated" value. Sorry about that.

  2. As far as I know (statistics is not really my specialty), the sigma of the sample depends only on the sample size, not the size of the population the sample is chosen from.
    Therefore in my ignorance I would say:
    S^2 sample = sigma^2 of population / n
    or
    sigma sample = sigma population/sqrt(n)

    posted by Damon
  3. Damon, that can't be right. As n approaches infinity, S^2 should approach sigma^2.

    Also, the Wikipedia entry uses both the sample size and the population size in its formula, which is one reason I wanted to see it derived.

  4. You are right. I think I have that backwards and it is too simple anyway. I hope a statistics expert comes by here.

    posted by Damon
  5. You can formally write down everything in terms of the probability distribution function. Let's say the population consists of N elements and each element can be in some state denoted by a continuous variable x, distributed according to the same probability density p(x).

    If all the variables are independent, the joint probability distribution factorizes:

    p(x1, x2, ...,xN) = p(x1)p(x2)...p(xN)

    If you measure S^2, you take n of the variables xi, say x1, x2, ..., xn, and compute the variance in the usual way:

    S^2 = <x^2> - <x>^2

    where <x> and <x^2> denote averages over the n numbers x1, x2, ..., xn in the sample, not averages with respect to p(x); S^2 depends on the actual numbers drawn. So we do not know in advance what S^2 will be, but we can compute its probability distribution, expectation value, etc. in terms of the function p(x).

    The expectation value is given by:

    Integral dx1 dx2...dxn p(x1)p(x2)...p(xn) [<x^2> - <x>^2]

    Insert in here:

    <x^2> = (1/n)(x1^2 + x2^2 + ... + xn^2)

    <x>^2 = (1/n^2)(x1 + x2 + ... + xn)^2
          = (1/n^2)(x1^2 + x2^2 + ... + xn^2) + (1/n^2)(sum over xi xj for i not equal to j)

    Now let's compute the integrations. Let's use the notation <<f(x)>> for an average relative to p(x). So

    <<x>> means Integral dx x p(x)

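    After integrating, only terms like <<x>>^2 and <<x^2>> remain: each xi^2 term gives <<x^2>>, and, because the variables are independent, each cross term xi xj with i not equal to j gives <<x>>^2. You just need to count how many there are of each (n squared terms and n(n-1) cross terms) and keep track of the prefactors.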

    You should find:

    <<S^2>> = (1-1/n) [<<x^2>> - <<x>>^2]
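
The counting step can be written out explicitly. This is a sketch in LaTeX notation using the same symbols as in this answer (single brackets for averages over the sample, double brackets for averages over p(x)); the index labels i and j are the only additions.

```latex
\begin{aligned}
\langle\langle\, \langle x^2 \rangle \,\rangle\rangle
  &= \frac{1}{n} \sum_{i=1}^{n} \langle\langle x_i^2 \rangle\rangle
   = \langle\langle x^2 \rangle\rangle , \\
\langle\langle\, \langle x \rangle^2 \,\rangle\rangle
  &= \frac{1}{n^2} \Big[ \sum_{i=1}^{n} \langle\langle x_i^2 \rangle\rangle
     + \sum_{i \neq j} \langle\langle x_i \rangle\rangle \langle\langle x_j \rangle\rangle \Big]
   = \frac{1}{n} \langle\langle x^2 \rangle\rangle
     + \frac{n-1}{n} \langle\langle x \rangle\rangle^2 , \\
\langle\langle S^2 \rangle\rangle
  &= \langle\langle x^2 \rangle\rangle
     - \frac{1}{n} \langle\langle x^2 \rangle\rangle
     - \frac{n-1}{n} \langle\langle x \rangle\rangle^2
   = \Big( 1 - \frac{1}{n} \Big)
     \Big[ \langle\langle x^2 \rangle\rangle - \langle\langle x \rangle\rangle^2 \Big] .
\end{aligned}
```

The cross terms factorize into a product of two single averages only because the joint distribution factorizes, which is where the independence assumption enters.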

    Now, the term in the square brackets is sigma^2 for an infinitely large population: if you average over an infinite number of elements you are computing the exact average, which is also given by the integral over the probability distribution.

    The factor (1-1/n) explains why, when estimating the standard deviation from a finite sample, you have n-1 in the denominator under the square root instead of n. S^2 will, on average, be the true variance sigma^2 times (1-1/n), so you divide by this factor, i.e. multiply by n/(n-1).
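
The (1-1/n) factor can be checked by brute force. The sketch below is only an illustration under stated assumptions: it swaps the continuous p(x) for a discrete one (a fair six-sided die), so the integrals become finite sums that can be enumerated exactly, and the function names are made up for this example. It also includes the finite-population case from the original question (sampling n of N elements without replacement). For that case it compares against the standard survey-sampling result (1-1/n) * N/(N-1) * sigma^2, which is not derived in this thread.

```python
from fractions import Fraction
from itertools import combinations, product


def biased_variance(sample):
    """S^2 = <x^2> - <x>^2, with averages taken over the sample itself."""
    n = len(sample)
    mean = Fraction(sum(sample), n)
    mean_sq = Fraction(sum(x * x for x in sample), n)
    return mean_sq - mean ** 2


def expected_s2_independent(values, n):
    """Exact <<S^2>> for n independent draws, each uniform over `values`.

    This mirrors the infinite-population case: every ordered n-tuple
    is equally likely because the joint distribution factorizes.
    """
    samples = list(product(values, repeat=n))
    return sum(biased_variance(s) for s in samples) / len(samples)


def expected_s2_without_replacement(population, n):
    """Exact expected S^2 when n distinct elements are drawn from a finite population."""
    samples = list(combinations(population, n))
    return sum(biased_variance(s) for s in samples) / len(samples)


die = [1, 2, 3, 4, 5, 6]
n = 3
N = len(die)
sigma2 = biased_variance(die)  # variance of the distribution, <<x^2>> - <<x>>^2

# Independent draws: compare with (1 - 1/n) * sigma^2.
print(expected_s2_independent(die, n), (1 - Fraction(1, n)) * sigma2)

# Finite population, no replacement: compare with (1 - 1/n) * N/(N-1) * sigma^2.
print(
    expected_s2_without_replacement(die, n),
    (1 - Fraction(1, n)) * Fraction(N, N - 1) * sigma2,
)
```

Exact fractions are used instead of floating point so that the two numbers on each printed line can be compared for equality. In the without-replacement case, setting n = N gives back sigma^2 exactly, which matches the objection raised earlier in the thread.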

  6. Thank you!

    posted by Damon

