statistics

I'm trying to work through the proof for
SST = SSM + SSE

MEAN = ∑(X)/N

SST = ∑((x - MEAN)^2)
= ∑(x^2 - 2 * x * MEAN + MEAN^2)
= ∑(x^2) - 2 * MEAN * ∑(x) + N * MEAN^2
= ∑(x^2) - 2 * ∑(x)^2/N + ∑(x)^2/N
= ∑(x^2) - ∑(x)^2/N

SSM = ∑((MODEL - MEAN)^2)
= ∑(MODEL^2 - 2 * MODEL * MEAN + MEAN^2)
= ∑(MODEL^2) - 2 * MEAN * ∑(MODEL) + N * MEAN^2
= ∑(MODEL^2) - 2 * ∑(x)/N * ∑(MODEL) + N * (∑(x)/N)^2
= ∑(MODEL^2) - 2/N * ∑(x) * ∑(MODEL) + ∑(x)^2/N

SSE = ∑((x - MODEL)^2)
= ∑(x^2 - 2 * x * MODEL + MODEL^2)
= ∑(x^2) - 2 * ∑(x * MODEL) + ∑(MODEL^2)

SST = SSM + SSE
∑(x^2) - ∑(x)^2/N = ∑(MODEL^2) - 2/N * ∑(x) * ∑(MODEL) + ∑(x)^2/N + ∑(x^2) - 2 * ∑(x * MODEL) + ∑(MODEL^2)
2 * ∑(MODEL^2) - 2/N * ∑(x) * ∑(MODEL) + 2 * ∑(x)^2/N - 2 * ∑(x * MODEL) = 0
∑(MODEL^2) - 1/N * ∑(x) * ∑(MODEL) + ∑(x)^2/N - ∑(x * MODEL) = 0

I can't complete the proof. What am I missing? Thanks!
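
The closed forms derived above for SST, SSM and SSE can be checked numerically. The sketch below is only an illustration: the data values, the predictor `t`, and the helper names `least_squares_line` and `sums_of_squares` are assumptions that do not come from the post, and MODEL is assumed to be a straight line fitted by ordinary least squares.

```python
def least_squares_line(t, x):
    """Fit x ~ intercept + slope * t by ordinary least squares (assumed model)."""
    n = len(x)
    t_bar = sum(t) / n
    x_bar = sum(x) / n
    slope = sum((ti - t_bar) * (xi - x_bar) for ti, xi in zip(t, x)) / sum(
        (ti - t_bar) ** 2 for ti in t
    )
    intercept = x_bar - slope * t_bar
    return [intercept + slope * ti for ti in t]


def sums_of_squares(x, model):
    """Return (SST, SSM, SSE) by definition, by closed form, and the leftover sum."""
    n = len(x)
    mean = sum(x) / n
    sx, sm = sum(x), sum(model)
    sxx = sum(xi * xi for xi in x)
    smm = sum(mi * mi for mi in model)
    sxm = sum(xi * mi for xi, mi in zip(x, model))
    by_definition = (
        sum((xi - mean) ** 2 for xi in x),                # SST
        sum((mi - mean) ** 2 for mi in model),            # SSM
        sum((xi - mi) ** 2 for xi, mi in zip(x, model)),  # SSE
    )
    closed_form = (
        sxx - sx**2 / n,                    # SST closed form
        smm - 2 / n * sx * sm + sx**2 / n,  # SSM closed form
        sxx - 2 * sxm + smm,                # SSE closed form
    )
    # Left-hand side of the last equation in the post; zero iff SST = SSM + SSE.
    leftover = smm - sx * sm / n + sx**2 / n - sxm
    return by_definition, closed_form, leftover


# Illustrative data (an assumption, not taken from the post).
x = [2.0, 3.0, 3.0, 4.0, 5.0, 8.0, 9.0, 11.0, 11.0, 13.0]
t = [float(i) for i in range(len(x))]  # hypothetical predictor 0, 1, ..., 9
model = least_squares_line(t, x)

by_definition, closed_form, leftover = sums_of_squares(x, model)
print("by definition:", by_definition)
print("closed form:  ", closed_form)
print("leftover:     ", leftover)
```

Under these assumptions the closed forms agree with the definitions, and the leftover expression comes out as zero up to rounding, which suggests the algebra is sound and the missing ingredient is a property of how MODEL is chosen.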

  1. Divide the sum of squares by N and work with the averages. Let's use the notation:

    <X> for the average of X. E.g.:

    <X> = ∑(X)/N = Mean

    And:

    <(X-<X>)^2> =

    <X^2 - 2X<X> + <X>^2> =

    <X^2> - <X>^2

    Note that <a X> = a <X> for a constant factor a. In an average like <X <X>>, the inner <X> is a constant when carrying out the outer average, so you can take it out of the outer average sign. So, you have <X <X>> = <X>^2. The average of a constant is, of course, the same constant so e.g. <<X>^2> = <X^2> because once the inner average is carried out it is a constant w.r.t. the outer average.

    If you work with averages and use these rules then you can derive the desired result in just one line. If you use summations, you'll tend to re-derive these rules in every step you make, so you'll get a complicated mess.

    Derivation:

    <(X - <X>)^2> =

    <(X - m + m - <X>)^2> =

    <(X-m)^2> + <(m - <X>)^2>

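    + 2 <(X-m)(m-<X>)>

    The last term is zero when the residuals X - m average to zero and are uncorrelated with the model values m, as they are for a least-squares fit that includes an intercept. Matching averages alone, <X> = <m>, is not enough.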
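
The decomposition can be tried on numbers. The sketch below is a rough check under stated assumptions: the data, the hand-made predictions `m_guess`, the predictor `t`, and the helper names are all illustrative and not part of the reply. The cross term is computed as the average of the product (X - m)(m - <X>).

```python
def avg(values):
    """Arithmetic mean, written <...> in the reply."""
    return sum(values) / len(values)


def decompose(x, m):
    """Return (<(X-<X>)^2>, <(X-m)^2>, <(m-<X>)^2>, 2<(X-m)(m-<X>)>)."""
    x_bar = avg(x)
    total = avg([(xi - x_bar) ** 2 for xi in x])
    error = avg([(xi - mi) ** 2 for xi, mi in zip(x, m)])
    model = avg([(mi - x_bar) ** 2 for mi in m])
    cross = 2 * avg([(xi - mi) * (mi - x_bar) for xi, mi in zip(x, m)])
    return total, error, model, cross


# Illustrative data (assumed for this sketch).
x = [2.0, 3.0, 3.0, 4.0, 5.0, 8.0, 9.0, 11.0, 11.0, 13.0]

# Hand-made predictions with the same average as x, but not a least-squares fit.
m_guess = [2.0, 3.0, 4.0, 5.0, 6.0, 7.0, 9.0, 10.0, 11.0, 12.0]

# Least-squares line against a hypothetical predictor t = 0, 1, ..., 9.
t = [float(i) for i in range(len(x))]
t_bar, x_bar = avg(t), avg(x)
slope = sum((ti - t_bar) * (xi - x_bar) for ti, xi in zip(t, x)) / sum(
    (ti - t_bar) ** 2 for ti in t
)
m_fit = [x_bar + slope * (ti - t_bar) for ti in t]

for name, m in (("guess", m_guess), ("fit", m_fit)):
    total, error, model, cross = decompose(x, m)
    print(name, "total:", total, "error + model:", error + model, "cross:", cross)
```

With these numbers both models have the same average as the data, yet only the least-squares line gives a cross term of zero (up to rounding); for the hand-made predictions the cross term is about 2.8, so the total exceeds error + model by that amount.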

  2. I don't follow this:

    <<X>^2> = <X^2>

    Of course, if k is constant and x is variable:

    <kx> = k<x>
    <k> = k
    <k^2> = k^2

    but...

    <x^2> != <x>^2

  3. Sorry, that was a typo.

    I meant to write:

    <<X>^2> = <X>^2
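
The distinction discussed in this exchange is easy to see on a few numbers. A minimal sketch, assuming a small made-up data set and a helper named `avg` (both hypothetical):

```python
def avg(values):
    """Arithmetic mean, written <...> in the thread."""
    return sum(values) / len(values)


x = [1.0, 2.0, 4.0, 7.0]  # hypothetical data
x_bar = avg(x)            # <X>

mean_of_squares = avg([xi**2 for xi in x])         # <X^2>
square_of_mean = x_bar**2                          # <X>^2
avg_of_constant = avg([x_bar**2 for _ in x])       # <<X>^2>, average of a constant
avg_x_times_mean = avg([xi * x_bar for xi in x])   # <X <X>>
variance = avg([(xi - x_bar) ** 2 for xi in x])    # <(X - <X>)^2>

print("<X^2>          =", mean_of_squares)
print("<X>^2          =", square_of_mean)
print("<<X>^2>        =", avg_of_constant)
print("<X <X>>        =", avg_x_times_mean)
print("<(X - <X>)^2>  =", variance, "vs <X^2> - <X>^2 =", mean_of_squares - square_of_mean)
```

Here <<X>^2> and <X <X>> both equal <X>^2, while <X^2> is larger; the gap between <X^2> and <X>^2 is exactly <(X - <X>)^2>.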

  4. I don't follow this at all:
    <(X - m + m - <X>)^2> = <(X-m)^2> + <(m - <X>)^2> + 2 <(X-m)(m-<X>)>

    Trying to follow your logic, for the left side:

    <(X - m + m - <X>)^2>
    = <(X - <X>)^2>
    = <x^2> - <x>^2
    = SST

    For SSM + SSE:

    <(x - m)^2> + <(m - <x>)^2>
    = <x^2 - 2xm + m^2> + <m^2 - 2m<x> + <x>^2>
    = <x^2> - 2<xm> + <m^2> + <m^2> - 2<m><x> + <x>^2
    = <x^2> + <x>^2 + 2<m^2> - 2<xm> - 2<m><x>

    And I'm stuck...
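
One way to get unstuck is to keep <x^2> and <x>^2 as separate terms and subtract one side from the other. A sketch in the thread's notation, written in LaTeX with \langle \rangle for the averages; it assumes nothing about the model m beyond it having one value per data point:

```latex
\begin{align*}
\langle (x-m)^2\rangle + \langle (m-\langle x\rangle)^2\rangle
  &= \langle x^2\rangle + \langle x\rangle^2 + 2\langle m^2\rangle
     - 2\langle xm\rangle - 2\langle m\rangle\langle x\rangle \\
\langle (x-\langle x\rangle)^2\rangle
  &= \langle x^2\rangle - \langle x\rangle^2 \\
\text{first line} - \text{second line}
  &= 2\langle x\rangle^2 + 2\langle m^2\rangle
     - 2\langle xm\rangle - 2\langle m\rangle\langle x\rangle \\
  &= -2\,\bigl\langle (x-m)\,(m-\langle x\rangle)\bigr\rangle
\end{align*}
```

So the two sides agree exactly when <(x - m)(m - <x>)> = 0. That cannot be shown by algebra alone; it is a property of the model. For a least-squares fit with an intercept, the residuals x - m average to zero and are uncorrelated with the fitted values m, which makes this term vanish.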

