Statistics Lookup

  1. Monte Carlo Simulation for Standard Errors of IV and intercept
  2. Linear Regression on GDP and IMF
  3. NonLinear Regression on GDP and IMF
  4. Linear Regression on GDP and Redundant GDP (1/3 times GDP)
  5. Linear Regression on GDP, IMF, and Gini
  6. NonLinear Regression on GDP, IMF, and Gini
  7. Linear Regression on GDP
  8. Linear Regression on Random Numbers
  9. Linear Regression on

Monte Carlo Simulation for Standard Errors of IV and intercept

  • For the Linear Regression on GDP:
    • The data consists of 167 paired x’s and y’s
    • The regression is: y = 66.5212 + 0.000142379 $
    • The Parameter Table (coefficient estimates, standard errors, t-statistics, and p-values) is not reproduced here
  • A Monte Carlo simulation, run with 10,000 iterations, estimated the standard errors as
    • Standard error of intercept = 0.593747
    • Standard error of the variable $ = 0.0000102458
  • Monte Carlo simulation for finding the standard error in general
    • Define the population parameter for which the standard error is sought.
    • Loop thousands of times:
      • Take an n-size random sample from the population
      • Compute the statistic of interest for the sample
      • Store the computed sample statistic
    • End Loop
    • The standard error is approximated by the SD of all the computed sample statistics.
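The general loop can be sketched in a few lines of Python. This is a toy illustration, not the notes' GDP example: the population is a hypothetical normal with mean 50 and SD 10, the statistic is the sample mean, and for samples of size n = 25 the textbook standard error of the mean is 10/√25 = 2.

```python
import random
import statistics

random.seed(1)

# Hypothetical population: normal with mean 50, SD 10.
# Statistic of interest: the sample mean.
n = 25
sample_stats = []
for _ in range(4000):                                  # loop thousands of times
    sample = [random.gauss(50, 10) for _ in range(n)]  # n-size random sample
    sample_stats.append(statistics.mean(sample))       # compute and store the statistic

# The SD of all the computed sample statistics approximates the standard error.
se_estimate = statistics.stdev(sample_stats)
print(se_estimate)   # close to the textbook value of 2
```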
  • Monte Carlo simulation for finding the standard errors for IV and intercept
    • Define the dataset and regression for which standard errors are sought.
      • We’ll call these the given dataset and the given regression
    • Loop thousands of times:
      • Generate a dataset of random x’s and y’s analogous to the x’s and y’s of the given dataset (this is the tricky part — see below)
      • Do a regression on the analogous dataset
      • Store the a and b coefficients of the regression equation
    • End Loop
    • Calculate the SD of all the a coefficients — that approximates the standard error of the intercept
    • Calculate the SD of all the b coefficients — that approximates the standard error of the IV
  • The Tricky part
    • We want to make a dataset of x’s and y’s relevantly like, but not identical with, the x’s and y’s of the given dataset.
    • The x’s are easy: Use random numbers from the normal distribution with mean and SD of the given x’s. 
    • The y’s are tricky: A y has to bear approximately the same relation to its paired x as do the given y’s to their paired x’s.
      • First, use the given regression equation, and the random x, to compute a predicted y.
      • Then get a random residual from the normal distribution with mean and SD of the given residuals.
      • The y you want is the predicted y minus the random residual (adding it would work equally well, since the residual distribution is symmetric about zero).
  • Mathematica Code
    • big$iterations = 10000;
    • bigout = List[];
    • dc = Length[regress["Data"]];
    • serorig = Sqrt[Total[regress["FitResiduals"]^2]/(dc - (1 + 1))];
    • sdforx = Sqrt[N[Total[Table[(regress["Data"][[i, 1]] - Mean[regress["Data"][[All, 1]]])^2, {i, 1, dc}]]/(dc - 1)]];
    • meanforx = N[Mean[regress["Data"][[All, 1]]]];
    • Do[
      • inside$iterations = dc;
      • sampstat = List[];
      • Do[
        • resrand=RandomVariate[NormalDistribution[0,serorig]];
        • xrand = RandomVariate[NormalDistribution[meanforx, sdforx]];
        • predicty = regress["Function"][xrand];
        • observedy=predicty-resrand;
        • sampstat =Append[sampstat, {xrand,observedy} ];,
        • {inside$iterations}];
      • sampregress = LinearModelFit[sampstat, x, x];
      • tmpa = sampregress["ParameterTableEntries"][[1, 1]];
      • tmpb = sampregress["ParameterTableEntries"][[2, 1]];
      • bigout =Append[bigout, {tmpa,tmpb} ];,
      • {big$iterations}]
    • regress["ParameterTable"]
    • StandardDeviation[bigout[[All,1]]]
    • StandardDeviation[bigout[[All,2]]]
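For readers without Mathematica, the same simulation can be sketched in plain Python. The "given dataset" below is a stand-in generated from a hypothetical model y = 3 + 2x + noise, since the GDP data itself isn't reproduced in these notes; everything else mirrors the loop above (random x from the given x's distribution, predicted y from the given regression, minus a random residual).

```python
import math
import random
import statistics

random.seed(2)

def ols(pairs):
    """Simple least-squares fit; returns (a, b, residual SE) for y = a + b*x."""
    n = len(pairs)
    xbar = statistics.mean(p[0] for p in pairs)
    ybar = statistics.mean(p[1] for p in pairs)
    sxx = sum((x - xbar) ** 2 for x, _ in pairs)
    sxy = sum((x - xbar) * (y - ybar) for x, y in pairs)
    b = sxy / sxx
    a = ybar - b * xbar
    sse = sum((y - (a + b * x)) ** 2 for x, y in pairs)
    return a, b, math.sqrt(sse / (n - 2))

# The "given dataset": a stand-in for the GDP data.
n = 100
given = [(x, 3 + 2 * x + random.gauss(0, 1))
         for x in (random.gauss(10, 2) for _ in range(n))]
a, b, serorig = ols(given)
meanforx = statistics.mean(x for x, _ in given)
sdforx = statistics.stdev(x for x, _ in given)

# The Monte Carlo loop from the notes.
big_iterations = 2000
bigout = []
for _ in range(big_iterations):
    sampstat = []
    for _ in range(n):
        xrand = random.gauss(meanforx, sdforx)            # analogous x
        predicty = a + b * xrand                          # given regression equation
        observedy = predicty - random.gauss(0, serorig)   # minus a random residual
        sampstat.append((xrand, observedy))
    ta, tb, _ = ols(sampstat)                             # regress the analogous dataset
    bigout.append((ta, tb))

se_intercept = statistics.stdev(p[0] for p in bigout)
se_slope = statistics.stdev(p[1] for p in bigout)
print(se_intercept, se_slope)
```

With a few thousand iterations the two standard deviations land within a few percent of the closed-form OLS standard errors (for the slope, s/√Sxx), which is a handy sanity check on the simulation.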

Linear Regression on GDP and IMF

  • Regression Equation: y = 68.4357 – 40.3304 m + 0.000124256 $
  • DataPoints = 167
  • NbrofIVs = 2
  • Sum of Squares Equation: 5912.81 + 3764.14 = 9676.95
    • SSR + SSE = SST, where SSR, SSE, and SST are the sums of squares for
      • predicted y’s
      • residuals
      • observed y’s
  • Standard Error of the Regression = 4.79083
    • = √(SSE / (Datapoints – (NbrofIVs + 1)))
  • R-Squared = 0.61102
    • = SSR / SST
  • Adjusted R-Squared = 0.606276
  • AICc = 1002.45
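The arithmetic behind these summary statistics can be checked directly from the numbers above (SSE = 3764.14, SST = 9676.95, 167 data points, 2 IVs); the same recipe applies to every regression summary in these notes.

```python
import math

sse, sst = 3764.14, 9676.95
ssr = sst - sse                      # SSR + SSE = SST
n, k = 167, 2                        # data points, number of IVs

se_regression = math.sqrt(sse / (n - (k + 1)))   # √(SSE / (n − (k + 1)))
r_squared = ssr / sst                            # SSR / SST
adj_r_squared = 1 - (1 - r_squared) * (n - 1) / (n - (k + 1))

print(round(se_regression, 5))   # 4.79083
print(round(r_squared, 5))       # 0.61102
print(round(adj_r_squared, 6))   # 0.606276
```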

NonLinear Regression on GDP and IMF

  • Regression Equation: y = 18.3278 – 21.2782 m + 5.38715 Log[x]
  • DataPoints = 167
  • NbrofIVs = 2
  • Sum of Squares Equation: 6765.65 + 2911.3 = 9676.95
    • SSR + SSE = SST, where SSR, SSE, and SST are the sums of squares for
      • predicted y’s
      • residuals
      • observed y’s
  • Standard Error of the Regression = 4.21329
    • = √(SSE / (Datapoints – (NbrofIVs + 1)))
  • R-Squared = 0.699151
    • = SSR / SST
  • Adjusted R-Squared = 0.695482
  • AICc = 959.546

Linear Regression on GDP and Redundant GDP (1/3 times GDP)

  • Regression Equation: y = 66.5212 + 0.000213569 r + 0.0000711895 $
  • DataPoints = 167
  • NbrofIVs = 2
  • Sum of Squares Equation: 5242.57 + 4434.37 = 9676.95
    • SSR + SSE = SST, where SSR, SSE, and SST are the sums of squares for
      • predicted y’s
      • residuals
      • observed y’s
  • Standard Error of the Regression = 5.19989
    • = √(SSE / (Datapoints – (NbrofIVs + 1)))
  • R-Squared = 0.541759
    • = SSR / SST
  • Adjusted R-Squared = 0.536171
  • AICc = 1029.82
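Since r is defined as GDP/3, the two coefficients here are not separately meaningful — only the combined slope per unit of GDP is pinned down, and it reproduces exactly the slope from the simple Linear Regression on GDP (0.000142379). A quick arithmetic check, with the coefficients copied from the equation above:

```python
# Coefficients from the GDP + redundant-GDP regression above.
b_r = 0.000213569        # coefficient on r = GDP/3
b_gdp = 0.0000711895     # coefficient on GDP

# Each unit of GDP contributes b_gdp directly plus b_r/3 through r,
# so the effective slope per unit of GDP is:
effective_slope = b_r / 3 + b_gdp
print(effective_slope)   # ≈ 0.000142379, the simple-GDP slope
```

Any split (b_r, b_gdp) with b_r/3 + b_gdp equal to that value yields identical predictions — the hallmark of a redundant predictor.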

Linear Regression on GDP, IMF, and Gini

  • Regression Equation: y = 73.3606 – 12.435 g – 38.764 m + 0.00011775 $
  • DataPoints = 167
  • NbrofIVs = 3
  • Sum of Squares Equation: 6057.69 + 3619.26 = 9676.95
    • SSR + SSE = SST, where SSR, SSE, and SST are the sums of squares for
      • predicted y’s
      • residuals
      • observed y’s
  • Standard Error of the Regression = 4.71212
    • = √(SSE / (Datapoints – (NbrofIVs + 1)))
  • R-Squared = 0.625992
    • = SSR / SST
  • Adjusted R-Squared = 0.619108
  • AICc = 998.044

NonLinear Regression on GDP, IMF, and Gini

  • Regression Equation: y = 25.0246 – 20.32 m – 11.8055 g + 5.16427 Log[x]
  • DataPoints = 167
  • NbrofIVs = 3
  • Sum of Squares Equation: 6897.47 + 2779.48 = 9676.95
    • SSR + SSE = SST, where SSR, SSE, and SST are the sums of squares for
      • predicted y’s
      • residuals
      • observed y’s
  • Standard Error of the Regression = 4.12941
    • = √(SSE / (Datapoints – (NbrofIVs + 1)))
  • R-Squared = 0.712773
    • = SSR / SST
  • Adjusted R-Squared = 0.707487
  • AICc = 953.955

Linear Regression on GDP

  • Regression Equation: y = 66.5212 + 0.000142379 x
  • DataPoints = 167 and Independent Variables = 1
  • Correlation between GDP and LE = 0.736
  • Sum of Squares Equation: 5242.57 + 4434.37 = 9676.95
  • R-Squared = 0.541759 and Adjusted R-Squared = 0.538982
  • Standard Error of the Regression = 5.18411
  • AICc = 1027.7
  • Analysis
  • Correlation
    • Pretty good correlation between GDP and Life Expectancy: 0.736
  • Graphs
    • The data curves to the right, suggesting that a nonlinear regression might do better.
  • Standard Error of the Regression
    • We’ll see how 5.18411 compares to later regressions.
  • P-Values for the coefficients’ t-Statistics
    • Very low — the coefficients are highly significant. But that assumes the form of the regression equation is correct.
  • R-Squared and Adjusted R-Squared
    • R-Squared is 0.541759, meaning that the regression captures about 54% of the life expectancy scatter. That’s fine, but not great.
  • AICc
    • We’ll see how 1027.7 compares to later regressions.
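For a regression with one IV, R-Squared is simply the square of the correlation — 0.736² ≈ 0.5417, which matches the 0.541759 above once rounding of the correlation is accounted for. A small demonstration on hypothetical data (not the GDP data):

```python
import math
import random

random.seed(3)

# Hypothetical paired data with some linear relationship.
xs = [random.gauss(0, 1) for _ in range(200)]
ys = [1.5 * x + random.gauss(0, 2) for x in xs]

n = len(xs)
xbar, ybar = sum(xs) / n, sum(ys) / n
sxx = sum((x - xbar) ** 2 for x in xs)
syy = sum((y - ybar) ** 2 for y in ys)               # SST
sxy = sum((x - xbar) * (y - ybar) for x, y in zip(xs, ys))

corr = sxy / math.sqrt(sxx * syy)                    # Pearson correlation

# R-Squared from the regression decomposition: SSR / SST.
b = sxy / sxx
a = ybar - b * xbar
ssr = sum((a + b * x - ybar) ** 2 for x in xs)
r_squared = ssr / syy

print(corr ** 2, r_squared)   # identical up to floating-point error
```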

Linear Regression on Random Numbers

  • Analysis
  • Correlation
    • Terrible: 0.179154
  • Graphs
    • Random dots
  • Standard Error of the Regression
    • 7.53431
  • P-Values for the coefficients’ t-Statistics
    • 0.02 for x — nominally significant, even though the data are random
  • R-Squared and Adjusted R-Squared
    • R-Squared = 0.032096; SSE / SST = 0.967904 — almost none of the scatter is captured
  • AICc
    • 1152.57

Linear Regression on

  • I’ll evaluate the regressions using
    • Graphs
    • Correlations
    • The following metrics
      • Standard Error of the Regression
      • t-Statistics (and their p-values) for the coefficients
      • R-Squared
      • Adjusted R-Squared
      • AICc