Lehigh's Math 338
Math 338 & Stat 438 at Lehigh University
Math 338/Stat 438 (Regression Analysis): Least square principles in
multiple regression and their interpretations; estimation, hypothesis testing,
confidence interval and prediction interval; residual analysis,
multicollinearity, selection of regression models; comparison of data sets,
analysis of variance and covariance; simultameous inference procedures. Use of
computer packages for statistical analysis. Prerequisite: Math 12 or Math 231.
Here is a typical course description of Regession Analysis at Lehigh
University.
This is the home page for Spring 1996 Math 338 ( Regression Analysis )
at Lehigh University.
The text is `Regression Analysis : Concepts and Applications' by
F. A. Graybill and H. K. Iyer. More information will follow later (or
call me at (610) 758-3735).
Instructor: Professor Wei-Min Huang, Department of Mathematics.
To contact me, send e-mail to wh02@Lehigh.EDU
,
or click here
Mail-Me, if your web server supports
forms.
Click for Lehigh's home page or for
Lehigh Mathematics Department home page.
For now the Homework Assignments are:
- HW #1: Problem 1.6.1 parts (a) and (i), (Text, page 46); Exercises
1.10.5, 1.10.6, 1.10.8 (Text, page 70). Answer all questions without
using MINITAB or SAS.
- HW #2: Problems 1.10.9, 1.10.10, 1.10.11 (Text, page 71); Problems
M1.6.1 - M1.6.10 (Lab Book, page 25), use MINITAB to answer these
problems from the Lab Book.
- HW #3: Show that the beta-sub-o star and beta-sub-1 star we derived
from the first-derivetive equations is the solution to the Minimization
based on the total squared errors between the responses and the fitted
values. Use the hint I gave you in class.
- HW #4: Textboob, 2.4.4, 6, 7, 8, 10, 12; LabM, M2.3.1, 11, 12, 14, 15.
(Don't us Simple Linear Regression Model yet in solving these
problems.)
- HW #5: Refer to Task 3.4.1 and crystal.dat on page 118 in the Textbook.
(1) Use MINITAB to verify numerically the values of SXY, SSX, MSE, and
the LS estimates of beta-o and beta-1.
(2) (a) Use MINITAB matrix and vector (column) operations to verify the LS
estimate of the beta-vector. (b) Find the estimated values of y-sub-i
and the residuals. Also show that (using MINITAB) sum of residuals is
zero and sum of residuals*x-sub-i is also zero.
(3) (a) Obtain a scater-plot of the pairs (x-sub-i, y-sub-i), for
i=1,...,14. Then super-impose the estimated points (x-sub-i,
y-estimated-i) on the same plot. Use "mplot" in MINITAB. (b) Find the
correlation coefficient of (x-sub-i, y-sub-i), i=1,...,14. Summarize
your findings from (a) and (b).
(See handout for details of this HW assignment.)
- HW #6: Problems 3.12.3 (page 215, Text, refer to chol.dat);
Problem 3.12.4 (omit part (e)).
- HW #7: See the instructor.
- HW #8: Problem 4.12.1 (Textbook, page 335); Read Chapters 3 and 4 in
the Lab Book and do problems M3.5.1,2,3 also obtain the residual plots
using (a) ordinary residuals and (b) standardized residuals.
- HW #9: (1) M3.6.5,6,8,10,11 (Lab Manual, page 62); (2) M3.7.2, page 65;
(3) (Refer to Bodyfat.dat in the handout). For testing Ho :
beta1 + 2*beta3 = 0.1 show numerically that t^2 = F where t = (beta1-hat
+ 2*beta3-hat - 0.1) / SE(beta1-hat + 2*beta3-hat) and F =
[ (SSE(reduced)-SSE(full))/(df(reduced)-df(full) ] / [ SSE(full)/df(full) ].
- HW #10: (1) Problem 4.12.2, Textbook, page 337. Don't use MINITAB's
MACROS for this problem; (2) M4.11.1, Lab Manual, page 100. Use
MINITAB's MACROS to answer this problem.
- HW #11: (1) Problem 3.6.4 (c), Textbook; (2) Problem 4.12.3 (a)-(g).
- HW #12: (1) Problem 1 from the Handout on "One-Way ANOVA and
Multiple Linear Regression"; (2) Analyze the Ski Resort Data (ski.dat).
Especially determine 3 ANOVA Tables and 3 F-tests for models using (a) X1
alone (b) X2 alone, and (c) both X1 and X2. Discuss your findings.
- HW #13: Two problems, see Handout.
- HW #14: (1) Exercise 5.6.1 , Textbook, pages 399-402; (2) Refer to
Hospital Data (hospital.dat). (a) Find the correlation coefficient
matrix of the 3 regressors X1, X2, and X3. (b) Find the condition
indices and the condition number of the matrix in (a). Discuss the results;
(3) Refer to Hospital Data. Find the Cook's distances and the
studentized deleted residuals. Determine all the possible influential
observations or possible outliers.
- HW #15: Refer to Hospital Data Set (hospital.dat).
(a) Use ridge regression to estimate the regression coefficients.
Use delta (the shrinkage or biasing parameter) on the basis of the stability
of the estimated regression coefficients; (b) Obtain the regression line
using all 30 observations. Also list the COOK, studentized deleted residuals
and DFFITS. Then identify all possible influential observations and outliers;
(c) Obtain the regression line using 29 observations (with #30
deleted). Also list the COOK, studentized deleted residuals and DFFITS.
Then identify all possible influential observations and outliers.
- HW #16: Refer to Hospital Data Set. (1) Perform all-subsets
regression and
determine the best possible subset. (2) Perform forward-regression and
determine the best possible combination of regressors. (3) Perform the
principal component analysis and then eliminate the necessary components
(using t-test). Based on the remaining components to back estimate the
coefficients of the original regression model.
To go back
click here