Assignment 2

All questions are worth 5 points. The number in the the starting parenthesis indicate the corresponding exercise number in “Applied Linear Statistical Models”.

  1. (1.2) The members of a health spa pay annual membership dues of $300 plus a charge of $2 for each visit to the spa. Let Y denote the dollar cost for the year for a member and X the number of visits by the member during the year. Express the relation between X and Y mathematically. Is it a functional relation or a statistical relation (that is, is the relation deterministic or stochastic)?

  2. (1.6) Suppose the regression parameters are \(\beta_0 = 200\) and \(\beta_1 = 5.0\).

    1. Plot the regression equation.
    2. Predict the response for X = 10, 20, and 40.
    3. Explain the meaning of parameters \(\beta_0\) and \(\beta_1\).
  3. (1.10) An analyst in a large corporation studied the relation between current annual salary (Y ) and age (X) for the 46 computer programmers presently employed in the company. The analyst concluded that the relation is curvilinear, reaching a maximum at 47 years. Does this imply that the salary for a programmer increases until age 47 and then decreases? Explain.

  4. The time it takes to transmit a file always depends on the file size. Suppose you transmitted 30 files, with the average size of 126 Kbytes and the standard deviation of 35 Kbytes. The average transmittance time was 0.04 seconds with the standard deviation of 0.01 seconds. The correlation coefficient between the time and the size was 0.86.

    Based on this data, fit a linear regression model and predict the time it will take to transmit a 400 Kbyte file.

  5. At a gas station, 180 drivers were asked to record the mileage of their cars and the number of miles per gallon. The results are summarized in the table.

    Sample mean Standard deviation
    Mileage 24,598 14,634
    Miles per gallon 23.8 3.4

    The sample correlation coefficient is \(r = -0.17\)

    1. Compute the least squares regression line which describes how the number of miles per gallon depends on the mileage.
    2. What do the obtained slope and intercept mean in this situation?
    3. You purchase a used car with 35,000 miles on it. Predict the number of miles per gallon.
  6. Show that the sample intercept \(b_0\) is a linear and unbiased estimator of the population intercept \(\beta_0\)

  7. (1.19) Grade point average. The director of admissions of a small college selected 120 students at random from the new freshman class in a study to determine whether a students grade point average (GPA) at the end of the freshman year (Y) can be predicted from the ACT test score (X). The Full data set is available here. To read the text file you can use the R function

    CH01PR19 <- read.table("data/CH01PR19.txt", 
                            col.names = c("gpa", "act"))
     
    1. Obtain the least squared estimate of \(\beta_0\) and \(\beta_1\) and state the estimated regression function.
    2. Plot the estimated regression function and the data. Does the estimated regression function appear to fit the data well?
    3. Obtain a point estimate of the mean freshman GPA for students with ACT test score X = 30.
    4. What is the point estimate of the change in the mean response when the entrance test score increases by one point?