Consider the simple regression yi = ï¢xi + ï¥i where E[ï¥/x] = 0 and E[ï¥2 / x] = ï³2
a. What is the minimum mean squared error linear estimator of ï¢? [Hint: Let the estimator be (ï¢n = câ²y). Choose c to minimize Var(ï¢) + (E(ï¢ - ï¢))2. The answer is a function of the unknown parameters.]
b. For the estimator in part a, show that ratio of the mean squared error of ï¢ to that of the ordinary least squares estimator b is
Note that ï´ï is the population analog to the ât ratioâ for testing the hypothesis that ï¢ = 0, which is
> We examined Ray Fair’s famous analysis (Journal of Political Economy, 1978) of a Psychology Today survey on extramarital affairs in Example 18.18 using a Poisson regression model. Although the dependent variable used in that study was a count, Fair (1978
> Appendix Table F18.3 contains data on ship accidents reported in McCullagh and Nelder (1983). The data set contains 40 observations on the number of incidents of wave damage for oceangoing ships. Regressors include aggregate months of service, and three
> The GSOEP data are an unbalanced panel, with 7,293 groups. Continue your analysis in Application 3 by fitting the Poisson model with fixed and with random effects and compare your results. (Recall, like the linear model, the Poisson fixed effects model m
> Several applications in the preceding chapters using the German health care data have examined the variable DocVis, the reported number of visits to the doctor. The data are described in Appendix Table F7.1. A second count variable in that data set that
> Continuing the analysis of the first application, we now consider the self-reported rating, v1. This is a natural candidate for an ordered choice model, because the simple five-item coding is a censored version of what would be a continuous scale on some
> Appendix Table F17.2 provides Fair’s (1978) Redbook Magazine survey on extramarital affairs. The variables in the data set are as follows: and three other variables that are not used. The sample contains a survey of 6,366 married women
> Appendix Table F17.2 provides Fair’s (1978) Redbook survey on extramarital affairs. The data are described in Application 1 at the end of chapter 18 and in Appendix F. The variables in the data set are as follows: and three other variab
> Data on U.S. gasoline consumption for the years 1953 to 2004 are given in Table F2.2. Note the consumption data appear as total expenditure. To obtain the per capita quantity variable, divide GASEXP by GASP times Pop. The other variables do not need tran
> Consider a model for the mix of male and female children in families. Let Ki denote the family size (number of children), Ki = 1………. Let Fi denote the number of female children, Fi = 0â
> A regression model that describes income as a function of experience is /
> Does the Wald statistic reject the null hypothesis too often? construct a Monte carlo study of the behavior of the Wald statistic for testing the hypothesis that g equals zero in the model of Section 15.5.1. Recall that the Wald statistic is the square o
> (This application will require an optimizer. Maximization of a user-supplied function is provided by commands in Stata, R, SAS, EViews or NLOGIT.) Use the following pseudo-code to generate a random sample of 1,000 observations on y from a mixed normals p
> The geometric distribution used in Examples 14.13, 14.17, 14.18, and 14.22 would not be the typical choice for modeling a count such as DocVis. The Poisson model suggested at the beginning of section 14.11.1 would be the more natural choice (at least at
> Binary Choice. This application will be based on the health care data analyzed in Example 14.13 and several others. Details on obtaining the data are given in Appendix F Table 7.1. We consider analysis of a dependent variable, y, that ta
> The data in Appendix Table F6.1 are an unbalanced panel on 25 U.S. airlines in the pre-deregulation days of the 1970s and 1980s. The group sizes range from 2 to 15. Data in the file are the following variables. (Variable names contained in the data file
> The data in Appendix Table F10.4 were used by Grunfeld (1958) and dozens of researchers since, including Zellner (1962, 1963) and Zellner and Huang (1962), to study different estimators for panel data and linear regression systems. [See Kleiber and Zeile
> Several applications in this and previous chapters have examined the returns to education in panel data sets. Specifically, we applied Hausman and Taylor’s approach in Examples 11.17 and 11.18. Example 11.18 used Cornwell and Rupert&aci
> Using the estimated health outcomes model in Example 10.8, determine the expected values of ln Income and Health Satisfaction for a person with the following characteristics: Female = 1, Working = 1, Public = 1, AddOn = 0, Education = 14, Married = 1, HH
> Christensen and Greene (1976) estimated a “generalized Cobb–Douglas” cost function for electricity generation of the form ln C = a + b ln Q + g[1 (ln Q)2] + dk ln Pk + dl ln Pl + df ln Pf + e. Pk, Pl, and Pf indicate unit prices of capital, labor, and fu
> Using the cost function estimates in Example 10.2, we obtained an estimate of the efficient scale, / We can use the delta method in Section 4.5.4 to compute an asymptotic standard error for the estimator of Q* and a confidence interval. The estimators of
> Reverse regression. A method of analyzing statistical data to detect discrimination in the workplace is to fit the regression y = a + x' + d + , (1) where y is the wage rate and d is a dummy variable indicating either membership (d = 1) or nonmember s
> Partial Frisch and Waugh. In the least squares regression of y on a constant and X, to compute the regression coefficients on X, we can first transform y to deviations from the mean y and, likewise, transform each column of X to deviations from the respe
> Change in the sum of squares. suppose that b is the least squares coefficient vector in the regression of y on X and that c is any other K *1 vector. Prove that the difference in the two sums of squared residuals is (y - Xc)=(y - Xc) - (y - Xb)=(y - Xb)
> The two-variable regression. For the regression model y = a + bx + e, a. show that the least squares normal equations imply iei = 0 and ixiei = 0. b. show that the solution for the constant term is a = y - bx. c. show that the solution for b is b = 3
> The data in Appendix Table f5.2 may be used to estimate a small macroeconomic model. Use these data to estimate the model in Example 10.5. Estimate the parameters of the two equations by two-stage and three-stage least squares.
> Reverse regression continued. This and the next exercise continue the analysis of Exercise 2. In Exercise 2, interest centered on a particular dummy variable in which the regressors were accurately measured. Here we consider the case in which the crucial
> A regression model with K = 16 independent variables is fit using a panel of seven years of data. The sums of squares for the seven separate regressions and the pooled regression are shown below. The model with the pooled data allows a separate constant
> Prove that under the hypothesis that RB = q, the estimator where J is the number of restrictions, is unbiased for s2.
> Use the test statistic defined in Exercise 7 to test the hypothesis in Exercise 1.
> An alternative way to test the hypothesis RB - q = 0 is to use a Wald test of the hypothesis that L* = 0, where L* is defined in (5-23). Prove that Note that the fraction in brackets is the ratio of two estimators of s2. By virtue of (5-28) and the prece
> Prove the result that the R2 associated with a restricted least squares estimator is never larger than that associated with the unrestricted least squares estimator. conclude that imposing restrictions never improves the fit of the regression.
> Prove the result that the restricted least squares estimator never has a larger covariance matrix than the unrestricted least squares estimator.
> The expression for the restricted coefficient vector in (5-23) may be written in the form b* = [I - CR]b + w, where w does not involve b. What is C? show that the covariance matrix of the restricted least squares estimator is s2(X'X)-1 - s2(X'X)-1R'[R(X'
> The regression model to be analyzed is y = X1B1 + X2B2 + E , where X1 and X2 have K1 and K2 columns, respectively. The restriction is B2 = 0. a. Using (5-23), prove that the restricted estimator is simply [b1*, 0], where b1* is the least squares coeffici
> Using the results in Exercise 1, test the hypothesis that the slope on x1 is 0 by running the restricted regression and comparing the two sums of squared deviations.
> The Grunfeld investment data in Appendix Table 10.4 constitute a classic data set that has been used for decades to develop and demonstrate estimators for seemingly unrelated regressions.28 Although somewhat dated at this juncture, they remain an ideal a
> Show that the model of the alternative hypothesis in Example 5.7 can be written As such, it does appear that H0 is a restriction on H1. However, because there are an infinite number of constraints, this does not reduce the test to a standard test of rest
> The log likelihood function for the linear regression model with normally distributed disturbances is shown in (14-39) in section 14.9.1. show that at the maximum likelihood estimators of b for B and e'e/n for s2, the log likelihood is an increasing func
> Suppose the true regression model is given by (4-7). The result in (4-9) shows that if pX.z is nonzero and g is nonzero, then regression of y on X alone produces a biased and inconsistent estimator of B. suppose the objective is to forecast y, not to est
> Show that in the multiple regression of y on a constant, x1 and x2 while imposing the restriction b1 + b2 = 1 leads to the regression of y - x1 on a constant and x2 - x1.
> A multiple regression of y on a constant x1 and x2 produces the following results: yn = 4 + 0.4x1 + 0.9x2, R2 = 8/60, e'e = 520, n = 29, Test the hypothesis that the two slopes sum to 1.
> For the classical normal regression model y = Xβ +( with no constant term and K regressors, assuming that the true value of b is zero, what is the exact expected value of F[K, n - K] = (R2/K)/[(1 - R2)/(n - K)]?
> Consider the multiple regression of y on K variables X and an additional variable z. Prove that under the assumptions A1 through A6 of the classical regression model, the true variance of the least squares estimator of the slopes on X is larger when z is
> The following sample moments for x = [1, x1, x2, x3] were computed from 100 observations produced using a random number generator: The true model underlying these data is y = x1 + x2 + x3 + e. a. Compute the simple correlations among the regressors. b. C
> As a profit-maximizing monopolist, you face the demand curve Q =  + P + . In the past, you have set the following prices and sold the accompanying quantities: Suppose that your marginal cost is 10. Ba
> Prove that the least squares intercept estimator in the classical regression model is the minimum variance linear unbiased estimator.
> Continuing the analysis of Section 10.3.2, we find that a translog cost function for one output and three factor inputs that does not impose constant returns to scale is The factor share equations are [See Christensen and Greene (1976) for analysis of th
> Suppose that the regression model is yi = + xi + i, where the disturbances i have f(i) = (1/) exp (-i/), i / . This model is rather peculiar in that all the disturbances are assumed to be nonnegative. Note that the disturbances have E[i/xi]
> Suppose that the classical regression model applies but that the true value of the constant is zero. Compare the variance of the least squares slope estimator computed without a constant term with that of the estimator computed with an unnecessary consta
> For the simple regression model yi = + i, i ( N[0, 2], prove that the sample mean is consistent and asymptotically normally distributed. Now consider the alternative estimator mn = w y , w = i = i . Note that // Prove that this is a c
> Let ei be the ith residual in the ordinary least squares regression of y on X in the classical regression model, and let ei be the corresponding true disturbance. Prove that plim(ei - ei) = 0.
> For the classical normal regression model y = Xβ + ( with no constant term and K regressors, what is plim /assuming that the true value of β is zero?
> Prove that / where b is the ordinary least squares estimator and k is a characteristic root of X′X.
> Suppose that you have two independent unbiased estimators of the same parameter /with different variances v1 and v2. What linear combination / is the minimum variance unbiased estimator of /
> Example 4.10 presents a regression model that is used to predict the auction prices of Monet paintings. The most expensive painting in the sample sold for $33.0135M (ln = 17.3124). The height and width of this painting were 35″ and 39.4″, respectively. U
> In Section 4.9.2, we consider regressing y on a set of principal components, rather than the original data. For simplicity, assume that X does not contain a constant term, and that the K variables are measured in deviations from the means and are standar
> Statewide aggregate production function. Continuing Example 10.1, data on output, the capital stocks, and employment are aggregated by summing the values for the individual states (before taking logarithms). The unemployment rate for each region, m, at t
> In (4-13), we find that when superfluous variables X2 are added to the regression of y on X1 the least squares coefficient estimator is an unbiased estimator of the true parameter vector, β = (β′1, 0′)′. Show that, in this long regression, e′e/(n - K1 -
> Consider a data set consisting of n observations, nc complete and nm incomplete, for which the dependent variable, yi, is missing. Data on the independent variables, xi, are complete for all n observations, Xc and Xm. We wish to use the data to estimate
> Is it possible to partition R2? The idea of “hierarchical partitioning” is to decompose R2 into the contributions made by each variable in the multiple regression. That is, if x1, , xK are entered into a regression one at a time, then ck is the increment
> In the December 1969 American Economic Review (pp. 886–896), Nathaniel Leff reports the following least squares regression results for a cross section study of the effect of age composition on savings in 74 countries in 1964: ln S/Y = 7.3439 + 0.1596 ln
> Using the matrices of sums of squares and cross products immediately preceding section 3.2.3, compute the coefficients in the multiple regression of real investment on a constant, GNP, and the interest rate. Compute R2.
> Three variables, N, D, and Y, all have zero means and unit variances. A fourth variable is C = N + D. In the regression of C on Y, the slope is 0.8. In the regression of C on N, the slope is 0.5. In the regression of D on Y, the slope is 0.4. What is the
> Regression without a constant. suppose that you estimate a multiple regression first with, then without, a constant. Whether the R2 is higher in the second case than the first will depend in part on how it is computed. using the (relatively) standard met
> Change in adjusted R2. Prove that the adjusted R2 in (3-30) rises (falls) when variable xk is deleted from the regression if the square of the t ratio on xk in the multiple regression is less (greater) than 1.
> Demand system estimation. Let Y denote total expenditure on consumer durables, nondurables, and services and Ed, En, and Es are the expenditures on the three categories. As defined, Y = Ed + En + Es. Now, consider the expenditure system Ed = ad + bdY + g
> Deleting an observation. A common strategy for handling a case in which an observation is missing data for one or more variables is to fill those missing variables with 0s and add a variable to the model that takes the value 1 for that one observation an
> (We look ahead to our use of maximum likelihood to estimate the models discussed in this chapter in chapter 14.) In Example 9.3, we computed an iterated FGLS estimator using the airline data and the model The weights computed at each iteration were comp
> Adding an observation. A data set consists of n observations contained in Xn and yn. The least squares estimator based on these n observations is bn = (X= X )-1X= y . Another observation, xs and ys, becomes available. Prove that the least squares estimat
> Residual makers. What is the result of the matrix product M1M where M1 is defined in (3-19) and M is defined in (3-14)?
> Find the first two autocorrelations and partial autocorrelations for the MA(2) process
> It is commonly asserted that the Durbin–Watson statistic is only appropriate for testing for first-order autoregressive disturbances. The Durbin–Watson statistic estimates 2(1 - ) where r is the first-order autocorrelation of the residuals. What combina
> Derive the disturbance covariance matrix for the model What parameter is estimated by the regression of the OLS residuals on their lagged values?
> Does first differencing reduce autocorrelation? Consider the models where / Compare the autocorrelation of t in the original model with that of vt in / where
> Prove that the Hessian for the tobit model in (19-14) is negative definite after Olsen’s transformation is applied to the parameters.
> Derive the partial effects for the tobit model with heteroscedasticity that is described in section 19.3.5.b.
> Continuing to use the data in Exercise 1, consider once again only the nonzero observations. Suppose that the sampling mechanism is as follows: y* and another normally distributed random variable z have population correlation 0.7. The two variables, y* a
> Using only the no limit observations, repeat Exercise 2 in the context of the truncated regression model. Estimate / by using the method of moments estimator outlined in Example 19.2. Compare your results with those in the previous exercises.
> This application is based on the following data set. a. compute the OLS regression of y on a constant, x1, and x2. Be sure to compute the conventional estimator of the asymptotic covariance matrix of the OLS estimator as well. b. compute the White esti
> We now consider the tobit model that applies to the full data set. a. Formulate the log likelihood for this very simple tobit model. b. Reformulate the log likelihood in terms of Then derive the necessary conditions for maximizing the log likelihood with
> The following 20 observations are drawn from a censored normal distribution: The applicable model is Exercises 1 through 4 in this section are based on the preceding information. The OLs estimator of  in the context of this tobit mo
> Consider estimation of a Poisson regression model for yi | xi. The data are truncated on the left—these are on-site observations at a recreation site, so zeros do not appear in the data set. The data are censored on the right—any response greater than 5
> For the zero-inflated Poisson (ZIP) model in section 18.4.8, we derived the conditional mean function, / a. For the same model, now obtain / Then, obtain Does the zero inflation produce overdispersion? (That is, is the ratio greater than one?) b. obtai
> We are interested in the ordered probit model. our data consist of 250 observations, of which the responses are using the preceding data, obtain maximum likelihood estimates of the unknown parameters of the model. (Hint: Consider the probabilities as t
> In the panel data models estimated in section 17.7, neither the logit nor the probit model provides a framework for applying a Hausman test to determine whether fixed or random effects is preferred. Explain. (Hint: Unlike our application in the linear mo
> Prove (17-26).
> A data set consists of n = n1 + n2 + n3 observations on y and x. For the first n1 observations, y = 1 and x = 1. For the next n2 observations, y = 0 and x = 1. For the last n3 observations, y = 0 and x = 0. Prove that neither (17-18) nor (17-20) has a so
> The following hypothetical data give the participation rates in a particular type of recycling program and the number of trucks purchased for collection by 10 towns in a small mid-Atlantic state: The town of Eleven is contemplating initiating a recycli
> Construct the Lagrange multiplier statistic for testing the hypothesis that all the slopes (but not the constant term) equal zero in the binomial logit model. Prove that the Lagrange multiplier statistic is nR2 in the regression of (yi - p) on the xs, wh
> In Example 8.5, we have suggested a model of a labor market. From the “reduced form” equation given first, you can see the full set of variables that appears in the model—that is the “endogenous variables,” ln Wageit, and Wksit, and all other exogenous v
> Given the data set estimate a probit model and test the hypothesis that x is not influential in determining the probability that y equals one.
> Suppose that a linear probability model is to be fit to a set of observations on a dependent variable y that takes values zero and one, and a single regressor x that varies continuously across observations. Obtain the exact expressions for the least squa
> A binomial probability model is to be based on the following index function model: The only regressor, d, is a dummy variable. The data consist of 100 observations that have the following: Obtain the maximum likelihood estimators of and b, and estimate
> Suppose the distribution of yi |  is Poisson, We will obtain a sample of observations, yi, , yn. suppose our prior for ï¬ï€ is the inverted gamma, which will imply a. construct the likelihood function,
> Derive the first-order conditions for nonlinear least squares estimation of the parameters in (15-2). How would you estimate the asymptotic covariance matrix for your estimator of /
> The Weibull population has survival function How would you obtain a random sample of observations from a Weibull population? (The survival function equals one minus the cdf.)
> The exponential distribution has density / How would you obtain a random sample of observations from an exponential population?
> Consider sampling from a multivariate normal distribution with mean vector and covariance matrix 2I. The log-likelihood function is show that the maximum likelihood estimators of the parameters are / and Derive the second derivativ
> For random sampling from the classical regression model in (14-3), reparameterize the likelihood function in terms of / Find the maximum likelihood estimators of and δ and obtain the asymptotic covariance matrix of the estimators of these parameters.
> Show that the likelihood inequality in Theorem 14.3 holds for the Poisson distribution used in section 14.3 by showing that / is uniquely maximized at θ = θθ. (Hint: First show that the expectation is / show that the likelihood inequality in Theorem 14