Jeff Sagarin has been providing sports ratings for USA Today since 1985. In baseball his predicted RPG (runs per game) statistic takes into account the entire playerâs offensive statistics, and is claimed to be the best measure of a playerâs true offensive value. The following data show the RPG and a variety of offensive statistics for a Major League Baseball (MLB) season for 20 members of the New York Yankees. The labels on columns are defined as follows: RPG, predicted runs per game statistic; H, hits; 2B, doubles; 3B, triples; HR, home runs; RBI, runs batted in; BB, bases on balls (walks); SO, strikeouts; SB, stolen bases; CS, caught stealing; OBP, on-base percentage; SLG, slugging percentage; and AVG, batting average.
> South Shore Construction builds permanent docks and seawalls along the southern shore of Long Island, New York. Although the firm has been in business only five years, revenue has increased from $308,000 in the first year of operation to $1,084,000 in th
> Air pollution control specialists in southern California monitor the amount of ozone, carbon dioxide, and nitrogen dioxide in the air on an hourly basis. The hourly time series data exhibit seasonality, with the levels of pollutants showing patterns that
> The quarterly sales data (number of copies sold) for a college textbook over the past three years follow. a. Construct a time series plot. What type of pattern exists in the data? b. Use the following dummy variables to develop an estimated regression
> Fowle Marketing Research, Inc., bases charges to a client on the assumption that telephone surveys can be completed within 15 minutes or less. If more time is required, a premium rate is charged. With a sample of 35 surveys, a population standard deviati
> Exercises 1 and 2 used different forecasting methods. Which method appears to provide the more accurate forecasts for the historical data? Explain. Exercise 2: Refer to the time series data in exercise 1. Using the average of all the historical data as
> Consider the following time series data. Whenever a categorical variable such as season has k levels, k 2 1 dummy variables are required. a. Construct a time series plot. What type of pattern exists in the data? b. Use the following dummy variables to
> Consider the following time series. a. Construct a time series plot. What type of pattern exists in the data? b. Use the following dummy variables to develop an estimated regression equation to account for seasonal effects in the data: Qtr1 = 1 if Quar
> The following data show Google revenue from 2008 (period 1) to 2017 (period 10) in billions of dollars (Alphabet, Inc. annual reports). These data are in the file GoogleRevenue. a. Construct a time-series plot. What type of pattern exists? b. Develop a
> Giovanni Food Products produces and sells frozen pizzas to public schools throughout the eastern United States. Using a very aggressive marketing strategy they have been able to increase their annual revenue by approximately $10 million over the past 10
> The following data show the number of Netflix subscribers worldwide for the years 2012 (period 1) to 2017 (period 6) (datawrapper website). The data are in the file NetflixSubscribers. a. Construct a time-series plot. What type of pattern exists in the
> The following data shows the average interest rate (%) for a 30-year fixed-rate mortgage over a ten-year period (FreddieMac website). Period ………………………………………………………………. Interest Rate (%) 1 …………………………………………………………………………………………. 6.41 2 ……………………………………………………………
> Skechers is a performance footwear company headquartered in Manhattan Beach, California. The sales for Skechers (in billions of dollars) for 2012 (period 1) to 2017 (period 6) are in the file SkechersSales (annualreports.com). a. Construct a time-series
> The Seneca Children’s Fund (SCF) is a local charity that runs a summer camp for disadvantaged children. The fund’s board of directors has been working very hard in recent years to decrease the amount of overhead expens
> The general fund revenue receipts for the state of Kentucky for 2003 (period 1) to 2017 (period 15) are in the file KYRevenue (ky.gov website). a. Construct a time-series plot. What type of pattern exists in the data? b. Develop a linear trend equation
> In 2017, ABC News reported that 58% of U.S. drivers admit to speeding. Suppose that a new satellite technology can instantly measure the speed of any vehicle on a U.S. road and determine whether the vehicle is speeding, and this satellite technology was
> Consider the following time series. a. Construct a time series plot. What type of pattern exists in the data? b. Using statistical software, develop the quadratic trend equation for the time series. c. What is the forecast for t = 8?
> Refer to the time series data in exercise 1. Using the average of all the historical data as a forecast for the next period, compute the following measures of forecast accuracy. a. Mean absolute error. b. Mean squared error. c. Mean absolute percentage e
> Consider the following time series. a. Construct a time series plot. What type of pattern exists in the data? b. Develop the linear trend equation for this time series. c. What is the forecast for t = 8?
> Consider the following time series data. a. Construct a time series plot. What type of pattern exists in the data? b. Develop the linear trend equation for this time series. c. What is the forecast for t = 8?
> Consider the following time series data. a. Construct a time series plot. What type of pattern exists in the data? b. Develop the linear trend equation for this time series. c. What is the forecast for t = 6?
> The U.S. Census Bureau tracks the median price for new home sales by month of year. The median prices for April for 22 years follow (U.S. Census Bureau website). a. Construct a time series plot. Comment on any pattern you observe. Discuss some of the f
> Ten weeks of data on the Commodity Futures Index are 7.35, 7.40, 7.55, 7.56, 7.60, 7.52, 7.52, 7.70, 7.62, and 7.55. a. Construct a time series plot. What type of pattern exists in the data? b. Compute the exponential smoothing forecasts for α = .2. c. C
> The following time series shows the sales of a particular product over the past 12 months. a. Construct a time series plot. What type of pattern exists in the data? b. Use α = .3 to compute the exponential smoothing forecasts for the time
> The values of Alabama building contracts (in $ millions) for a 12-month period follow. 240 350 230 260 280 320 220 310 240 310 240 230 a. Construct a time series plot. What type of pattern exists in the data? b. Compare the three-month moving average app
> Corporate triple-A bond interest rates for 12 consecutive months follow. 9.5 9.3 9.4 9.6 9.8 9.7 9.8 10.5 9.9 9.7 9.6 9.6 a. Construct a time series plot. What type of pattern exists in the data? b. Develop three-month and four-month moving averages for
> Internet users were recently asked online to rate their satisfaction with the web browser they use most frequently. Of 102,519 respondents, 65,120 indicated they were very satisfied with the web browser they use most frequently. a. What is the sample pro
> For the Hawkins Company, the monthly percentages of all shipments received on time over the past 12 months are 80, 82, 84, 83, 83, 84, 85, 84, 82, 83, 84, and 83. a. Construct a time series plot. What type of pattern exists in the data? b. Compare the th
> With a smoothing constant of α = .2, equation (17.2) shows that the forecast for week 13 of the gasoline sales data from Table 17.1 is given by F13 = .2Y12 + .8F12. However, the forecast for week 12 is given by F12 = .2Y11 + .8F11. Thus, we could combine
> Consider the following time series data. Using the naive method (most recent value) as the forecast for the next week, compute the following measures of forecast accuracy. a. Mean absolute error. b. Mean squared error. c. Mean absolute percentage error
> As of September 4, 2016, the film Suicide Squad had an average rating of 3.7 out of 5 based on 117,323 viewer ratings (Rotten Tomatoes website). How are the viewer ratings of Suicide Squad related to the viewer age and the viewer ratings of The Secret Li
> Corvette, Ferrari, and Jaguar produced a variety of classic cars that continue to increase in value. The following data, based upon the Martin Rating System for Collectible Cars, show the rarity rating (1–20) and the high price ($1000)
> Home Depot, a home-improvement retailer, sells several brands of washing machines. The following table contains a sample of 24 models of full-size washing machines sold by Home Depot in 2016, with each observation recording the washing machine capacity (
> A study of emergency service facilities investigated the relationship between the number of facilities and the average distance traveled to provide the emergency service. The following table gives the data collected. Number of Facilities …………….. Average
> In working further with the problem of exercise 4, statisticians suggested the use of the following curvilinear estimated regression equation. ŷ = b0 + b1x + b2x2 a. Use the data of exercise 4 to estimate the parameters of this estimated regression equa
> A highway department is studying the relationship between traffic flow and speed. The following model has been hypothesized. y = β 0 + β1x + ε where y = traffic flow in vehicles per hour x = vehicle speed in miles per hour The following data were collect
> Consider the following data for two variables, x and y. a. Does there appear to be a linear relationship between x and y? Explain. b. Develop the estimated regression equation relating x and y. c. Plot the standardized residuals versus yÌ
> According to the Census Bureau, 2,475,780 people are employed by the federal government in the United States as of 2018. Suppose that a random sample of 3,500 of these federal employees was selected and the number of sick hours each of these employees to
> Refer to the Cravens data set in Table 16.5. In Section 16.3 we showed that the estimated regression equation involving Accounts, AdvExp, Poten, and Share had an adjusted coefficient of determination of 88.1%. Use the .05 level of significance and apply
> The following data show the daily closing prices (in dollars per share) for a stock. Date ……………………… Price ($) Nov. 3 ……………………….. 82.87 Nov. 4 ………………………. 83.00 Nov. 7 ……………………….. 83.61 Nov. 8 ……………………….. 83.15 Nov. 9 ……………………….. 82.84 Nov. 10 ………………………..
> Mbuy is a media consulting firm that provides advice to companies on how to allocate their advertising budgets. Mbuy designed a factorial experiment to test the effect of the size of a banner ad on a website and the ad design on the number (in thousands)
> An automobile dealer conducted a test to determine whether the time needed to complete a minor engine tune-up depends on whether a computerized engine analyzer or an electronic analyzer is used. Because tune-up time varies among compact, intermediate, an
> Four different paints are advertised as having the same drying time. To check the manufacturers’ claims, five samples were tested for each of the paints. The time in minutes until the paint was dry enough for a second coat to be applied
> The Jacobs Chemical Company wants to estimate the mean time (minutes) required to process a batch of material on mixer machines produced by three different manufacturers. To limit the cost of testing, four batches of material were mixed on machines produ
> Write a multiple regression equation that can be used to analyze the data for a two-factorial design with two levels for factor A and three levels for factor B. Define all variables.
> Write a multiple regression equation that can be used to analyze the data for a randomized block design involving three treatments and two blocks. Define all variables.
> Consider a completely randomized design involving four treatments: A, B, C, and D. Write a multiple regression equation that can be used to analyze these data. Define all variables.
> Consider the following data for two variables, x and y. a. Develop an estimated regression equation for the data of the form ŷ = β0 + β1x. Comment on the adequacy of this equation for predicting y. b. Develop an
> Suppose a sample of 10,001 erroneous Federal income tax returns from last year has been taken and is provided in the file FedTaxErrors. A positive value indicates the taxpayer underpaid and a negative value indicates that the taxpayer overpaid. a. What i
> Refer to exercise 14. Using age, blood pressure, whether a person is a smoker, and any interaction involving those variables, develop an estimated regression equation that can be used to predict risk. Briefly describe the process you used to develop an e
> The Ladies Professional Golfers Association (LPGA) maintains statistics on performance and earnings for members of the LPGA Tour. Year-end performance statistics for 134 golfers for 2014 appear in the file LPGA2014Stats (LPGA website). Earnings is the to
> A study provided data on variables that may be related to the number of weeks a person has been jobless. The dependent variable in the study (Weeks) was defined as the number of weeks a person has been jobless due to a layoff. The following independent v
> In 2016, the average monthly residential natural gas bill for Black Hills Energy customers in Cheyenne, Wyoming, is $67.95 (Wyoming Public Service Commission website). How is the monthly average gas bill for a Cheyenne home related to the square footage,
> A 10-year study conducted by the American Heart Association provided data on how age, blood pressure, and smoking relate to the risk of strokes. Data from a portion of this study follow. Risk is interpreted as the probability (times 100) that a person wi
> Refer to the description in exercise 12. a. Develop an estimated regression equation that can be used to predict the total earnings for all events given the average number of putts taken on greens hit in regulation. b. Develop an estimated regression equ
> Predicting LPGA Player’s Average Score. The Ladies Professional Golfers Association (LPGA) maintains statistics on performance and earnings for members of the LPGA Tour. Year-end performance statistics for 134 golfers for 2014 appear in the file LPGA2014
> In a regression analysis involving 30 observations, the following estimated regression equation was obtained: ŷ = 17.6 + 3.8x1 - 2.3x2 + 7.6x3 + 2.7x4 For this estimated regression equation SST = 1805 and SSR = 1760. a. At α = .05, test the significance
> In a regression analysis involving 27 observations, the following estimated regression equation was developed: ŷ = 25.2 + 5.5x1 For this estimated regression equation SST = 1550 and SSE = 520. a. At α = .05, test whether x1 is significant. Suppose that
> The Pew Research Center Internet Project, conducted in 2014 on the 25th anniversary of the Internet, involved a survey of 857 Internet users. It provided a variety of statistics on Internet users. For instance, in 2014, 87% of American adults were Intern
> Consider the following data for two variables, x and y. a. Develop an estimated regression equation for the data of the form ŷ = β0 + β1x. b. Use the results from part (a) to test for a significant relationship be
> Spring is a peak time for selling houses. The file SpringHouses contains the selling price, number of bathrooms, square footage, and number of bedrooms of 26 homes sold in Ft. Thomas, Kentucky, in spring 2018 (realtor.com website). a. Develop scatter plo
> The Condé Nast Traveler Gold List provides ratings for the top 20 small cruise ships. The data shown below are the scores each ship received based upon the results from Condé Nast Traveler’s annual Readers&acir
> PC Magazine provided ratings for several characteristics of computer monitors, including an overall rating (PC Magazine website). The following data show the rating for contrast ratio, resolution, and the overall rating for ten monitors tested using a 0&
> The National Football League (NFL) records a variety of performance data for individuals and teams. To investigate the importance of passing on the percentage of games won by a team, the following data show the conference (Conf), average number of passin
> The owner of Showtime Movie Theaters, Inc., would like to predict weekly gross revenue as a function of advertising expenditures. Historical data for a sample of eight weeks follow. a. Develop an estimated regression equation with the amount of televis
> The Tire Rack maintains an independent consumer survey to help drivers help each other by sharing their long-term tire experiences. The data contained in the file named TireRatings show survey results for 68 all-season tires. Performance traits are rated
> Over the past few years the percentage of students who leave Lakeland College at the end of the first year has increased. Last year Lakeland started a voluntary one-week orientation program to help first-year students adjust to campus life. If Lakeland i
> Community Bank would like to increase the number of customers who use payroll direct deposit. Management is considering a new sales campaign that will require each branch manager to call each customer who does not currently use payroll direct deposit. As
> In Table 15.12 we provided estimates of the probability of using the coupon in the Simmons Stores catalog promotion. A different value is obtained for each combination of values for the independent variables. a. Compute the odds in favor of using the cou
> A poll for the presidential campaign sampled 491 potential voters in June. A primary purpose of the poll was to obtain an estimate of the proportion of potential voters who favored each candidate. Assume a planning value of p* = .50 and a 95% confidence
> Refer to the Simmons Stores example introduced in this section. The dependent variable is coded as y = 1 if the customer used the coupon and 0 if not. Suppose that the only information available to help predict whether the customer will use the coupon is
> The Ladies Professional Golfers Association (LPGA) maintains statistics on performance and earnings for members of the LPGA Tour. Year-end performance statistics for 134 golfers for 2014 appear in the file named LPGA2014 (LPGA website, April 2015). Earn
> The following data show the curb weight, horsepower, and ¼-mile speed for 16 popular sports and GT cars. Suppose that the price of each sports and GT car is also available. The complete data set is as follows: a. Find the estimated regres
> Exercise 5 gave the following data on weekly gross revenue, television advertising, and newspaper advertising for Showtime Movie Theaters. a. Find an estimated regression equation relating weekly gross revenue to television and newspaper advertising. b
> Data for two variables, x and y, follow. a. Develop the estimated regression equation for these data. b. Compute the studentized deleted residuals for these data. At the .05 level of significance, can any of these observations be classified as an outli
> A shoe store developed the following estimated regression equation relating sales to inventory investment and advertising expenditures. ŷ = 25 + 10 x1 + 8 x2 where x1 = inventory investment ($1000s) x2 = advertising expenditures ($1000s) y = sales ($100
> Data for two variables, x and y, follow. a. Develop the estimated regression equation for these data. b. Plot the standardized residuals versus yˆ. Do there appear to be any outliers in these data? Explain. c. Compute the studentized deleted
> A 10-year study conducted by the American Heart Association provided data on how age, blood pressure, and smoking relate to the risk of strokes. Assume that the following data are from a portion of this study. Risk is interpreted as the probability (time
> Best Buy, a nationwide retailer of electronics, computers, and appliances, sells several brands of refrigerators. A random sample of models of full size refrigerators prices sold by Best Buy and the corresponding cubic feet (cu. ft.) and list price follo
> This problem is an extension of the situation described in exercise 35. a. Develop the estimated regression equation to predict the repair time given the number of months since the last maintenance service, the type of repair, and the repairperson who pe
> Fewer young people are driving. In 1995, 63.9% of people under 20 years old who were eligible had a driver’s license. Bloomberg reported that percentage had dropped to 41.7% in 2016. Suppose these results are based on a random sample of 1200 people under
> A simple random sample with n = 54 provided a sample mean of 22.5 and a sample standard deviation of 4.4. a. Develop a 90% confidence interval for the population mean. b. Develop a 95% confidence interval for the population mean. c. Develop a 99% confide
> Refer to the Johnson Filtration problem introduced in this section. Suppose that in addition to information on the number of months since the machine was serviced and whether a mechanical or an electrical repair was necessary, the managers obtained a lis
> Management proposed the following regression model to predict sales at a fast-food outlet. y = β0 + β1x1 + β2x2 + β3x3 + e where x1 = number of competitors within one mile x2 = population within one mile (1000s) x3 = { 51 if drive-up window present
> Consider a regression study involving a dependent variable y, a quantitative independent variable x1, and a categorical independent variable with three possible levels (level 1, level 2, and level 3). a. How many dummy variables are required to represent
> Consider a regression study involving a dependent variable y, a quantitative independent variable x1, and a categorical independent variable with two levels (level 1 and level 2). a. Write a multiple regression equation relating x1 and the categorical va
> Refer to Problem 25. Use the estimated regression equation from part (a) to answer the following questions. a. Estimate the selling price of a four-year-old Honda Accord with mileage of 40,000 miles. b. Develop a 95% confidence interval for the selling p
> In exercise 24, an estimated regression equation was developed relating the percentage of games won by a team in the National Football League for the 2011 season given the average number of passing yards obtained per game on offense and the average numbe
> In a regression analysis involving 30 observations, the following estimated regression equation was obtained. ŷ = 17.6 + 3.8 x1 - 2.3 x2 + 7.6x3 + 2.7 x4 a. Interpret b1, b2, b3, and b4 in this estimated regression equation. b. Predict y when x1 = 10, x
> In exercise 5, the owner of Showtime Movie Theaters, Inc., used multiple regression analysis to predict gross revenue (y) as a function of television advertising (x1) and newspaper advertising (x2). The estimated regression equation was ŷ = 83.23 + 2.29
> Refer to the data in exercise 2. The estimated regression equation for those data is ŷ = 218.4 + 2.01x1 + 4.74x2 a. Develop a 95% confidence interval for the mean value of y when x1 = 47 and x2 = 10. b. Develop a 95% prediction interval for y when x1 =
> In exercise 1, the following estimated regression equation based on 10 observations was presented. ŷ = 29.1270 + .5906x1 + .4980x2 a. Develop a point estimate of the mean value of y when x1 = 180 and x2 = 310. b. Develop a point estimate for an individu
> For many years businesses have struggled with the rising cost of health care. But recently, the increases have slowed due to less inflation in health care prices and employees paying for a larger portion of health care benefits. A recent survey showed th
> In exercise 10, data showing the values of several pitching statistics for a random sample of 20 pitchers from the American League of Major League Baseball were provided. In part (c) of this exercise an estimated regression equation was developed to pred
> The Honda Accord was named the best midsized car for resale value for 2018 by the Kelley Blue Book (Kelley Blue Book website). The file AutoResale contains mileage, age, and selling price for a sample of 33 Honda Accords. a. Develop an estimated regressi
> The National Football League (NFL) records a variety of performance data for individuals and teams. A portion of the data showing the average number of passing yards obtained per game on offense (Off-PassYds/G), the average number of yards given up per g
> Refer to exercise 5. Use α = .01 to test the hypotheses H0: β 1 = β 2 = 0 Ha: β 1 and/or β2 is not equal to zero for the model y = β0 + β1x1 + β2x2 + e, wh
> In exercise 4, the following estimated regression equation relating sales to inventory investment and advertising expenditures was given. ŷ = 25 + 10x1 + 8x2 The data used to develop the model came from a survey of 10 stores; for these data SST = 16,000
> The following estimated regression equation was developed for a model involving two independent variables. ŷ = 40.7 1+ 8.63x1 + 2.71x2 After x2 was dropped from the model, the least squares method was used to obtain an estimated regression equation invo
> Refer to the data presented in exercise 2. The estimated regression equation for these data is ŷ = 218.37 + 2.01x1 + 4.74x2 Here SST = 15,182.9, SSR = 14,052.2, sb1 = .2471, and sb2 = .9484. a. Test for a significant relationship among x1,
> Consider the following data for a dependent variable y and two independent variables, x1 and x2. a. Develop an estimated regression equation relating y to x1. Predict y if x1 = 47. b. Develop an estimated regression equation relating y to x2. Predict y