The production of wine is a multibillion-dollar worldwide industry. In an attempt to develop a model of wine quality as judged by wine experts, data were collected from red wine variants of Portuguese “Vinho Verde” wine. Source: Data extracted from Cortez, P., Cerdeira, A., Almeida, F., Matos, T., and Reis, J., “Modeling Wine Preferences by Data Mining from Physiochemical Properties,” Decision Support Systems, 47, 2009, pp. 547–553 and bit.ly/9xKlEa. A sample of 50 wines is stored in VinhoVerde . Develop a simple linear regression model to predict wine quality, measured on a scale from 0 (very bad) to 10 (excellent), based on alcohol content (%). a. Construct a scatter plot. For these data, b0 = -0.3529 and b1 = 0.5624. b. Interpret the meaning of the slope, b1, in this problem. c. Predict the mean wine quality for wines with a 10% alcohol content. d. What conclusion can you reach based on the results of (a)–(c)?
> In Problem 13.9 on page 494, an agent for a real estate company wanted to predict the monthly rent for one-bedroom apartments, based on the size of the apartment. The data are stored in RentSilverSpring . Use the results of that problem. a. At the 0.05 l
> In Problem 13.8 on page 494, you used annual revenues to predict the value of a baseball franchise. The data are stored in BBValues . Use the results of that problem. a. At the 0.05 level of significance, is there evidence of a linear relationship betwee
> Consider a two-factor factorial design with three levels for factor A, three levels for factor B, and four replicates in each of the nine cells. a. How many degrees of freedom are there in determining the factor A variation and the factor B variation? b.
> In Problem 13.7 on page 494, you used the plate gap in the bag-sealing equipment to predict the tear rating of a bag of coffee. The data are stored in Starbucks . Use the results of that problem. a. At the 0.05 level of significance, is there evidence of
> In Problem 13.6 on page 494, a prospective MBA student wanted to predict starting salary upon graduation, based on program per-year tuition. The data are stored in FTMBA . Use the results of that problem. a. At the 0.05 level of significance, is there ev
> In Problem 13.5 on page 493, you used the summated rating of a restaurant to predict the cost of a meal. The data are stored in Restaurants . a. At the 0.05 level of significance, is there evidence of a linear relationship between the summated rating of
> In Problem 13.4 on page 493, you used the percentage of alcohol to predict wine quality. The data are stored in VinhoVerde . From the results of that problem, b1 = 0.5624 and Sb1 = 0.1127. a. At the 0.05 level of significance, is there evidence of a line
> You are testing the null hypothesis that there is no linear relationship between two variables, X and Y. From your sample of n = 20, you determine that SSR = 60 and SSE = 40. a. What is the value of FSTAT? b. At the a = 0.05 level of significance, what i
> You are testing the null hypothesis that there is no linear relationship between two variables, X and Y. From your sample of n = 18, you determine that b1 = +4.5 and Sb1 = 1.5. a. What is the value of tSTAT? b. At the α = 0.05 level of significance, wh
> You are testing the null hypothesis that there is no linear relationship between two variables, X and Y. From your sample of n = 10, you determine that r = 0.80. a. What is the value of the t test statistic tSTAT? b. At the α = 0.05 level of significan
> The owners of a chain of ice cream stores have the business objective of improving the forecast of daily sales so that staffing shortages can be minimized during the summer season. As a starting point, the owners decide to develop a simple linear regress
> In Problem 14.7 on page 542, you used the weekly staff count and remote engineering hours to predict standby hours (stored in Nickels26Weeks ). a. Perform a residual analysis on your results. b. If appropriate, perform the Durbin-Watson test, using a = 0
> A transportation strategist wanted to compare the traffic congestion levels across four continents: Asia, Europe, North America, and South America. The file CongestionLevel contains congestion level, defined as the increase (%) in overall travel time whe
> A mail-order catalog business that sells personal computer supplies, software, and hardware maintains a centralized warehouse for the distribution of products ordered. Management is currently examining the process of distribution from the warehouse and h
> In Problem 13.7 on page 494 concerning the bag-sealing equipment at Starbucks, you used the plate gap to predict the tear rating. a. Is it necessary to compute the Durbin-Watson statistic in this case? Explain. b. Under what circumstances is it necessary
> The residuals for 15 consecutive time periods are as follows: a. Plot the residuals over time. What conclusion can you reach about the pattern of the residuals over time? b. Compute the Durbin-Watson statistic. At the 0.05 level of significance, is the
> The residuals for 10 consecutive time periods are as follows: a. Plot the residuals over time. What conclusion can you reach about the pattern of the residuals over time? b. Based on (a), what conclusion can you reach about the autocorrelation of the r
> In Problem 13.10 on page 494, you used YouTube trailer views to predict movie weekend box office gross. Perform a residual analysis for these data (stored in Movie ). Based on these results, evaluate whether the assumptions of regression have been seriou
> In Problem 13.8 on page 494, you used annual revenues to predict the value of a baseball franchise. Perform a residual analysis for these data (stored in BBValues ). Based on these results, evaluate whether the assumptions of regression have been serious
> In Problem 13.9 on page 494, an agent for a real estate company wanted to predict the monthly rent for one-bedroom apartments, based on the size of the apartments. Perform a residual analysis for these data (stored in RentSilverSpring ). Based on these r
> In Problem 13.6 on page 494, a prospective MBA student wanted to predict starting salary upon graduation, based on program per-year tuition. Perform a residual analysis for these data (stored in FTMBA ). Based on these results, evaluate whether the assum
> In Problem 14.6 on page 542, you used full-time voluntary turnover (%), and total worldwide revenue ($billions) to predict number of full-time jobs added (stored in BestCompanies ). a. Perform a residual analysis on your results. b. If appropriate, perfo
> In Problem 14.6 on page 542, you used full-time voluntary turnover (%) and total worldwide revenue ($billions) to predict number of full-time jobs added (stored in BestCompanies ). Using the results from that problem, a. determine whether there is a sig
> For this problem, use the following multiple regression equation: a. Interpret the meaning of the slopes. b. Interpret the meaning of the Y intercept. Y; = 50 – 2X1; + 7X2;
> In Problem 13.7 on page 494, you used the plate gap on the bag-sealing equipment to predict the tear rating of a bag of coffee. Perform a residual analysis for these data (stored in Starbucks ). Based on these results, evaluate whether the assumptions of
> In Problem 13.4 on page 493, you used the percentage of alcohol to predict wine quality. Perform a residual analysis for these data (stored in VinhoVerde ). Evaluate whether the assumptions of regression have been seriously violated.
> In Problem 13.5 on page 493, you used the summated rating to predict the cost of a restaurant meal. Perform a residual analysis for these data (stored in Restaurants ). Evaluate whether the assumptions of regression have been seriously violated.
> The following results show the X values, residuals, and a residual plot from a regression analysis: Is there any evidence of a pattern in the residuals? Explain. X Residuals 0.70 Residual Plot 2 1.58 2.0 1.03 0.33 1.5 5 -0.39 -0.67 1.0 7 -0.56 0.65
> The following results provide the X values, residuals, and a residual plot from a regression analysis: Is there any evidence of a pattern in the residuals? Explain. X Residuals 1 0.70 Residual Plot 2 -0.78 3.0 3 1.03 4 0.33 2.5 2.39 2.0 -0.67 7 0.1
> In Problem 13.10 on page 494, you used YouTube trailer views to predict movie weekend box office gross (stored in Movie ). Using the results of that problem, a. determine the coefficient of determination, r2, and interpret its meaning. b. determine the s
> In Problem 13.9 on page 494, an agent for a real estate company wanted to predict the monthly rent for one-bedroom apartments, based on the size of the apartment (stored in RentSilverSpring ). Using the results of that problem, a. determine the coefficie
> In Problem 13.8 on page 494, you used annual revenues to predict the value of a baseball franchise (stored in BBValues ). Using the results of that problem, a. determine the coefficient of determination, r2, and interpret its meaning. b. determine the s
> In Problem 13.7 on page 494, you used the plate gap on the bag-sealing equipment to predict the tear rating of a bag of coffee (stored in Starbucks ). Using the results of that problem, a. determine the coefficient of determination, r2, and interpret it
> A pet food company has a business objective of expanding its product line beyond its current kidney and shrimp-based cat foods. The company developed two new products, one based on chicken liver and the other based on salmon. The company conducted an exp
> In Problem 13.6 on page 494, a prospective MBA student wanted to predict starting salary upon graduation, based on program per-year tuition (stored in FTMBA ). Using the results of that problem, a. determine the coefficient of determination, r2, and inte
> In Problem 13.5 on page 493, you used the summated rating to predict the cost of a restaurant meal (stored in Restaurants ). a. Determine the coefficient of determination, r2, and interpret its meaning. b. Determine the standard error of the estimate. c.
> In Problem 13.4 on page 493, the percentage of alcohol was used to predict wine quality (stored in VinhoVerde ). For those data, SSR = 21.8677 and SST = 64.0000. a. Determine the coefficient of determination, r2, and interpret its meaning. b. Determine t
> If SSR = 120, why is it impossible for SST to equal 110?
> If SSE = 10 and SSR = 30, compute the coefficient of determination, r2, and interpret its meaning.
> If SSR = 66 and SST = 88, compute the coefficient of determination, r2, and interpret its meaning.
> If SSR = 36 and SSE = 4, determine SST and then compute the coefficient of determination, r2, and interpret its meaning.
> How do you interpret a coefficient of determination, r2, equal to 0.80?
> A box office analyst seeks to predict opening weekend box office gross for movies. Toward this goal, the analyst plans to use YouTube trailer views as a predictor. For each of 66 movies, the YouTube trailer view count, the number of YouTube trailer views
> Brand valuations are critical to CEOs, financial and marketing executives, security analysts, institutional investors, and others who depend on well-researched, reliable information needed for assessments and comparisons in decision making. Millward Brow
> An agent for a residential real estate company in a suburb located outside of Washington, DC, has the business objective of developing more accurate estimates of the monthly rental cost for apartments. Toward that goal, the agent would like to use the si
> The value of a sports franchise is directly related to the amount of revenue that a franchise can generate. The file BBValues represents the value in 2017 (in $millions) and the annual revenue (in $millions) for the 30 Major League Baseball franchises.
> Starbucks Coffee Co. uses a data-based approach to improving the quality and customer satisfaction of its products. When survey data indicated that Starbucks needed to improve its package-sealing process, an experiment was conducted to determine the fact
> Is an MBA a golden ticket? Pursuing an MBA is a major personal investment. Tuition and expenses associated with business school programs are costly, but the high costs come with hopes of career advancement and high salaries. A prospective MBA student wou
> Zagat’s publishes restaurant ratings for various locations in the United States. The file Restaurants contains the Zagat rating for food, décor, service, and the cost per person for a sample of 100 restaurants located in the center of New York City and i
> Fitting a straight line to a set of data yields the following prediction line: a. Interpret the meaning of the Y intercept, b0. b. Interpret the meaning of the slope, b1. c. Predict the value of Y for X = 6. Î, = 16 – 0.5X;
> If the values of X in Problem 13.1 range from 2 to 25, should you use this model to predict the mean value of Y when X equals a. 3? b. -3? c. 0? d. 24?
> Fitting a straight line to a set of data yields the following prediction line: a. Interpret the meaning of the Y intercept, b0. b. Interpret the meaning of the slope, b1. c. Predict the value of Y for X = 3. Î; = 2 + 5X;
> QSR reports on the largest quick-serve and fast-casual brands in the United States. The file FastFoodChain contains the food segment (burger, chicken, sandwich or pizza/pasta) and U.S. mean sales per unit ($ thousands) for each of 37 quick-service brands
> Use the following contingency table: a. Compute the expected frequency for each cell. b. Compute χ2STAT. Is it significant at α = 0.05? A B C Total 1 10 30 50 90 2 40 45 50 135 Total 50 75 100 225
> Consider a contingency table with two rows and five columns. a. How many degrees of freedom are there in the contingency table? b. Determine the critical value for α = 0.05. c. Determine the critical value for α = 0.01.
> Does co-browsing have positive effects on the customer experience? Co-browsing refers to the ability to have a contact center agent and customer jointly navigate an application (e.g., web page, digital document, or mobile application) on a real time basi
> A glass manufacturing company wanted to investigate the effect of zone 1 lower temperature (630 vs. 650) and zone 3 upper temperature (695 vs. 715) on the roller imprint of glass. The results stored in Glass2 were as follows: Source: K. Kumar and S. Y
> What social media tools do marketers commonly use? A survey by Social Media Examiner of B2B marketers (marketers that focus primarily on attracting businesses) and B2C marketers (marketers that primarily target consumers) reported that 267 (81%) of B2B m
> The Society for Human Resource Management (SHRM) collaborated with Globoforce on a series of organizational surveys with the goal of identifying challenges that HR leaders face and what strategies help them conquer those challenges. A 2016 survey indicat
> Are you an impulse shopper? A survey of 500 grocery shoppers indicated that 29% of males and 40% of females make an impulse purchase every time they shop. Source: Data extracted from Women shoppers are impulsive while men snap up bargains, available at b
> Does Cable Video on Demand (VOD D4+) increase ad effectiveness? A 2015 VOD study compared general TV and VOD D4+ audiences after viewing a brand ad. Whether the viewer indicated that the ad made them want to visit the brand website was collected and orga
> An Ipsos poll asked 1,004 adults “If purchasing a used car made certain upgrades or features more affordable, what would be your preferred luxury upgrade?” The results indicated that 9% of the males and 14% of the females answered window tinting. Source
> Use the following contingency table: a. Compute the expected frequency for each cell. b. Compute χ2STAT. Is it significant at α = 0.05? A B Total 1 20 30 50 2 30 20 50 Total 50 50 100
> Use the following contingency table: a. Compute the expected frequency for each cell. b. Compare the observed and expected frequencies for each cell. c. Compute χ2STAT. Is it significant at α = 0.05? A B Total 1 20 Total
> The following ANOVA summary table is for a multiple regression model with two independent variables: a. Determine the regression mean square (MSR) and the mean square error (MSE). b. Compute the overall FSTAT test statistic. c. Determine whether the
> Determine the critical value of χ2 with 1 degree of freedom in each of the following circumstances: a. α = 0.05 b. α = 0.025 c. α = 0.01
> Determine the critical value of χ2 with 1 degree of freedom in each of the following circumstances: a. α = 0.01 b. α = 0.005 c. α = 0.10
> A glass manufacturing company wanted to investigate the effect of breakoff pressure and stopper height on the percentage of breaking off chips. The results, stored in Glass1 , were as follows: Source: K. Kumar and S. Yadav, “Breakth
> In Problems 13.8, 13.20, 13.30, 13.46, 13.62, 13.82, and 13.83, you developed regression models to predict franchise value of major league baseball, NBA basketball, and soccer teams. Now, write a report based on the models you developed. Append to your r
> The file CEO 2016 includes the total compensation (in $ millions) for CEOs of 200 Standard & Poor’s 500 companies and the investment return in 2016. Source: Data extracted from R. Lightner and T. Francis, “How Much Do Top CEOs Make?” available at bit.ly/
> Refer to the discussion of beta values and market models in Problem 13.49 on page 513. The S&P 500 Index tracks the overall movement of the stock market by considering the stock prices of 500 large corporations. The file StockPrices2016 contains 2016 wee
> During the fall harvest season in the United States, pumpkins are sold in large quantities at farm stands. Often, instead of weighing the pumpkins prior to sale, the farm stand operator will just place the pumpkin in the appropriate circular cutout on th
> Referring to Problem 14.82, instead of predicting the unit density, you now wish to predict the foam diameter from results stored in PackagingFoam4 . Develop a multiple regression model that uses die temperature and die diameter to predict the foam diame
> An experiment was conducted to study the extrusion process of biodegradable packaging foam. Source: Data extracted from W. Y. Koh, K. M. Eskridge, and M. A. Hanna, “Supersaturated Split-Plot Designs,” Journal of Quality Technology, 45, January 2013, pp.
> Starbucks Coffee Co. uses a data-based approach to improving the quality and customer satisfaction of its products. When survey data indicated that Starbucks needed to improve its package sealing process, an experiment was conducted to determine the fact
> HR practitioners are increasing performing gender pay audits to understand whether a gender gap exists at their company. Practitioners examine payroll data for evidence of a gender pay gap. An HR practitioner collects data on base pay ($), gender (0 = fe
> Nassau County is located approximately 25 miles east of New York City. The data organized and stored in GlenCove include the fair market value (in $thousands), land area of the property in acres, and age, in years, for a sample of 30 single-family homes
> You are a real estate broker who wants to compare property values in Glen Cove and Roslyn (which are located approximately 8 miles apart). In order to do so, you will analyze the data in GCRoslyn , a file that includes samples of houses from Glen Cove an
> A plastic injection molding process is often used in manufacturing because of its ability to mold complicated shapes. An experiment was conducted on the manufacture of a television remote part, and the warpage (mm) of the part was measured and stored in
> Referring to Problem 14.77, suppose that in addition to using ERA to predict the number of wins, the analytics specialist wants to include the league (0 = American, 1 = National) as an independent variable. Develop a model to predict wins based on ERA an
> A baseball analytics specialist wants to determine which variables are important in predicting a team’s wins in a given season. He has collected data related to wins, earned run average (ERA), and runs scored per game for a recent season (stored in Baseb
> A sample of 61 houses recently listed for sale in Silver Spring, Maryland, was selected with the objective of developing a model to predict the taxes (in $) based on the asking price of houses (in $thousands) and the age of the houses (in years) (stored
> Measuring the height of a California redwood tree is very difficult because these trees grow to heights over 300 feet. People familiar with these trees understand that the height of a California redwood tree is related to other characteristics of the tre
> A sample of 61 houses recently listed for sale in Silver Spring, Maryland, was selected with the objective of developing a model to predict the asking price (in $thousands), using the living space of the house (in square feet) and age (in years). The res
> Professional basketball has truly become a sport that generates interest among fans around the world. More and more players come from outside the United States to play in the National Basketball Association (NBA). You want to develop a regression model t
> The owner of a moving company typically has his most experienced manager predict the total number of labor hours that will be required to complete an upcoming move. This approach has proved useful in the past, but the owner has the business objective of
> Increasing customer satisfaction typically results in increased purchase behavior. For many products, there is more than one measure of customer satisfaction. In many, purchase behavior can increase dramatically with an increase in just one of the custom
> What is the difference between least squares regression and logistic regression?
> The business problem facing the director of broadcasting operations for a television station was the issue of standby hours (i.e., hours in which employees at the station are paid but are not actually involved in any activity) and what factors were relat
> In Problem 14.8 on page 542, you used the land area of a property and the age of a house to predict the fair market value (stored in GlenCove ). Using the results from that problem, a. determine whether there is a significant relationship between fair m
> When do you use logistic regression?
> When a dummy variable is included in a regression model that has one numerical independent variable, what assumption do you need to make concerning the slope between the dependent variable, Y, and the numerical independent variable, X?
> Under what circumstances do you include an interaction term in a regression model?
> How can you evaluate whether the slope of the dependent variable with an independent variable is the same for each level of the dummy variable?
> Why and how do you use dummy variables?
> How do the coefficients of partial determination differ from the coefficient of multiple determination?
> How does testing the significance of the entire multiple regression model differ from testing the contribution of each independent variable?
> How does the interpretation of the regression coefficients differ in multiple regression and simple linear regression?
> What is the difference between r2 and adjusted r2?
> A local supermarket manager wants to use two independent variables, customer age (in years) and whether the customer subscribes to the supermarket chain’s health/wellness e-newsletters (coded as 1 = yes and 0 = no) to predict which customers are likely t