Which statement is true? Why not the others?
a. We expect the median to exceed the mean in positively skewed data.
b. The geometric mean is not possible when there are negative data values.
c. The midrange is resistant to outliers.
> Which statement is incorrect? Explain. a. If p = .50 and n = 100, the estimated standard error of the sample proportion is .05. b. In a sample size calculation for estimating π, it is conservative to assume π = .50. c. If n = 250 and p = .07 it is not s
> Which statement is false? Explain. a. To find probabilities in a continuous distribution, we add up the probabilities at each point. b. A uniform continuous model U(5,21) has mean 13 and standard deviation 4.619. c. A uniform PDF is constant for all valu
> A sample of 9 customers in the “quick” lane in a supermarket showed a mean purchase of $14.75 with a standard deviation of $2.10. (a) Find the 95 percent confidence interval for the true mean. (b) Why should you use t instead of z in this case?
> A sample of 16 ATM transactions shows a mean transaction time of 67 seconds with a standard deviation of 12 seconds. (a). State the hypotheses to test whether the mean transaction time exceeds 60 seconds. (b). Find the test statistic. (c). At α = .02
> Which statement is not correct? Explain. a. The sample data x1, x2, . . . , xn will be approximately normal if the sample size n is large. b. For a skewed population, the distribution of / is approximately normal if n is large. c. The expected value of /
> A regression model to predict Y, the state burglary rate per 100,000 people for 2005, used the following four state predictors: X1 = median age in 2005, X2 = number of 2005 bankruptcies, X3 = 2004 federal expenditures per capita (a leading predictor), an
> A random sample of 502 Vail Resorts’ guests were asked to rate their satisfaction on various attributes of their visit on a scale of 1–5 with 1 = very unsatisfied and 5 = very satisfied. The regression model was Y = ov
> A sample of 74 Noodles & Company restaurants was used to perform a regression analysis with Y = % Annual Revenue Growth and X = % Revenue Due to Loyalty Card Use. Calculate the leverage statistic for the following three restaurants and state whether or n
> A sample of season performance measures for 29 NBA teams was collected for a season. A regression analysis was performed on two of the variables with Y = total number of free throws made and X = total number of free throws attempted. Calculate the levera
> An estimated regression for a random sample of vehicles is MPG = 49.22 - 0.081 Horsepower, where MPG is miles per gallon and Horsepower is the engine’s horsepower. The standard error is se = 2.03. Suppose an engine has 200 horsepower and its actual (obse
> Observations are taken on net revenue from sales of a certain LCD TV at 50 retail outlets. The regression model was Y = net revenue (thousands of dollars), X1 = shipping cost (dollars per unit), X2 = expenditures on print advertising (thousands of dollar
> In the previous problem, calculate (a) the 95th percentile of vehicle speeds (i.e., 95 percent below); (b) the lowest 10 percent of speeds; (c) the highest 25 percent of speeds (3rd quartile).
> (a) Make an Excel scatter plot. What does it suggest about the population correlation between X and Y? (b) Make an Excel worksheet to calculate SSxx, SSyy, and SSxy. Use these sums to calculate the sample correlation coefficient. Check your work by using
> Review the two residual plots below. Do either of these show evidence that the regression error assumptions of normality and constant variation have been violated? Explain. X -3.0 -2.0 -1.0 0.0 1.0 2.0 3.0 Normal Score Residual Residuals
> Review the two residual plots below. Do either of these show evidence that the regression error assumptions of normality and constant variation have been violated? Explain. -3.0 -2.0 -1.0 0.0 1.0 2.0 3.0 Normal Score Residual
> Study the table of residuals. Identify as outliers any standardized residuals that exceed 3 and as unusual any that exceed 2. Can you suggest any reasons for these unusual residuals? Midterm and Final Exam Scores for Business Statistics Students Fall S
> Refer to the Revenue and Profit data set below. Data are in billions of dollars. (a) Use MegaStat or MINITAB to find confidence and prediction intervals for Y using the following set of x values: 1.8, 15, and 30. (b) Report the 95 percent confidence inte
> Refer to the Weekly Earnings data set below. (a) Use MegaStat or MINITAB to find confidence and prediction intervals for Y using the following set of x values: 12, 17, 21, 25, and 30. (b) Report the 95 percent confidence interval and prediction interval
> (a) Use Excel’s Data Analysis > Regression (or MegaStat or MINITAB) to obtain regression estimates. (b) Interpret the 95 percent confidence interval for the slope. Does it contain zero? (c) Interpret the t test for the slope and its p-value. (d) Interpre
> (a) Use Excel’s Data Analysis > Regression (or MegaStat or MINITAB) to obtain regression estimates. (b) Interpret the 95 percent confidence interval for the slope. Does it contain zero? (c) Interpret the t test for the slope and its p-value. (d) Interpre
> (a) Use Excel’s Data Analysis > Regression (or MegaStat or MINITAB) to obtain regression estimates. (b) Interpret the 95 percent confidence interval for the slope. Does it contain zero? (c) Interpret the t test for the slope and its
> Below is a regression using X = average price, Y = units sold, n = 20 stores. (a) Write the fitted regression equation. (b) Write the formula for each t statistic and verify the t statistics shown below. (c) State the degrees of freedom for the t tests a
> Below is a regression using X = home price (000), Y = annual taxes (000), n = 20 homes. (a) Write the fitted regression equation. (b) Write the formula for each t statistic and verify the t statistics shown below. (c) State the degrees of freedom for the
> A regression was performed using data on 16 randomly selected charities. The variables were Y = expenses (millions of dollars) and X = revenue (millions of dollars). (a) Write the fitted regression equation. (b) Construct a 95 percent confidence interval
> Which probability distribution (uniform, binomial, Poisson) is most nearly appropriate to describe each situation (assuming you knew the relevant parameters)? a. The number of dimes older than 10 years in a random sample of 8 dimes. b. The number of hos
> A regression was performed using data on 32 NFL teams. The variables were Y = current value of team (millions of dollars) and X = total debt held by the team owners ( millions of dollars). (a) Write the fitted regression equation. (b) Construct a 95 perc
> (a) Based on the R2 and ANOVA table for your model, how would you assess the fit? (b) Interpret the p-value for the F statistic. (c) Would you say that your model’s fit is good enough to be of practical value? Midterm and Final Exam S
> (a) Perform a regression using MegaStat or Excel. (b) State the null and alternative hypotheses for a two-tailed test for a zero slope. (c) Report the p-value and the 95 percent confidence interval for the slope shown in the regression results. (d) Is th
> (a) Perform a regression using MegaStat or Excel. (b) State the null and alternative hypotheses for a two-tailed test for a zero slope. (c) Report the p-value and the 95 percent confidence interval for the slope shown in the regression results. (d) Is th
> Using the “Metals” data, construct a correlation matrix of the six independent variables. The response variable is Priceylb. (a). Identify any pairs of independent variables that have a significant pairwise correlation. (b). Using MegaStat or MINITAB,
> (a) Use Excel to make a scatter plot of the data. (b) Select the data points, right-click, select Add Trendline, select the Options tab, and choose Display equation on chart and Display R-squared value on chart. (c) Interpret the fitted slope. (d) Is the
> (a) Use Excel to make a scatter plot of the data. (b) Select the data points, right-click, select Add Trendline, select the Options tab, and choose Display equation on chart and Display R-squared value on chart. (c) Interpret the fitted slope. (d) Is the
> (a) Make an Excel worksheet to calculate SSxx, SSyy, and SSxy (the same worksheet you used in exercises 12.2 and 12.3). (b) Use the formulas to calculate the slope and intercept. (c) Use your estimated slope and intercept to make a worksheet to calculate
> (a) Make an Excel worksheet to calculate SSxx, SSyy, and SSxy (the same worksheet you used in exercises 12.2 and 12.3). (b) Use the formulas to calculate the slope and intercept. (c) Use your estimated slope and intercept to make a worksheet to calculate
> Refrigerator prices are affected by characteristics such as whether or not the refrigerator is on sale, whether or not it is listed as a Sub-Zero brand, the number of doors (one door or two doors), and the placement of the freezer compartment (top, side,
> A regression model to predict the price of diamonds included the following predictor variables: the weight of the stone (in carats where 1 carat = 0.2 gram), the color rating (D, E, F, G, H, or I), and the clarity rating (IF, VVS1, VVS2, VS1, or VS2). (
> Which statement is false? Explain. a. If μ = 52 and σ = 15, then X = 81 would be an outlier. b. If the data are from a normal population, about 68 percent of the values will be within μ ± σ. c. If μ = 640 and σ = 128 then the coefficient of variation is
> (a) Does the 95 percent confidence interval for the slope include zero? If so, what does this tell you? If not, what does it mean? (b) Do a two-tailed t test for zero slope at α = .05. State the hypotheses, degrees of freedom, and critical
> Below are fitted regressions for Y = asking price of a used vehicle and X = the age of the vehicle. The observed range of X was 1 to 8 years. The sample consisted of all vehicles listed for sale in a particular week. (a) Interpret the slope of each fitte
> The regression equation Credits = 15.4 -.07 Work was estimated from a sample of 21 statistics students. Credits is the number of college credits taken and Work is the number of hours worked per week at an outside job. (a). Interpret the slope. (b). Is
> The regression equation HomePrice = 51.3 + 2.61 Income was estimated from a sample of 34 cities in the eastern United States. Both variables are in thousands of dollars. HomePrice is the median selling price of homes in the city, and Income is median fam
> The regression equation NetIncome = 2,277 + .0307 Revenue was estimated from a sample of 100 leading world companies (variables are in millions of dollars). (a). Interpret the slope. (b). Is the intercept meaningful? Explain. (c). Make a prediction o
> (a) Interpret the slope of the fitted regression Number of International Franchises = -47.5 + 1.75 Power Distance Index. The Power Distance Index is a measure on a scale of 0–100 of the wealth gap between the richest and poorest in a country. (b) What is
> Observations are taken on sales of a certain mountain bike in 30 sporting goods stores. The regression model was Y = total sales (thousands of dollars), X1 = display floor space (square meters), X2 = competitors’ advertising expenditure
> Observations are taken on net revenue from sales of a certain LCD TV at 50 retail outlets. The regression model was Y = net revenue (thousands of dollars), X1 = shipping cost (dollars per unit), X2 = expenditures on print advertising (thousands of dollar
> Refer to the ANOVA table below. (a) State the degrees of freedom for the F test for overall significance. (b) Use Appendix F to look up the critical value of F for α = .05. (c) Calculate the F statistic. Is the regression significant overal
> (a) Make a scatter plot of the data. What does it suggest about the correlation between X and Y? (b) Use Excel, MegaStat, or MINITAB to calculate the correlation coefficient. (c) Use Excel or Appendix D to find t.025 for a two-tailed test at Î&plu
> (a) Make an Excel scatter plot. What does it suggest about the population correlation between X and Y? (b) Make an Excel worksheet to calculate SSxx, SSyy, and SSxy. Use these sums to calculate the sample correlation coefficient. Check your work by using
> Use Excel, MegaStat, or MINITAB to fit the regression model, including residuals and standardized residuals. Midterm and Final Exam Scores for Business Statistics Students Fall Semester 2011 (n = 58 students) Midterm Exam Score …&ac
> For each sample, do a test for zero correlation. (a) Use Appendix D to find the critical value of tα. (b) State the hypotheses about ρ. (c) Perform the t test and report your decision. Appendix D: a. r = +.45, n = 20, Î&
> A box of Wheat Chex cereal is to be filled to a mean weight of 466 grams. The lower specification limit is 453 grams (the labeled weight is 453 grams) and the upper specification limit is 477 grams (so as not to overfill the box). The process standard de
> A new type of smoke detector battery is developed. From laboratory tests under standard conditions, the half-life (defined as less than 50 percent of full charge) of 20 batteries are shown below. (a). Make a histogram of the data and/or a probability p
> A Nabisco Fig Newton has a process mean weight of 14.00 g with a standard deviation of 0.10 g. The lower specification limit is 13.40 g and the upper specification limit is 14.60 g. (a). Describe the capability of this process, using the techniques you
> Refer to the freezer data’s 80 individual temperature observations in problem 17.52. Problem 17.52: Refer to the freezer problem 17.51 with μ = 23 and σ = 2. Temperature measurements are recorded four times a
> The temperature control unit on a commercial freezer in a 24-hour grocery store is set to maintain a mean temperature of 23 degrees Fahrenheit. The temperature varies because people are constantly opening the freezer door to remove items, but the thermos
> Refer back to Table 15.11, which shows the distribution of the number of U.S. Supreme Court appointments per year from 1900–1999. Since 1999 there have been four Supreme Court appointments with one each in the years 2005, 2006, 2009, an
> (a) Use Excel’s Data Analysis > Random Numbers to generate 100 Poisson-distributed random numbers with a mean of λ = 4. (b) Make a histogram of your sample and assess its shape. (c) Calculate descriptive statistics. Are
> (a) Use Excel’s function =RAND() or Excel’s Data Analysis > Random Numbers to generate 100 uniformly distributed random numbers between 0 and 1. (b) Make a histogram of your sample and assess its shape. (c) Calculat
> Interpret the slope. Does the intercept have meaning, given the range of the data? Midterm and Final Exam Scores for Business Statistics Students Fall Semester 2011 (n = 58 students) Midterm Exam Score ……â&
> (a) Use Excel’s function =NORM.INV(RAND(),0,1) or Excel’s Data Analysis > Random Numbers to generate 100 normally distributed random numbers with a mean of 0 and a standard deviation of 1. (b) Make a histogram of yo
> If a random experiment whose success probability is .20 is repeated 8 times, find the probability of (a) exactly 3 successes; (b) more than 3 successes; (c) at most 2 successes. (d) Which probability distribution did you use and why?
> Using test data on 43 vehicles, an analyst fitted a regression to predict CityMPG (miles per gallon in city driving) using as predictors Length (length of car in inches), Width (width of car in inches), and Weight (weight of car in pounds). Interpret the
> Analysis of a Detroit Marathon (n = 1,015 men, n = 150 women) produced the regression results shown below, with dependent variable Time (the marathon time in minutes) and predictors Age (runner’s age), Weight (runner’s
> An expert witness in a case of alleged racial discrimination in a state university school of nursing introduced a regression of the determinants of Salary of each professor for each year during an 8-year period (n = 423) with the following results, with
> A sports enthusiast created an equation to predict Victories (the team’s number of victories in the National Basketball Association regular season play) using predictors FGP (team field goal percentage), FTP (team free throw percentage)
> A researcher used stepwise regression to create regression models to predict Birth Rate (births per 1,000) using five predictors: LifeExp (life expectancy in years), InfMort (infant mortality rate), Density (population density per square kilometer), GDPC
> Using test data on 20 types of laundry detergent, an analyst fitted a regression to predict Cost Per Load (average cost per load in cents per load) using binary predictors Top Load (1 if washer is a top loading model, 0 otherwise) and Powder (if detergen
> A hospital emergency room analyzed n = 17,664 hourly observations on its average occupancy rates using six binary predictors representing days of the week and two binary predictors representing the 8-hour work shift (12 a.m.–8 a.m., 8 a.m.–4 p.m., 4 p.m.
> In a study of paint peel problems, a regression was suggested to predict defects per million (the response variable). The intended predictors were supplier (four suppliers, coded as binaries) and substrate (four materials, coded as binaries). There were
> Which one of the following is true? Why not the others? a. Histograms are useful for visualizing correlations. b. Pyramid charts are generally preferred to bar charts. c. A correlation coefficient can be negative.
> In a model of Ford’s quarterly revenue Total Revenue = β0 + β1 Car Sales + β2 Truck Sales + β3 SUVSales + ε, the three predictors are measured in number of units sold (not dollars). (a). Interpret each slope. (b). Would the intercept be meaningful? (c)
> If you are using time-series data, perform one or more tests for autocorrelation (visual inspection of residuals plotted against observation order, runs test, Durbin-Watson test). Is autocorrelation a concern? DATA SET A Mileage and Other
> Here are the ages of a random sample of 20 CEOs (chief executive officers) of Fortune 500 U.S. corporations. (a). Find the mean, median, and mode. (b). Discuss advantages and disadvantages of each of these measures of center for this data set. (c). F
> If you did not already do so, request a plot of residuals versus the fitted Y. Is heteroscedasticity a concern? DATA SET A Mileage and Other Characteristics of Randomly Selected Vehicles (n = 73, k = 4) O Mileage Obs Vehicle CityMPG Length
> If you did not already do so, request a histogram of standardized residuals and/or a normal probability plot. Do the residuals suggest non-normal errors? Explain. DATA SET A Mileage and Other Characteristics of Randomly Selected Vehicles (
> If you did not already do so, request leverage statistics. Are any observations influential? Explain. DATA SET A Mileage and Other Characteristics of Randomly Selected Vehicles (n = 73, k = 4) O Mileage Obs Vehicle CityMPG Length Width Wei
> (a). If you did not already do so, request a table of standardized residuals. (b). Are any residuals outliers (three standard errors) or unusual (two standard errors)? DATA SET A Mileage and Other Characteristics of Randomly Selected Vehi
> (a). If you did not already do so, rerun the regression requesting variance inflation factors (VIFs) for your predictors. (b). Do the VIFs suggest that multicollinearity is a problem? Explain. DATA SET A Mileage and Other Characteristics
> (a). Generate a correlation matrix for your predictors. Round the results to three decimal places. (b). Based on the correlation matrix, is collinearity a problem? DATA SET A Mileage and Other Characteristics of Randomly Selected Vehicles
> Use the standard error to construct an approximate prediction interval for Y. Based on the width of this prediction interval, would you say the predictions are good enough to have practical value? DATA SET A Mileage and Other Characteristi
> Use Excel’s Add Trendline feature to fit a linear regression to the scatter plot. Is a linear model credible? Midterm and Final Exam Scores for Business Statistics Students Fall Semester 2011 (n = 58 students) Midterm Exam Score &acir
> Based on the R2 and ANOVA table for your model, how would you describe the fit? DATA SET A Mileage and Other Characteristics of Randomly Selected Vehicles (n = 73, k = 4) O Mileage Obs Vehicle CityMPG Length Width Weight ManTran 3968 3583
> (a) Which p-values indicate predictor significance at α = .05? (b) Do the p-values support the conclusions you reached from the t tests? (c) Do you prefer the t test or the p-value approach? Why? DATA SET A Mileage and Other
> (a) Plot U.S. petroleum imports on a graph. (b) Describe the trend (if any) and discuss possible causes. (c) Fit both a linear and an exponential trend. (c) Interpret each fitted trend equation, explaining the implications. (d) Make a projection for 2010
> (a) Plot both men’s and women’s winning times on the same graph. (b) Fit a linear trend model to each series (men, women). (c) Use Excel’s option to forecast each trend graphically to 2040 (i.e., to p
> Do a two-tailed t test for zero slope for each predictor coefficient at α = .05. State the degrees of freedom and look up the critical value in Appendix D (or from Excel). Appendix D: Confidence Level Confidence Level 80 90
> Does a class break stimulate the pulse? Here are heart rates for a sample of 30 students before and after a class break. Research question: At α = .05, do the medians differ? Heart Rate before and after Class Break Student Before After
> Salaries of 30 randomly chosen individuals in the same occupation (only the first 3 and last 3 observations are shown). The data are from a salary equity study comparing two industries. Research question: Without assuming normality of the populations, is
> (a) Plot the data on law enforcement officers killed. (b) Describe the trend (if any) and discuss possible causes or anomalies in the data. (c) Would a fitted trend be helpful? Explain. (c) Make a forecast for 2009 using any method you like (including ju
> A cognitive retraining clinic assists outpatient victims of head injury, anoxia, or other conditions that result in cognitive impairment. Each incoming patient is evaluated to establish an appropriate treatment program and estimated length of stay (ELOS
> (a) Plot both men’s and women’s winning times on the same graph. (b) Fit a linear trend model to each series. From the fitted trends, will the times eventually converge? Hint: Ask Excel for forecasts (e.g., 20 years ah
> (a) Plot either receipts and outlays or federal debt and GDP (plot both time series on the same graph). (b) Describe the trend (if any) and discuss possible causes. (c) Fit a trend of your choice to each. (d) Interpret each fitted trend equation, explain
> (a) Choose one beverage category and plot the data. (b) Describe the trend (if any) and discuss possible causes. (c) Would a fitted trend be helpful? Explain. (d) Fit several trend models. Which is best, and why? If none is satisfactory, explain. (e) Mak
> (a) Plot the data on U.S. general aviation shipments. (b) Describe the pattern and discuss possible causes. (c) Would a fitted trend be helpful? Explain. (d) Make a similar graph for 1993–2008 only. Would a fitted trend be helpful in ma
> Which statement is correct? Why not the others? a. Likert scales are interval if scale distances are meaningful. b. Cross-sectional data are measured over time. c. A census is always preferable to a sample.
> (a) Choose one category of consumer credit and plot it. (b) Describe the trend (if any) and discuss possible causes. (c) Fit a trend model of your choice. (d) Make forecasts for 3 years (2011–2013), using a trend model of your choice.
> If freeway speeds are normally distributed with a mean of μ = 70 mph and σ = 7 mph, find the probability that the speed of a randomly chosen vehicle (a) exceeds 78 mph; (b) is between 65 and 75 mph; (c) is less than 70 mph.
> (a) Plot the voter participation rate. (b) Describe the trend (if any) and discuss possible causes. (c) Fit both a linear and a quadratic trend to the data. (d) Which model is preferred? Why? (e) Make a forecast for 2012, using a trend model of your choi
> (a) Plot the total minutes of TV viewing time per household. (b) Describe the trend (if any) and discuss possible causes. (c) Fit a linear trend to the data. (d) Would this model give reasonable forecasts? Would another trend model be better? Explain. (e
> (a) Plot both Swiss watch time series on the same graph. (b) Describe the trend (if any) and discuss possible causes. (c) Fit an exponential trend to each time series. (d) Interpret each fitted trend carefully. What conclusion do you draw? (e) Make forec