An independent golf equipment testing facility compared the difference in the performance of golf balls hit off a regular 2-3/4” wooden tee to those hit off a 3” Stinger Competition golf tee. A Callaway Great Big Bertha driver with 10 degrees of loft was used for the test, and a robot swung the club head at approximately 95 miles per hour. Boxplots of the distances traveled, in yards, are shown in the following figure. Use the boxplots to compare the driving distances for the two golf tees, paying special attention to center and variation.
> Why is probability theory important to statistics?
> For quantitative data, we examined three types of grouping: single-value grouping, limit grouping, and cut point grouping. For each type of data given, decide which of these three grouping types is usually best. Explain your answers. a. Continuous data d
> In an on-line press release, ABCNews.com reported that “. . . 73 percent of Americans. . . favor a law that would require every gun sold in the United States to be test-fired first, so law enforcement would have its fingerprint in case it were ever used
> Based on the least-squares criterion, the line that best fits a set of data points is the one with the ____ possible sum of squared errors.
> Regarding the variables in a regression analysis, a. what is the independent variable called? b. what is the dependent variable called?
> Identify one use of a regression equation.
> What kind of plot is useful for deciding whether finding a regression line for a set of data points is reasonable?
> Explain your answers. If a line has a positive slope, y-values on the line decrease as the x-values decrease.
> Explain your answers. A horizontal line has no slope.
> Explain your answers. The y-intercept of a line has no effect on the steepness of the line.
> From the website Golf.com, part of Sports Illustrated Sites, we obtained the scores for the first and second rounds of the 2013 U.S. Open golf tournament. You will find those scores on the WeissStats site. For part (d), predict the second-round score of
> The National Oceanic and Atmospheric Administration publish temperature and precipitation information for cities around the world in Climates of the World. Data on average high temperature (in degrees Fahrenheit) in July and average precipitation (in inc
> From the International Data Base, published by the U.S. Census Bureau, we obtained data on infant mortality rate (IMR) and life expectancy (LE), in years, for a sample of 60 countries. The data are presented on the WeissStats site. For part (d), predict
> With regard to grouping quantitative data into classes in which each class represents a range of possible values, we discussed two methods for depicting the classes. Identify the two methods and explain the relative advantages and disadvantages of each m
> In the article “Effects of Human Population, Area, and Time on Non-native Plant and Fish Diversity in the United States” (Biological Conservation, Vol. 100, No. 2, pp. 243–252), M. McKinney investigated the relationship of various factors on the number o
> Refer to Problem 21. a. Compute the linear correlation coefficient, r. b. Interpret your answer from part (a) in terms of the linear relationship between student-to faculty ratio and graduation rate. c. Discuss the graphical implications of the value of
> Refer to Problem 21. a. Determine SST, SSR, and SSE by using the computing formulas. b. Obtain the coefficient of determination. c. Obtain the percentage of the total variation in the observed graduation rates that is explained by student-to-faculty rati
> Graduation rate—the percentage of entering freshmen attending full time and graduating within 5 years— and what influences it is a concern in U.S. colleges and universities. U.S. News and World Report’s “College Guide” provides data on graduation rates f
> A small company has purchased a computer system for $7200 and plans to depreciate the value of the equipment by $1200 per year for 6 years. Let x denote the age of the equipment, in years, and y denote the value of the equipment, in hundreds of dollars.
> Consider the linear equation y = 4 − 3x. a. At what y-value does its graph intersect the y-axis? b. At what x-value does its graph intersect the y-axis? c. What is its slope? d. By how much does the y-value on the line change when the x-value increases b
> Answer true or false to the following statement, and explain your answer: A strong correlation between two variables doesn’t necessarily mean that they’re causally related.
> A value of r close to ____ suggests at most a weak linear relationship between the variables.
> A value of r close to −1 suggests a strong ____ linear relationship between the variables.
> A positive linear relationship between two variables means that one variable tends to increase linearly as the other ____ .
> State three of the most important guidelines in choosing the classes for grouping a quantitative data set.
> One use of the linear correlation coefficient is as a descriptive measure of the strength of the ____ relationship between two variables.
> For each of the sums of squares in regression, state its name and what it measures. a. SST b. SSR c. SSE
> Identify a use of the coefficient of determination as a descriptive measure.
> In the context of regression analysis, what is an a. outlier? b. influential observation?
> Using a regression equation to make predictions for values of the predictor variable outside the range of the observed values of the predictor variable is called _____ .
> The line that best fits a set of data points according to the least squares criterion is called the ____ line.
> For a linear equation y = b0 + b1x, identify the a. independent variable. b. dependent variable. c. slope. d. y-intercept.
> A quantitative data set of size 87 has mean 80 and standard deviation 10. At least how many observations lie between 60 and 100?
> What does Chebyshev’s rule say about the percentage of observations in any data set that lie within a. six standard deviations to either side of the mean? b. 1.5 standard deviations to either side of the mean?
> Complete the statement: Almost all the observations in any data set lie within ____ standard deviations to either side of the mean.
> Do the concepts of class limits, marks, cutpoints, and midpoints make sense for qualitative data? Explain your answer
> Data Set A has more variation than Data Set B. Decide which of the following statements are necessarily true. a. Data Set A has a larger mean than Data Set B. b. Data Set A has a larger standard deviation than Data Set B.
> Specify the mathematical symbol used for each of the following descriptive measures. a. Sample mean b. Sample standard deviation c. Population mean d. Population standard deviation
> Identify the most appropriate measure of variation corresponding to each of the following measures of center. a. Mean b. Median
> Philosophical and health issues are prompting an increasing number of Taiwanese to switch to a vegetarian lifestyle. In the paper “LDL of Taiwanese Vegetarians Are Less Oxidizable than Those of Omnivores” (Journal of Nutrition, Vol. 130, pp. 1591–1596),
> The U.S. National Oceanic and Atmospheric Administration publishes temperature data in Climatography of the United States. According to that document, the annual average maximum and minimum temperatures for selected cities in the United States are as pro
> From The World Bank, in the document Life Expectancy at Birth, we obtained data on the expectation of life (in years) at birth for people in various countries. Those data are presented on the WeissStats site. a. obtain the mean, median, and mode(s) of th
> The U.S. Department of Agriculture collects data pertaining to the value of agricultural exports and publishes its findings in U.S. Agricultural Trade Update. For one year, the values of these exports, by state, are provided on the WeissStats site. Data
> Among the measures of center discussed, which is the only one appropriate for qualitative data?
> The U.S. Census Bureau classifies the states in the United States by region and division. The data giving the region and division of each state are presented on the WeissStats site. Use the technology of your choice to determine the mode(s) of the a. reg
> The U.S. Energy Information Administration reports weekly figures on retail gasoline prices in Weekly Retail Gasoline and Diesel Prices. Every Monday, retail prices for all three grades of gasoline are collected by telephone from a sample of approximatel
> Identify an important reason for grouping data.
> According to the Statistical Summary of Students and Staff , prepared by the Department of Information Resources and Communications, Office of the President, University of California, the Fall 2012 enrollment figures for undergraduates at the University
> Beachbody, LLC, provides fitness programs, including home workout videos and nutrition. P90X, or Power 90 Extreme, is a home exercise program that consists of an intense series of workout DVDs. It is a 90-day program that uses the term “muscle confusion”
> In the article “Distribution of Oxygen in Surface Sediments from Central Sagami Bay, Japan: In Situ Measurements by Microelectrodes and Planar Optodes” (Deep Sea Research Part I: Oceanographic Research Papers, Vol. 52, Issue 10, pp. 1974–1987), R. Glud e
> The ages of the 36 millionaires sampled are arranged in increasing order in the following table. a. Determine the quartiles for the data. b. Obtain and interpret the interquartile range. c. Find and interpret the five-number summary. d. Calculate the low
> The U.S. Census Bureau publishes annual price figures for new mobile homes in Manufactured Housing Statistics. The prices of a sample of 250 new mobile homes have roughly a bell-shaped distribution with mean $63.3 thousand and standard deviation $7.9 tho
> The objective of the article, “Caffeinated and Caffeine-free Beverages and Risk of Type 2 Diabetes” (American Journal of Clinical Nutrition, Vol. 97, No. 1, pp. 155–166) by S. Bhupathiraju et al., was to examine the association between caffeinated bevera
> Dr. Thomas Stanley of Georgia State University has collected information on millionaires, including their ages, since 1973. A sample of 36 millionaires has a mean age of 58.5 years and a standard deviation of 13.4 years. a. Complete the following graph.
> Identify the two most commonly used measures of center for quantitative data. Explain the relative advantages and disadvantages of each.
> In the paper “Injuries and Risk Factors in a 100-Mile (161-km) Infantry Road March” (Preventative Medicine, Vol. 28, pp. 167–173), K. Reynolds et al. reported on a study commissioned by the U.S. Army. The purpose of the study was to improve medical plann
> In Issue 338 of the Amstat News, thenpresident of the American Statistical Association, F. Scheuren, reported the results of a survey on how members would prefer to receive ballots in annual elections. On the WeissStats site, you will find data for prefe
> In the article “Fossil Argonauts (Mollusca: Cephalopoda: Octopodida) from Late Miocene Siltstones of the Los Angeles Basin, California” (Journal of Paleontology, Vol. 79, No. 3, pp. 520–531), paleontologists L. Saul and C. Stadum discussed fossilized Arg
> The U.S. National Center for Health Statistics collects data on causes of death and publishes its findings in National Vital Statistics Reports. Which of the three main measures of center is appropriate for causes of death? Explain your answer.
> The National Center for Health Statistics publishes information on the duration of marriages in Vital Statistics of the United States. Which measure of center is more appropriate for data on the duration of marriages, the mean or the median? Explain your
> An integral part of doing business in the dot-com culture of the late 1990s was frequenting the party circuit centered in San Francisco. Here high-tech companies threw as many as five parties a night to recruit or retain talented workers in a highly comp
> Regarding z-scores: a. How is a z-score obtained? b. What is the interpretation of a z-score? c. An observation has a z-score of 2.9. Roughly speaking, what is the relative standing of the observation?
> Regarding outliers: a. What is an outlier? b. Explain how you can identify potential outliers, using only the first and third quartiles.
> Regarding the five-number summary: a. Identify its components. b. How can it be employed to describe center and variation? c. What graphical display is based on it?
> A data set of size 152 with roughly a bell-shaped distribution has mean 25 and standard deviation 4. Approximately how many observations lie between 17 and 33?
> A data set with roughly a bell-shaped distribution has mean 45 and standard deviation 12. Approximately what percentage of the observations lie between 33 and 57?
> Define a. descriptive measures. b. measures of center. c. measures of variation.
> Research by W. Clark and L. Midanik (Alcohol Consumption and Related Problems: Alcohol and Health Monograph 1. DHHS Pub. No. (ADM) 82–1190) examined, among other issues, alcohol consumption patterns of U.S. adults by marital status. Data for marital stat
> A quantitative data set has been grouped by using limit grouping with equal-width classes. The lower and upper limits of the first class are 3 and 8, respectively, and the class width is 6. a. What is the class mark of the second class? b. What are the l
> When is the use of single-value grouping particularly appropriate?
> Some users of statistics prefer pie charts to bar charts because people are accustomed to having the horizontal axis of a graph show order. For example, someone might infer from “Republican” is less than “Other” because “Republican” is shown to the left
> In a bar chart, unlike in a histogram, the bars do not abut. Give a possible reason for that.
> Identify two main types of graphical displays that are used for qualitative data.
> What is the relationship between a frequency or relative frequency distribution of a quantitative data set and that of a qualitative data set?
> For a qualitative data set, what is a a. frequency distribution? b. relative-frequency distribution?
> The U.S. Census Bureau divides the states in the United States into nine divisions: East North Central (ENC), East South Central (ESC), Middle Atlantic (MAC), Mountain (MTN), New England (NED), Pacific (PAC), South Atlantic (SAC), West North Central (WNC
> Firearms, live ammunition, and spent cartridge casings are often submitted to crime laboratories to be processed for latent fingerprints. B. Maldonado explored the chances of successfully recovering fingerprints in the article, “Study on Developing Laten
> From the ESPN Web site, we obtained the age of the oldest player on each of the major league baseball teams during one season. Here are the data. a. Construct a dotplot for these data. b. Use your dotplot from part (a) to identify the modality and symmet
> Provide a reason why the classification of data is important.
> The Air Travel Consumer Report is a monthly product of the Department of Transportation’s Office of Aviation Enforcement and Proceedings. The report is designed to assist consumers with information on the quality of services provided by the airlines. Fol
> The Prescott National Bank has six tellers available to serve customers. The data in the following table provide the number of busy tellers observed during 25 spot checks. a. Use single-value grouping to organize these data into frequency and relative-fr
> In the article “Comparing the lifetime of two brands of batteries” (Journal of Statistics Education, Vol. 21, No. 1, pp. 1–19) by P. Dunn, two brands of AA alkaline batteries were compared. The following table gives the number of pulses required for a sa
> Refer to Problem 19. Construct a stem and-leaf diagram for the inauguration ages of the first 44 presidents of the United States. a. Use one line per stem. b. Use two lines per stem. c. Which of the two stem-and-leaf diagrams that you just constructed co
> Refer to Problem 19. Construct a dotplot for the ages at inauguration of the first 44 presidents of the United States. Data from Problem 19: From the Information Please Almanac, we obtained the ages at inauguration for the first 44 presidents of the Uni
> This problem is about data. a. What are data? b. How is data type determined?
> From the Information Please Almanac, we obtained the ages at inauguration for the first 44 presidents of the United States (from George Washington to Barack H. Obama). a. Identify the classes for grouping these data, using limit grouping with classes of
> Refer to Example: a. Explain why a frequency histogram of the DVD prices with single-value classes would be essentially identical to the dotplot. b. Would the dotplot and a frequency histogram be essentially identical with other than single-value classes
> According to Wikipedia, the world’s five largest hydroelectric plants, based on installed capacity, are as shown in the following table. Capacities are in megawatts. a. What type of data is given in the first column of the table? b. What type of data is
> A variable of a population has a left-skewed distribution. a. If a large simple random sample is taken from the population, roughly what shape will the distribution of the sample have? Explain your answer. b. If two simple random samples are taken from t
> The Japan Automobile Manufacturers Association provides data on exported vehicles in Motor Vehicle Statistics of Japan. In 2010, cars, trucks, and buses constituted 88.3%, 9.3%, and 2.4% of vehicle exports, respectively. A random sample of last year’s ex
> Draw a smooth curve that represents a symmetric trimodal (three-peak) distribution.
> Sketch the curve corresponding to each of the following specific distribution shapes. a. Bell shaped b. Triangular c. Reverse J shaped d. Uniform
> Explain the relative positioning of the bars in a histogram to the numbers that label the horizontal axis when each of the following quantities is used to label that axis. a. Lower class limits b. Lower class cutpoints c. Class marks d. Class midpoints
> A quantitative data set has been grouped by using cutpoint grouping with equal-width classes of width 8. a. If the midpoint of the first class is 10, what are its lower and upper cutpoints? b. What is the class midpoint of the second class? c. What are t
> A quantitative data set has been grouped by using cutpoint grouping with equal-width classes. a. If the lower and upper cutpoints of the first class are 5 and 15, respectively, what is the common class width? b. What is the midpoint of the second class?
> A quantitative data set has been grouped by using limit grouping with equal-width classes of width 5. The class limits are whole numbers. a. If the class mark of the first class is 8, what are its lower and upper limits? b. What is the class mark of the
> The U.S. National Oceanic and Atmospheric Administration publishes temperature data in Climatography of the United States. According to that document, the annual average maximum and minimum temperatures for selected cities in the United States are as pro
> Life expectancy is the average number of years to be lived by a group of people born in the same year if mortality at each age remains constant in the future. From the World FactBook, published by the Central Intelligence Agency (CIA), we obtained the li
> The U.S. Department of Agriculture collects data pertaining to the value of agricultural exports and publishes its findings in U.S. Agricultural Trade Update. For one year, the values of these exports, by state, are provided on the WeissStats site. Data
> In the article “Graphical Display of Two Way Contingency Tables” (The American Statistician, Vol. 28, No. 1, pp. 9–12), R. Snee presented data on hair color and eye color among 592 students in an elementary statistics course at the University of Delaware