In 2012, the following data were reported by the U.S. Census Bureau. The data show the number of people (in thousands) living above and below the poverty line in each of the four regions of the United States. Based on these data, do you think there is an association between region and poverty? Explain.
> The researcher was concerned about whether each maid BMI changed over the four-week study. Here are the results of a test performed using technology: Mean of Paired Differences=0.2068t-Statistic=3.853 w/72 dfP=0.0003 Using the proper notation, how would
> Now let consider only the maids who were informed. Here is a display of their body mass index (BMI), a measure of body fat, at the start (BMI) and at the end (BMI2) of the study. Which of these is the best comment to make about this display? 1. This is n
> Which type of study is this? 1. A prospective observational study because it followed the maids for four weeks. 2. A retrospective observational study because at the end of four weeks, the researcher had to look back at the original measurements. 3. A su
> Consider the relationship between the life expectancy (in years) and the illiteracy rate (per hundred people) in the 50 U.S. states plus Washington, DC. A linear model is run and the output is presented here: Residual standard deviation: 1.097 on 48 degr
> Examine the table about ethnicity and acceptance for the Houston Independent School District magnet schools program, shown in Exercise 35. Does it appear that the admissions decisions are made independent of the applicant ethnicity? Explain.
> The following table summarizes 1529 films from 2014 and 2015 that have been classified into a genre and have a MPAA rating. (Data from Movies 06-15) 1. What percent of these films were rated R? 2. What percent of these films were R-rated comedies? 3. Wha
> During contract negotiations, a company seeks to change the number of sick days employees may take, saying that the annual average is 7 days of absence per employee. The union negotiators counter that the average employee misses only 3 days of work each
> A clerk entering salary data into a company spreadsheet accidentally put an extra “0†in the boss salary, listing it as $2,000,000 instead of $200,000. Explain how this error will affect these summary statistics for the company payroll: 1. Measures o
> Test scores from a calculus section of 40 students are shown in the histogram below. Describe the distribution of scores. Why might you be less sure of the description of the shape as compared to the histogram in Exercise 51?
> Test scores from a large calculus class of 400 are shown in the histogram below 1. Describe the distribution of scores. What might account for this shape? 2. Why might both the mean and median score be misleading as a summary of the center?
> In the Super Bowl, by how many points does the winning team outscore the losers? Here are the winning margins for the first 50 Super Bowl games. (Data in Super Bowl 2016) 25, 19, 9, 16, 3, 21, 7, 17, 10, 4, 18, 17, 4, 12, 17, 5, 10, 29, 22, 36, 19, 32, 4
> How many points do football teams score in the Super Bowl? Here are the total numbers of points scored by both teams in each of the first 50 Super Bowl games. (Data in Super Bowl 2016) 45, 47, 23, 30, 29, 27, 21, 31, 22, 38, 46, 37, 66, 50, 37, 47, 44, 4
> The Cornell Lab of Ornithology holds an annual Christmas Bird Count (www.birdsource.org), in which bird watchers at various locations around the country see how many different species of birds they can spot. Here are the numbers of species counted from t
> The histogram shows the lengths of hospital stays (in days) for all the female patients admitted to hospitals in New York during one year with a primary diagnosis of acute myocardial infarction (heart attack). 1. From the histogram, would you expect the
> One of the authors collected the times (in minutes) it took him to run 4 miles on various courses during a 10-year period. Here is a histogram of the times: Describe the distribution and summarize the important features. What is it about running that mig
> Look once more at the table summarizing the political views of Intro Stats students in Exercise 30. 1. Produce a graphical display comparing the conditional distributions of males and females among the three categories of politics. 2. Comment briefly on
> The histogram shows the carbohydrate content of 77 breakfast cereals (in grams). 1. Describe this distribution. 2. If you can, open the dataset and identify the cereals with the highest carbohydrate content.
> A survey of athletic trainers asked what modalities (treatment methods such as ice, whirlpool, ultrasound, or exercise) they commonly use to treat injuries. Respondents were each asked to list three modalities. The article included the following figure r
> The Yale Program on Climate Change Communication surveyed 1263 American adults in March 2015 and asked them about their attitudes on global climate change. Here a display of the percentages of respondents choosing each of the major alternatives offered.
> Fifty-nine countries won gold medals in the 2016 Summer Olympics. The table lists them, along with the total number of gold medals each won. 1. Try to make a display of these data. What problems do you encounter? 2. Organize the data so that the graph is
> The movie genres listed in Exercise 35 were originally listed as these: 1. What problem would you encounter in trying to make a display of these data? 2. How did the creators of the bar chart in Exercise 35 solve this problem?
> An investigator compiled information about recent nonmilitary plane crashes. The causes, to the extent that they could be determined, are summarized in the table. 1. Is it reasonable to conclude that the weather or mechanical failures caused only about 3
> The Centers for Disease Control and Prevention lists causes of death in the United States during 2014: 1. Is it reasonable to conclude that heart or lung diseases were the cause of approximately 29.04% of U.S. deaths in 2014? 2. What percentage of deaths
> The Chance article about the Houston magnet schools program described in Exercise 37 also indicated that 517 applicants were black or Hispanic, 292 Asian, and 946 white. Summarize the relative frequency distribution of ethnicity with a sentence or two (i
> An article in the Winter 2003 issue of Chance magazine (www.chance.amstat.org) reported on the Houston Independent School District magnet schools programs. Of the 1755 qualified applicants, 931 were accepted, 298 were wait-listed, and 526 were turned awa
> Here is a bar chart summarizing the movie ratings from the 891 movies shown in Exercise 4. 1. Which was the least common rating? 2. Is it easier to answer the question from the bar chart or from the pie chart in Exercise 4? Explain.
> The A Chance magazine article described in Chapter 2, Exercise 37 further examined the impact of an applicant ethnicity on the likelihood of admission to the Houston Independent School District magnet schools programs. Those data are summarized in the ta
> Here is a bar chart summarizing the movie genres from the 891 movies in Exercise 3. (Data extracted from Movies 06-15) 1. Were Thriller/Suspense or Adventure films more common? 2. Is it easier to answer the question from the bar chart or from the pie cha
> Would you expect distributions of these variables to be uniform, unimodal, or bimodal? Symmetric or skewed? Explain why. 1. Ages of people at a Little League game 2. Number of siblings of people in your class 3. Pulse rates of college-age males 4. Number
> Would you expect distributions of these variables to be uniform, unimodal, or bimodal? Symmetric or skewed? Explain why. 1. The number of speeding tickets each student in the senior class of a college has ever had 2. Players scores (number of strokes) at
> Find an article in a newspaper, a magazine, or the Internet that discusses a measure of spread. 1. Does the article discuss the W for the data? 2. What are the units of the variable? 3. Does the article use the range, IQR, or standard deviation? 4. Is th
> Find an article in a newspaper, a magazine, or the Internet that discusses an average. 1. Does the article discuss the W for the data? 2. What are the units of the variable? 3. Is the average used the median or the mean? How can you tell? 4. Is the choic
> Find a graph other than a histogram that shows the distribution of a quantitative variable in a newspaper, a magazine, or the Internet. 1. Does the article identify the W? 2. Discuss whether the display is appropriate for the data. 3. Discuss what the di
> Find a histogram that shows the distribution of a variable in a newspaper, a magazine, or the Internet. 1. Does the article identify the W? 2. Discuss whether the display is appropriate. 3. Discuss what the display reveals about the variable and its dist
> Find a table of categorical data from a newspaper, a magazine, or the Internet. 1. Is it clearly labeled? 2. Does it display percentages or counts? 3. Does the accompanying article tell the W of the variables? 4. Do you think the article correctly interp
> Find a frequency table of categorical data from a newspaper, a magazine, or the Internet. 1. Is it clearly labeled? 2. Does it display percentages or counts? 3. Does the accompanying article tell the W of the variable? 4. Do you think the article correct
> Find a pie chart of categorical data from a newspaper, a magazine, or the Internet. 1. Is the graph clearly labeled? 2. Does it violate the area principle? 3. Does the accompanying article tell the W of the variable? 4. Do you think the article correctly
> Look again at the table of political views for the Intro Stats students in Exercise 30. 1. Find the conditional distributions (percentages) of political views for the females. 2. Find the conditional distributions (percentages) of political views for the
> Find a bar chart of categorical data from a newspaper, a magazine, or the Internet. 1. Is the graph clearly labeled? 2. Does it violate the area principle? 3. Does the accompanying article tell the W of the variable? 4. Do you think the article correctly
> Here is a mosaic plot of the data on being successful from Exercise 10: 1. Are the differences in sample sizes in the four groups very large? Explain briefly. 2. Which factor seems more important in determining how someone responded: Age or Gender? Expla
> The organization Monitoring the Future (www.monitoringthefuture.org) asked 2048 eighth graders who said they smoked cigarettes what brands they preferred. The table shows brand preferences for two regions of the country. Write a few sentences describing
> The New York Times combined survey data (economix.blogs.nytimes.com/2013/07/10/working-parents-wanting-fewer-hours/) with data from the U.S. Bureau of Labor Statistics (BLS) (www.bls.gov/news.release/archives/famee_04262013.htm) comparing how mothers and
> Pew Research (www.pewsocialtrends.org/2013/03/14/modern-parenthood-roles-of-moms-and-dads converge-as-they-balance-work-and-family/) surveyed parents and asked how many hours they spent in various activities. They compared 2011 responses with those from
> Look again at the table of smoking prevalence in Exercise 19. 1. Compare the smoking rate among 1824-year-old women to that of men during the time covered by this table. 2. Relatively few women over the age of 65 smoke. What other variables might affect
> The Centers for Disease Control and Prevention provide data on smoking rates by year and for men and women separately. Here is a table with some of that information: 1. What was the smoking rate among 1824-year-old men in 1974? 2. How has the smoking rat
> You find a number of cartoons throughout this text. Are they Are cartoons simply entertaining, or will they help with learning? Lawrence M. Lesser, Dennis K. Pearl, and John J. Weber III (Assessing fun items effectiveness in increasing learning of colleg
> A recent article in Geophysical Research Letters (Asteroid impact effects and their immediate hazards for human populations, 10.1002/2017GL073191) simulated the consequences of an earth impact by an asteroid of 400 m in diameter. They estimate that for a
> The Motion Picture Association of America studies the ethnicity of moviegoers to understand changes in the demographics of moviegoers over time. Here are the numbers of moviegoers (in millions) classified as Hispanic, African American, Caucasian, and Oth
> Look again at the table of post-graduation plans for the senior class in Exercise 29. 1. Find the conditional distributions (percentages) of plans for the white students. 2. Find the conditional distributions (percentages) of plans for the minority stude
> Find a bar graph or pie chart of categorical data from a newspaper, a magazine, or the Internet. 1. Is the graph clearly labeled? 2. Does the graph violate the area principle? 3. Does the accompanying article tell the W of the variable? 4. Do you think t
> Find a contingency table of categorical data from a newspaper, a magazine, or the Internet. 1. Is it clearly labeled? 2. Does it display percentages or counts? 3. Does the accompanying article tell the W of the variables? 4. Do you think the article corr
> The following table shows the reasons given by people 16 years of age and older in the United States who are not in the labor force for not working in early 2015. Counts are in thousands of people. (bls.gov/cps/cpsaat35.htm) 1. What percent of the unempl
> Refer to the experiment in Exercise 7 . After collecting her data and analyzing the results, the student reports that the F-ratio for Power is 13.56 and the F-ratio for Time is 9.36. 1. What are the P-values? 2. What would you conclude? 3. What else abou
> In Chapter 25 we saw a student experiment to study the effect of Tire Pressure and Acceleration on gas mileage. He devises a system so that his Jeep Wagoneer uses gasoline from a one-liter container. He uses 3 levels of Tire Pressure (low, medium, and fu
> In the previous chapter we saw a two-factor experiment to test how microwave power and temperature affect popping. She chooses 3 levels of Power (low, medium, and high) and 3 Times (3 minutes, 4 minutes, and 5 minutes), running one bag at each condition.
> Another student performs a one-way ANOVA on the container data of Exercise 20 , using the 4 treatments water room, water outside, coffee room, and coffee outside. Perform this analysis and comment on the differences between this analysis and the one in E
> Another student analyzed the battery data from Exercise 25, using a one-way ANOVA. He considered the experimental factor to be an 8-level factor consisting of the 8 possible combinations of Brand and Environment. Here are the boxplots for the 8 treatment
> In an experiment on growing sweet peas, a team of students selected 2 factors at 4 levels each and recorded Weight, Stem Length, and Root Length after 612 days of growth. They grew plants using various amounts of Water and Quickgrow solution, a fertilize
> The U.S. Department of Labor (www.bls.gov) collects data on the number of U.S. workers who are employed at or below the minimum wage. Here is a table showing the number of hourly workers by Age and Sex and the number who were paid at or below the prevail
> A student experiment was run to test the performance of 4 brands of batteries under 2 different Environments (room temperature and cold). For each of the 8 treatments, 2 batteries of a particular brand were put into a flashlight. The flashlight was then
> Refer back to the experiment in Exercise 22 . Instead of Total counts, redo the analysis using log(Total counts) as the response. Do your conclusions change? How? Are the assumptions of the model better satisfied?
> Refer back to the experiment in Exercise 21 . Instead of mpg redo the analysis using log (mpg) as the response. Do your conclusions change? How? Are the assumptions of the model better satisfied?
> A gas chromatograph is an instrument that measures the amounts of various compounds in a sample by separating its constituents. Because different components are flushed through the system at different rates, chromatographers are able to both measure and
> An experiment to test a new gasoline additive, Gasplus, was performed on three different cars: a sports car, a minivan, and a hybrid. Each car was tested with both Gasplus and regular gas on 10 different occasions and their gas mileage was recorded. Here
> Building on the cup experiment of the Chapter 4 Step-By-Step, a student selects one type of container and designs an experiment to see whether the type of Liquid stored and the outside Environment affect the ability of a cup to maintain temperature. He r
> The students running the sprouts experiment (Exercise 12 ) also kept track of the number of beans sprouted (out of 40) for each of the 36 dishes. Here are the partial boxplots of Sprouts plotted against Salinity and Temperature: 1. State the hypotheses a
> For his final project, Jonathan examined the effects of two factors on how well stains are removed when washing clothes. On each of 16 new white handkerchiefs, he spread a teaspoon of dirty motor oil (obtained from a local garage). He chose 4 Temperature
> A student performed an experiment to see if her favorite sneakers and the time of day might affect her free throw percentage. She tried shooting with and without her favorite sneakers and in the early morning and at night. For each treatment combination,
> Refer back to Exercise 14 . Perform your own analysis of the data to see if eating fish and contracting prostate cancer are related.
> How have movies changed during the decade from 2006 to 2015? Here is a contingency table showing the proportion of movies with each of the MPAA categories in each year: 1. Are these column percents or row percents? How can you tell? 2. Does it look like
> Refer back to Exercise 13 . Perform your own analysis of the data to see if baldness and heart disease are related. Do your conclusions support the claim that baldness is a cause of heart disease? Explain.
> The Chapter 3 Step-By-Step looked at a Swedish study that asked 6272 men how much fish they ate and whether or not they had prostate cancer. (Data in Fish diet) Here are summary counts: 1. Comment on her analysis. What problems, if any, do you find with
> A retrospective study examined the link between baldness and the incidence of heart disease. In the study, 1435 middle-aged men were selected at random and examined to see whether they showed signs of Heart Disease (or not) and what amount of Baldness th
> An experiment on mung beans was performed to investigate the environmental effects of salinity and water temperature on sprouting. Forty beans were randomly allocated to each of 36 petri dishes that were subject to one of four levels of Salinity (0, 4, 8
> The National Highway Transportation Safety Administration runs crash tests in which stock automobiles are crashed into a wall at 35 mph with dummies in both the passenger and the driver seats. The THOR Alpha crash dummy is capable of recording 134 channe
> Refer to the experiment in Exercise 8 . After analyzing his data the student reports that the F-ratio for Tire Pressure is 4.29 with a P-value of 0.030, the F-ratio for Acceleration is 2.35 with a P-value of 0.143, and the F-ratio for the Interaction eff
> A pharmaceutical company tested three formulations of a pain relief medicine for migraine headache sufferers. For the experiment, 27 volunteers were selected and 9 were randomly assigned to one of three drug formulations. The subjects were instructed to
> To see how much of a difference time of day made on the speed at which he could download files, a college sophomore performed an experiment. He placed a file on a remote server and then proceeded to download it at three different time periods of the day.
> We also have data on the protein content of the cereals in Exercise 19 by their shelf number. Here are the boxplot and ANOVA table: 1. What are the null and alternative hypotheses? 2. What does the ANOVA table say about the null hypothesis? (Be sure to r
> Supermarkets often place similar types of cereal on the same supermarket shelf. We have data on the shelf as well as the sugar, sodium, and calorie content of 77 cereals. Does sugar content vary by shelf? At the top of the next column is a boxplot and an
> Students in an Intro Stats course were asked to describe their politics as Liberal, Moderate, or Conservative. Here are the results: 1. What percent of the class is male? 2. What percent of the class considers themselves to be Conservative? 3. What perce
> A biology student is studying the effect of 10 different fertilizers on the growth of mung bean sprouts. She sprouts 12 beans in each of 10 different petri dishes, and adds the same amount of fertilizer to each dish. After one week she measures the heigh
> A school district superintendent wants to test a new method of teaching arithmetic in the fourth grade at his 15 schools. He plans to select 8 students from each school to take part in the experiment, but to make sure they are roughly of the same ability
> In a statement to a Senate Public Works Committee, a senior executive of Texaco, Inc., cited a study on the effectiveness of auto filters on reducing noise. Because of concerns about performance, two types of filters were studied, a standard silencer and
> A student wants to investigate the effects of real vs. substitute eggs on his favorite brownie recipe. He enlists the help of 10 friends and asks them to rank each of 8 batches on a scale from 1 to 10. Four of the batches were made with real eggs, four w
> Particulate matter is a serious form of air pollution often arising from industrial production. One way to reduce the pollution is to put a filter, or scrubber, at the end of the smokestack to trap the particulates. An experiment to determine which smoke
> An experiment to determine the effect of several methods of preparing cultures for use in commercial yogurt was conducted by a food science research group. Three batches of yogurt were prepared using each of three methods: traditional, ultrafiltration, a
> A regression model for data on breakfast cereals originally looked like this: Dependent variable is: Calories R squared =84.5% R-squared (adjusted)=83.4% s=7.947 with 776=71 degrees of freedom Let’s take a closer look at the coefficien
> HIV One ongoing health problem in the part of Africa encompassing the outlying countries for the regression model of Exercise 17 is HIV/AIDS. Could that explain these outliers? Here another model, now with the logarithm of the HIV incidence included as a
> Here the residual plot corresponding to the regression model of Exercise 18 : The extreme case this time is Weight Watchers Pepperoni (makes sense, doesn’t it?). We can make one more indicator for Weight Watchers. Here the model: Depend
> The residual plot of Exercise 17 calls out some countries that have particularly large negative residuals. They are Gabon, Swaziland, Botswana, Namibia, and South Africa. What do these countries have in common? (Hint: Consult a map.) What does it mean fo
> Prior to graduation, a high school class was surveyed about its plans. The following table displays the results for white and minority students (the Minority group included African American, Asian, Hispanic, and Native American students): 1. What percent
> A plot of Studentized residuals against predicted values for the regression model found in Exercise 16 now looks like this. It has been colored according to Type of pizza and separate regression lines fitted for each type: 1. Comment on this diagnostic p
> At the top of the next column is a regression analysis to predict Life expectancy using the data of Exercise 15 and a plot of the residuals Response variable is: Life expectancy 240 total cases of which 20 are missing Comment on the model and the residua
> Here a plot of the Studentized residuals against the predicted values for the regression model found in Exercise 14 : The two extraordinary cases in the plot of residuals are Reggio and Michelina, two gourmet pizzas. 1. Interpret these residuals. What do
> Here is a scatterplot matrix of the variables as re-expressed in Exercise 13 using a version that places Normal probability plots on the diagonal. 1. Comment on their suitability for a regression model to predict Life expectancy. The points are colored a
> Union rated frozen pizzas. Their report includes the number of Calories, Fat content, and Type (cheese or pepperoni, represented here as an indicator variable that is 1 for cheese and 0 for pepperoni). Here a regression model to predict the Score awarded
> The United States Central Intelligence Agency maintains a public site called the World Factbook at www.cia.gov/library/publications/the-worldfactbook/. There you find a wealth of variables about all the countries of the world. Let’s exa
> In Chapter 9 , Exercises 14 , 18, 29, and 30, we considered data on hill races in Scotland. These are overland races that climb and descend hills sometimes several hills in the course of one race. Here is a regression analysis to predict the Women Record
> In Exercise 25 of Chapter 9 , we considered a multiple regression model for predicting calories in breakfast cereals. The regression looked like this: Dependent variable is: Calories R-squared =38.4% R-squared (adjusted)=35.9% s=15.60 with 774=73 degrees