Here are several scatterplots. The calculated correlations are 0.923,0.487,0.006, and 0.777. Which is which?
> An online clothing retailer keeps track of its customers purchases. For those customers who signed up for the company credit card, the company also has information on the customer Age and Income. A random sample of 500 of these customers shows the follow
> In Chapter 6, Exercise 40, we saw Below is a plot of mortgages in the United States (in trillions of 2013 dollars) vs. the interest rate at various times over the past 25 years. The correlation is r=0.845. The mean mortgage amount is $8.207 T and the mea
> In Chapter 6, Exercise 39, We learned that the Office of Federal Housing Enterprise Oversight (OFHEO) collects data on various aspects of housing costs around the United States. Here a scatterplot (by state) of the Housing Cost Index (HCI) vs. the Median
> Refer again to the regression analysis for home average attendance and games won by baseball teams, seen in Exercise 44. 1. Write the equation of the regression line. 2. Estimate the Home Average Attendance for a team with 750 Runs. 3. Interpret the mean
> Most roller coasters get their speed by dropping down a steep initial incline, so it makes sense that the height of that drop might be related to the speed of the coaster. Here a scatterplot of top Speed and largest Drop for 118 roller coasters around th
> Take another look at the regression analysis of tar and nicotine content of the cigarettes in Exercise 43. 1. Write the equation of the regression line. 2. Estimate the Nicotine content of cigarettes with 4 milligrams of Tar. 3. Interpret the meaning of
> Consider again the regression of Home Average Attendance on Runs for the baseball teams examined in Exercise 44. 1. What is the correlation between Runs and Home Average Attendance? 2. What would you predict about the Home Average Attendance for a team t
> Consider again the regression of Nicotine content on Tar (both in milligrams) for the cigarettes examined in Exercise 43. 1. What is the correlation between Tar and Nicotine? 2. What would you predict about the average Nicotine content of cigarettes that
> In Chapter 6, Exercise 45 looked We can look at the relationship between the number of runs scored by American League baseball teams and the average attendance at their home games for the 2016 season. Here are the scatterplot, the residuals plot, and par
> Is the nicotine content of a cigarette related to the tar? A collection of data (in milligrams) on 816 cigarettes produced the scatterplot, residuals plot, and regression analysis shown: 1. Do you think a linear model is appropriate here? Explain. 2. Exp
> Consider the roller coasters (with the outlier removed) described in Exercise 30 again. The regression analysis gives the model Duration=87.22+0.389 Drop. 1. Explain what the slope of the line says about how long a roller coaster ride may last and the he
> Consider the Albuquerque home sales from Exercise 29 again. The regression analysis gives the model Price=47.82+0.061Â Size. 1. Explain what the slope of the line says about housing prices and house size. 2. What price would you predict for a 3000-square
> Players in any sport who are having great seasons, turning in performances that are much better than anyone might have anticipated, often are pictured on the cover of Sports Illustrated. Frequently, their performances then falter somewhat, leading some a
> People who claim to have extrasensory perception (ESP) participate in a screening test in which they have to guess which of several images someone is thinking of. You and a friend both took the test. You scored 2 standard deviations above the mean, and y
> The regression of Duration of a roller coaster ride on the height of its initial Drop, described in Exercise 30, had R2=29.4%. 1. What is the correlation between Drop and Duration? 2. What would you predict about the Duration of the ride on a coaster who
> The National Insurance Crime Bureau reports that Honda Accords, Honda Civics, and Toyota Camrys are the cars most frequently reported stolen, while Ford Tauruses, Pontiac Vibes, and Buick LeSabres are stolen least often. Is it reasonable to say that ther
> The regression of Price on Size of homes in Albuquerque had R2=71.4% as described in Exercise 29. 1. What is the correlation between Size and Price? 2. What would you predict about the Price of a home 1 SD above average in Size? 3. What would you predict
> A sociology student investigated the association between a country Literacy Rate and Life Expectancy, and then drew the conclusions listed below. Explain why each statement is incorrect. (Assume that all the calculations were done properly.) 1. The R2 of
> A biology student who created a regression model to use a bird Height when perched for predicting its Wingspan made these two statements. Assuming the calculations were done correctly, explain what is wrong with each interpretation. 1. My R2 of 93% shows
> Exercise 30 examined the association between the Duration of a roller coaster ride and the height of its initial Drop, reporting that R2=29.4%. Write a sentence (in context, of course) summarizing what the R2 says about this regression.
> The regression of Price on Size of homes in Albuquerque had R2=71.4%, as described in Exercise 29. Write a sentence (in context, of course) summarizing what the R2 says about this regression.
> If you create a regression model for estimating the Height of a pine tree (in feet) based on the Circumference of its trunk (in inches), is the slope most likely to be 0.1, 1, 10, or 100? Explain.
> If you create a regression model for predicting the Weight of a car (in pounds) from its Length (in feet), is the slope most likely to be 3, 30, 300, or 3000? Explain.
> The dataset on roller coasters A lists the Duration of the ride in seconds in addition to the Drop height in feet for some of the coasters. One coaster (the Tower of Terror) is unusual for having a large drop but a short ride. After setting it aside, a r
> A random sample of records of home sales from Feb. 15 to Apr. 30, 1993, from the files maintained by the Albuquerque Board of Realtors gives the Price and Size (in square feet) of 117 homes. A regression to predict Price (in thousands of dollars) from Si
> Tell what each of the residual plots below indicates about the appropriateness of the linear model that was fit to the data.
> A candidate for office claims that there is a correlation between television watching and crime. Criticize this statement on statistical grounds.
> Tell what each of the residual plots below indicates about the appropriateness of the linear model that was fit to the data.
> Fill in the missing information in the following table.
> Fill in the missing information in the following table.
> For Exercise 16 regression model predicting fuel economy (in mpg) from the car engine size, se=3.522. Explain in this context what that means.
> For Exercise 15 regression model predicting potassium content (in milligrams) from the amount of fiber (in grams) in breakfast cereals, se=30.77. Explain in this context what that means.
> The correlation between a car engine size and its fuel economy (in mpg) is r=0.774. What fraction of the variability in fuel economy is accounted for by the engine size?
> The correlation between a cereal fiber and potassium contents is r=0.903. What fraction of the variability in potassium is accounted for by the amount of fiber that servings contain?
> In Exercise 16, the regression model Combined MPG=33.46 3.23 Displacement relates cars engine size to their fuel economy (Combined mpg). Explain what the slope means.
> In Exercise 15, the regression model Potassium^=38+27 Fiber relates fiber (in grams) and potassium content (in milligrams) in servings of breakfast cereals. Explain what the slope means.
> Exercise 16 describes a regression model that uses a car engine displacement to estimate its fuel economy. In this context, what does it mean to say that a certain car has a positive residual?
> Here are several scatterplots. The calculated correlations are 0.977,0.021,0.736, and 0.951. Which is which?
> The data set Igf13 contains the data from Igf for children under 13 years old. Most of the data was collected from physical examinations in schools. 1. Fit a linear regression to igÆ’ using age as the predictor variable. Comment on the appropriateness of
> Exercise 15 describes a regression model that estimates a cereal potassium content from the amount of fiber it contains. In this context, what does it mean to say that a cereal has a negative residual?
> In Chapter 6, Exercise 41 we examined We can examine the relationship between the fuel economy (Combined MPG) and Displacement (in liters) for 1211 models of cars. (Data in Fuel economy 2016) Further analysis produces the regression model Combined MPG^=3
> For many people, breakfast cereal is an important source of fiber in their diets. Cereals also contain potassium, a mineral shown to be associated with maintaining a healthy blood pressure. An analysis of the amount of fiber (in grams) and the potassium
> Agricultural scientists are working on developing an improved variety of Roma tomatoes. Marketing research indicates that customers are likely to bypass Romas that weigh less than 70 grams. The current variety of Roma plants produces fruit that averages
> Hens usually begin laying eggs when they are about 6 months old. Young hens tend to lay smaller eggs, often weighing less than the desired minimum weight of 54 grams. 1. The average weight of the eggs produced by the young hens is 50.9 grams, and only 28
> Most people think that the normal adult body temperature is 98.6F. That figure, based on a 19th-century study, has recently been challenged. In a 1992 article in the Journal of the American Medical Association, researchers reported that a more accurate f
> Companies that design furniture for elementary school classrooms produce a variety of sizes for kids of different ages. Suppose the heights of kindergarten children can be described by a Normal model with a mean of 38.2 inches and standard deviation of 1
> A tire manufacturer believes that the tread life of its snow tires can be described by a Normal model with a mean of 32,000 miles and standard deviation of 2500 miles. 1. If you buy one of these tires, would it be reasonable for you to hope it will last
> Assume the cholesterol levels of adult American women can be described by a Normal model with a mean of 188 mg/dL and a standard deviation of 24. 1. Draw and label the Normal model. 2. What percent of adult women do you expect to have cholesterol levels
> Consider the IQ model N(100,15) one last time. 1. What IQ represents the 15th percentile? 2. What IQ represents the 98th percentile? 3. What the IQR of the IQs?
> Consider the Angus weights model N(1152,84) one last time. 1. What weight represents the 40th percentile? 2. What weight represents the 99th percentile? 3. What the IQR of the weights of these Angus steers?
> In the Normal model N(100,15) from Exercise 10 , what cutoff value bounds 1. the highest 5% of all IQs? 2. the lowest 30% of the IQs? 3. the middle 80% of the IQs?
> Based on the model N(1152,84) describing Angus steer weights from Exercise 29 , what are the cutoff values for 1. the highest 10% of the weights? 2. the lowest 20% of the weights? 3. the middle 40% of the weights?
> Suppose we take logarithms of the CEO compensations in Exercise 47 . The histogram of log Compensation looks like this: with a mean of 1.07 and a standard deviation of 0.26. 1. According to the Normal model, what percent of CEOs would you expect to earn
> The Glassdoor Economic Research Blog published the compensation (in millions of dollars) for the CEOs of large companies. The distribution looks like this: The mean CEO compensation is $14.1M and the standard deviation is $11.32M. 1. According to the Nor
> A large philanthropic organization keeps records on the people who have contributed. In addition to keeping records of past giving, the organization buys demographic data on neighborhoods from the U.S. Census Bureau. Eighteen of these variables concern e
> NFL data from the 2015 football season reported the number of yards gained by each of the league 488 receivers: The mean is 274.73 yards, with a standard deviation of 327.32 yards. 1. According to the Normal model, what percent of receivers would you exp
> The mean of the 100 car speeds in Exercise 30 was 23.84 mph, with a standard deviation of 3.56 mph. 1. Using a Normal model, what values should border the middle 95% of all car speeds? 2. Here are some summary statistics. From your answer in part a, how
> Fifty-three men completed the men alpine downhill at the 2018 winter Olympics in PyeongChang. The gold medal winner finished in 100.25 seconds. Here are the times (in seconds) for all competitors. 1. The mean time was 103.883 seconds, with a standard dev
> For the car speed data in Exercise 30 , here are the histogram, boxplot, and Normal probability plot of the 100 readings. Do you think it is appropriate to apply a Normal model here? Explain.
> Owners of a new coffee shop tracked sales for the first 20 days and displayed the data in a scatterplot (by day). 1. Make a histogram of the daily sales since the shop has been in business. 2. State one fact that is obvious from the scatterplot, but not
> Later on, the forester in Exercise 39 shows you a histogram of the tree diameters he used in analyzing the woods that was for sale. Do you think he was justified in using a Normal model? Explain, citing some specific concerns.
> A company that manufactures rivets believes the shear strength (in pounds) is modeled by N(800,50). 1. Draw and label the Normal model. 2. Would it be safe to use these rivets in a situation requiring a shear strength of 750 pounds? Explain. 3. About wha
> A forester measured 27 of the trees in a large woods that is up for sale. He found a mean diameter of 10.4 inches and a standard deviation of 4.7 inches. Suppose that these trees provide an accurate description of the whole forest and that a Normal model
> Exercise 10 proposes modeling IQ scores with N(100,15). What IQ would you consider to be unusually high? Explain.
> In Exercise 29 , we suggested the model N(1152,84) for weights in pounds of yearling Angus steers. What weight would you consider to be unusually low for such an animal? Explain.
> A popular band on tour played a series of concerts in large venues. They always drew a large crowd, averaging 21,359 fans. While the band did not announce (and probably never calculated) the standard deviation, which of these values do you think is most
> A friend tells you about a recent study dealing with the number of years of teaching experience among current college professors. He remembers the mean but can’t recall whether the standard deviation was 6 months, 6 years, or 16 years. Tell him which one
> Suppose police set up radar surveillance on the Stanford street described in Exercise 30 . They handed out a large number of tickets to speeders going a mean of 28 mph, with a standard deviation of 2.4 mph, a maximum of 33 mph, and an IQR of 3.2 mph. Loc
> Suppose the auctioneer in Exercise 31 sold a herd of cattle whose minimum weight was 980 pounds, median was 1140 pounds, standard deviation 84 pounds, and IQR 102 pounds. They sold for 40 cents a pound, and the auctioneer took a $20 commission on each an
> For the car speed data in Exercise 30 , recall that the mean speed recorded was 23.84 mph, with a standard deviation of 3.56 mph. To see how many cars are speeding, John subtracts 20 mph from all speeds. 1. What is the mean speed now? What is the new sta
> A ceramics factory can fire eight large batches of pottery a day. Sometimes a few of the pieces break in the process. In order to understand the problem better, the factory records the number of broken pieces in each batch for 3 days and then creates the
> Beanstalk Clubs are social clubs for very tall people. To join, a man must be over 6²2³ tall, and a woman over 5²10³. The National Health Survey suggests that heights of adults may be Normally distributed, with mean heights of 69.1³ for men and 64.0³ for
> Recall that the beef cattle described in Exercise 29 had a mean weight of 1152 pounds, with a standard deviation of 84 pounds. 1. Cattle buyers hope that yearling Angus steers will weigh at least 1000 pounds. To see how much over (or under) that goal the
> In a study of dialysis, researchers found that of the three patients who were currently on dialysis, 67% had developed blindness and 33% had their toes amputated. What kind of display might be appropriate for these data? Explain.
> The boxplots display the heights (in inches) of 130 members of a choir by the part they sing. 1. It appears that the median height for sopranos is missing, but actually the median and the upper quartile are equal. How could that happen? 2. Write a few se
> Results of a 1996 American Medical Association report about the infant mortality rate for twins carried for the full term of a normal pregnancy are shown on the next page, broken down by the level of prenatal care the mother had received. 1. Is the overa
> Here are the prices (in cents per pound) of bananas reported from 15 markets surveyed by the U.S. Department of Agriculture. 1. Display these data with an appropriate graph. 2. Report appropriate summary statistics. 3. Write a few sentences about this di
> A student runs an experiment to test how different factors affect his reaction time while playing video games. He uses a specified race course with random hazards and times how long it takes him (in seconds) to finish 4 laps under a variety of experiment
> A student runs an experiment to test how different factors affect her bowling performance. She uses 3 levels for the weight of the ball (low, medium, and high) and 2 approaches (standing and walking), throwing 4 balls at each condition and choosing the c
> The Bureau of Transportation Statistics of the U.S. Department of Transportation reports statistics about airline performance. For September 2018 they report the following number of bags lost per 1000 passengers. Are the airlines roughly equal in their b
> The data on water hardness and mortality considered in Part VI Review Exercise 30 also included information on whether the town is north or south of Derby. When that information is coded in a variable as 1 for north of Derby and 0 for south of Derby, and
> In an attempt to control the growth of the herds of wild horses, managers sterilized some of the stallions. The variable Sterilized is coded 0 for herds and years in which no stallions were sterilized and 1 for herds and years in which some stallions wer
> A student runs an experiment to test four different clubs on her putting accuracy, recording the distance in centimeters from a small target that she places on the green. She randomizes the club used each time by drawing numbers out of a hat until each c
> John Beale of Stanford, California, recorded the speeds of cars driving past his house, where the speed limit read 20 mph. The mean of 100 readings was 23.84 mph, with a standard deviation of 3.56 mph. (He actually recorded every car for a two-month peri
> Students in two basic Spanish classes were required to learn 50 new vocabulary words. One group of 45 students received the list on Monday and studied the words all week. Statistics summarizing this group scores on Friday quiz are given. The other group
> A student runs an experiment to test four different grips on his football throwing distance, recording the distance in yards that he can throw the football using each grip. He randomizes the grip used each time by drawing numbers out of a hat until each
> A weight loss clinic advertises that its program of diet and exercise will allow clients to lose 10 pounds in one month. A local reporter investigating weight reduction gets permission to interview a randomly selected sample of clients who report the giv
> Using the same survey as in Exercise 27, the student examined the relationship between Athletic Participation and Weight. Here are the boxplots of Weight by Athletic Participation: 1. State the null hypothesis about the students (both numerically and in
> Using a Simple Random Sample, a student asked 200 students questions about their study and workout habits, and received 124 responses. One of the questions asked, Do you participate in intramural (IM) athletics, varsity athletics, or no athletics? while
> The Los Angeles Almanac website reports recent annual rainfall (in inches), as shown in the table. 1. Create a 90% confidence interval for the mean annual rainfall in LA. 2. If you wanted to estimate the mean annual rainfall with a margin of error of onl
> Brianna, a member of the track team, runs an experiment to test how different factors affect her javelin throw. She wants to know if the more expensive (premium) javelin is worth the extra price and is curious to know how much warming up helps her distan
> A student runs an experiment to test how different factors affect his score while playing video pinball. Here are the results of 16 runs of an experiment performed in random order. Factor Eyes has two levels: both open and right eye closed. Factor Tilt h
> A study published in the journal Environmental Health Perspectives examined the sex ratios of children born to workers exposed to dioxin in Russian pesticide factories. The data covered the years 1961 to 1988 in the city of Ufa, Bashkortostan, Russia. Of
> Developers of a new math curriculum called Accelerated Math compared performances of students taught by their system with control groups of students in the same schools who were taught using traditional instructional methods and materials. Statistics abo
> Using N(1152,84), the Normal model for weights of Angus steers in Exercise 13 , 1. How many standard deviations from the mean would a steer weighing 1000 pounds be? 2. Which would be more unusual, a steer weighing 1000 pounds or one weighing 1250 pounds?
> The New York Times reported the results of the 2003 NY Marathon by listing time brackets and the number of racers who finished within that bracket. Because the brackets are of different sizes, we look at the number of racers finishing per minute against
> Nick (see Exercise 19) designed a follow-up experiment to see if having Music or Television on would affect his typing speed. In particular, he’d like to know if he can type just as effectively with the music and/or the TV on while he t
> How accurate are pollsters in predicting the outcomes of Congressional elections? The table shows the actual number of Democrat seats in the House of Representatives and the number predicted by the Gallup organization for nonpresidential election years s
> For a class project, Nick M. designed and carried out an experiment to see if the room Temperature and the wearing of Gloves affected his typing speed. He ran each combination of hot and cold temperature and gloves on and off 8 times, recording the net n
> The National Vital Statistics Report provides information on live Births (per 1000 women), according to the age of the woman (in 5-year brackets Age used here is the midpoint of the bracket) and the Year from 1990 to 1999. The report isolates births to w