Module 1: Sampling and Data
Answers to Selected Exercises
Barbara Illowsky & OpenStax et al.
1. AIDS patients.
3. The average length of time (in months) AIDS patients live after treatment.
5. X = the length of time (in months) AIDS patients live after treatment
7.
- all children who take ski or snowboard lessons
- a group of these children
- the population mean age of children who take their first snowboard lesson
- the sample mean age of children who take their first snowboard lesson
- X = the age of one child who takes his or her first ski or snowboard lesson
- values for X, such as 3, 7, and so on
9.
- the clients of the insurance companies
- a group of the clients
- the mean health costs of the clients
- the mean health costs of the sample
- X = the health costs of one client
- values for X, such as 34, 9, 82, and so on
11.
- all the clients of this counselor
- a group of clients of this marriage counselor
- the proportion of all her clients who stay married
- the proportion of the sample of the counselor’s clients who stay married
- X = the number of couples who stay married
- yes, no
13.
- all people (maybe in a certain geographic area, such as the United States)
- a group of the people
- the proportion of all people who will buy the product
- the proportion of the sample who will buy the product
- X = the number of people who will buy it
- buy, not buy
15. a
17. b
19. a
21.
- 0.5242
- 0.03%
- 6.86%
- 823,088823,856
- quantitative discrete
- quantitative continuous
- In both years, underwater earthquakes produced massive tsunamis.
23. systematic
25. simple random
27. values for X, such as 3, 4, 11, and so on
29. No, we do not have enough information to make such a claim.
31. Take a simple random sample from each group. One way is by assigning a number to each patient and using a random number generator to randomly select patients.
33. This would be convenience sampling and is not random.
35. Yes, the sample size of 150 would be large enough to reflect a population of one school.
37. Even though the specific data support each researcher’s conclusions, the different results suggest that more data need to be collected before the researchers can reach a conclusion.
39. There is not enough information given to judge if either one is correct or incorrect.
41. The software program seems to work because the second study shows that more patients improve while using the software than not. Even though the difference is not as large as that in the first study, the results from the second study are likely more reliable and still show improvement.
43. Yes, because we cannot tell if the improvement was due to the software or the exercise; the data is confounded, and a reliable conclusion cannot be drawn. New studies should be performed.
45. No, even though the sample is large enough, the fact that the sample consists of volunteers makes it a self-selected sample, which is not reliable.
47. No, even though the sample is a large portion of the population, two responses are not enough to justify any conclusions. Because the population is so small, it would be better to include everyone in the population to get the most accurate data.
49. quantitative discrete, 150
51. qualitative, Oakland A’s
53. quantitative discrete, 11,234 students
55. qualitative, Crest
57. quantitative continuous, 47.3 years
59. b
61.
- The survey was conducted using six similar flights.
The survey would not be a true representation of the entire population of air travelers.
Conducting the survey on a holiday weekend will not produce representative results.
- Conduct the survey during different times of the year.
Conduct the survey using flights to and from various locations.
Conduct the survey on different days of the week.
63. Answers will vary. Sample Answer: You could use a systematic sampling method. Stop the tenth person as they leave one of the buildings on campus at 9:50 in the morning. Then stop the tenth person as they leave a different building on campus at 1:50 in the afternoon.
65. Answers will vary. Sample Answer: Many people will not respond to mail surveys. If they do respond to the surveys, you can’t be sure who is responding. In addition, mailing lists can be incomplete.
67. b
69. convenience cluster stratified systematic simple random
71.
- qualitative
- quantitative discrete
- quantitative discrete
- qualitative
73. Causality: The fact that two variables are related does not guarantee that one variable is influencing the other. We cannot assume that crime rate impacts education level or that education level impacts crime rate.
Confounding: There are many factors that define a community other than education level and crime rate. Communities with high crime rates and high education levels may have other lurking variables that distinguish them from communities with lower crime rates and lower education levels. Because we cannot isolate these variables of interest, we cannot draw valid conclusions about the connection between education and crime. Possible lurking variables include police expenditures, unemployment levels, region, average age, and size.
75.
- Possible reasons: increased use of caller id, decreased use of landlines, increased use of private numbers, voice mail, privacy managers, hectic nature of personal schedules, decreased willingness to be interviewed
- When a large number of people refuse to participate, then the sample may not have the same characteristics of the population. Perhaps the majority of people willing to participate are doing so because they feel strongly about the subject of the survey.
77.
- ordinal
- interval
- nominal
- nominal
- ratio
- ordinal
- nominal
- interval
- ratio
- interval
- ratio
- ordinal
79.
| # Flossing per Week | Frequency | Relative Frequency | Cumulative Relative Frequency |
|---|---|---|---|
| 0 | 27 | 0.4500 | 0.4500 |
| 1 | 18 | 0.3000 | 0.7500 |
| 3 | 11 | 0.1833 | 0.9333 |
| 6 | 3 | 0.0500 | 0.9833 |
| 7 | 1 | 0.0167 | 1 |
5.00%
93.33%
81. The sum of the travel times is 1,173.1. Divide the sum by 50 to calculate the mean value: 23.462. Because each state’s travel time was measured to the nearest tenth, round this calculation to the nearest hundredth: 23.46.
83. b
85.
- Inmates may not feel comfortable refusing participation, or may feel obligated to take advantage of the promised benefits. They may not feel truly free to refuse participation.
- Parents can provide consent on behalf of their children, but children are not competent to provide consent for themselves.
- All risks and benefits must be clearly outlined. Study participants must be informed of relevant aspects of the study in order to give appropriate consent.
87.
Explanatory variable: amount of sleep
Response variable: performance measured in assigned tasks
Treatments: normal sleep and 27 hours of total sleep deprivation
Experimental Units: 19 professional drivers
Lurking variables: none – all drivers participated in both treatments
Random assignment: treatments were assigned in random order; this eliminated the effect of any “learning” that may take place during the first experimental session
Control/Placebo: completing the experimental session under normal sleep conditions
Blinding: researchers evaluating subjects’ performance must not know which treatment is being applied at the time
89. You cannot assume that the numbers of complaints reflect the quality of the airlines. The airlines shown with the greatest number of complaints are the ones with the most passengers. You must consider the appropriateness of methods for presenting data; in this case displaying totals is misleading