Conducting a Study and Statistics Planning Study Guide (page 2)

Updated on Oct 5, 2011

Analyzing the Data

Before data are collected, the analysis should be outlined. With the analysis and potential conclusions in mind, the research question should be reviewed to confirm that the planned study has the potential of answering the question. Too often, studies are conducted before the researchers realize they have no idea how to analyze the data or that the collected data cannot be used to answer the research question. The statistician should verify that the data collection protocol was properly followed. Each analysis should begin by summarizing the data graphically and numerically. Then the appropriate statistical analyses should be conducted.

Answering the Question

Through interpretation of the analysis results, we learn what conclusions can be drawn from the study. The aim is to answer the research question using the conclusions drawn from the study. Sometimes, we are unable to answer the question or are able to only partially answer it. At the conclusion of any study, the research team should reflect on what was learned from the study and use that to direct future research.

Selecting the Sample

Most of the inferential methods introduced in the text are based on random selection. The simplest form of random selection is simple random sampling. A simple random sample of size n is one drawn in such a manner that every possible sample of size n has an equal chance of being chosen.

It is important to realize that, if every unit in the population has an equal chance of being included in a sample, the sample may still not be a simple random one. To see this, suppose that a company has two divisions, A and B. There are 700 employees in division A and 300 in division B. The management decides to take a sample of 100 employees. To do this, they write each employee's name on a chip and put the chip in bowl A or B, depending on whether the employee is in division A or division B, respectively. The chips are thoroughly mixed in each bowl. Seventy chips are drawn from bowl A and 30 chips are drawn from bowl B, and the employees whose names are on the selected chips comprise the sample. Each employee has a 1 in 100 chance of being included in the sample; however, this is not a random sample.

Only samples with 70 division-A employees and 30 division-B employees are possible; it would not be possible to have, for instance, a sample with 50 division-A employees and 50 division-B employees. Because not all samples of size 100 are equally likely to be selected, this is not a random sample. In Lesson 14, we will discuss other methods of random selection.

Care must be taken in selecting a sample so that it is not biased. Bias is the tendency for a sample to differ in some systematic manner from the population of interest. Some common sources of bias are selection bias, measurement bias, response bias, and nonresponse bias. Selection bias occurs when a portion of the population is systematically excluded from the sample. For example, suppose a company wants to estimate the percentage of adults in a community who smoke. If a telephone poll is conducted, adults without telephones would be excluded from the sample, and selection bias would be introduced.

Measurement bias, or response bias, occurs when the method of observation tends to produce values that are consistently above or below the true value. For example, if a scale is inaccurately calibrated, observed weights could be consistently greater than true weights, resulting in a measurement bias. The way in which a survey question is worded could influence the response, leading to bias. For example, suppose that a survey question was stated as follows: "Many people think driving motorcycles is dangerous. Do you agree?" When stated in this way, the proportion of those agreeing will tend to be larger than would have been the case if the question had been phrased in a neutral way. The tendency of people to lie when asked about illegal behavior or unpopular beliefs, characteristics of the interviewer, and the organization taking the poll could be other sources of response bias.

Often in surveys, some people refuse to respond. Nonresponse bias is present if those who respond differ in important ways from those who do not participate in the survey. In a survey of gardeners, those with smaller gardens were much more likely to respond than those with large gardens. Because some of the questions were related to the size of the garden, this nonresponse resulted in response bias.

View Full Article
Add your own comment