Skip to main content icon/video/no-internet

In gathering data about a group of individuals or items, rather than conducting a full census, very often a sample is taken from a larger population in order to save time and resources. These samples can be classified into two major groups describing the way in which they were chosen: probability samples and nonprob-ability samples. Both types of samples are made up of a basic unit called an individual, observation, or elementary unit. These are the units whose characteristics are to be measured from a population. In probability samples, each member of the population has a known nonzero probability of being chosen into the sample. By a random process, elements are selected and receive a known probability of being included in the sample; this is not the case in nonprobability sampling.

In order to estimate some quantity of interest with a desired precision from a sample, or to contrast characteristics between groups from a sample, one must rely on knowing to whom or to what population one is referring. Well-designed probability samples ensure that the reference population is known and that selection bias is minimized. The best samples are simply smaller versions of the larger population. The process by which a sample of individuals or items is identified will affect the reliability, validity, and ultimately the accuracy of the estimates and inferences made.

Underlying Concepts

The concepts behind probability sampling underlie statistical theory. From the finite population of N elements, all possible samples of size n are identified. For example, if the population consisted of 6 elements, and samples of size 2 are to be chosen, there would be 15,

, possible samples to consider. In theory, prior to selection, the probability of each of these samples being chosen is known. Therefore, the selection probability of each individual is also known. Summing the probabilities of all samples containing an individual element will compute the individual's probability of appearing in the sample. By knowing the probability of selecting each unit, a statistical weight can be assigned by which population estimates can be calculated. The statistical weight is defined as the inverse of the probability of selection into the sample, allowing each sampled unit to represent a certain number of units in the population. Most often, the goal of a probability sample is to estimate certain quantities in the population using these statistical weights.

Reliability And Validity

From a probability sample, the quantities that are estimated, such as population totals, means, proportions, and variances, have certain properties that can be evaluated. For instance, over repeated sampling, estimators from the probability sample can be evaluated for how reproducible or reliable they are (variance) and if, on average, the estimates from the sample are similar to the true value of the population quantity (unbiasedness or validity). Combining the ideas of reliability and validity, the accuracy, or how far away on average the estimator is from the true value (mean squared error), can also be evaluated on the sample estimator.

None of these desirable properties can be determined from estimates derived from nonprobability samples. Nonprobability samples are used in many unscientific surveys, market research, and public opinion polls, often because they are easier and less expensive to conduct. These types of surveys include purposive or deliberate, quota, and snowball samples. As an example, imagine that interviewers are attempting to question shoppers as to their political views at a local supermarket in order to describe future poll results for a city. In quota sampling, the interviewer may be asked to "find" a certain number of individuals in various demographic groups, such young women, older women, black men, and white men. The individuals that are found by the interviewer may be of only one political leaning or of one socioeconomic group simply because they are easy to find and shop at the particular market. Without a systematic plan for sampling, the decision about whom to interview is left up to the interviewer, likely creating bias in the sample. In certain circumstances, such as in populations that are hard to reach, probability samples may not be feasible. Thus, as long as they are not used to make inference to a larger population, some non-probability samples are useful.

...

  • Loading...
locked icon

Sign in to access this content

Get a 30 day FREE TRIAL

  • Watch videos from a variety of sources bringing classroom topics to life
  • Read modern, diverse business cases
  • Explore hundreds of books and reference titles

Sage Recommends

We found other relevant content for you on other Sage platforms.

Loading