Entry
Reader's guide
Entries A-Z
Subject index
Dummy Coding
Dummy coding, also known as indicator coding, provides a means for researchers to represent a categorical variable as a set of independent quantitative variables. The resulting dummy variables take on values of 0 and 1 and can be used as predictors in regression analysis. Given a categorical variable that can take on k values, it is possible to create k − 1 dummy variables without any loss of information. Dummy variables are often included in regression models to estimate the effects of categorical variables such as race, marital status, diagnostic group, and treatment setting.
When constructing a set of dummy variables, one level of the original categorical variable is selected as a reference category and is excluded from analysis. Each remaining level becomes a single dummy variable on which observations receive a value of 1 if they fall into the category and 0 if they do not. For example, given a four-level psychiatric diagnosis variable (bipolar disorder, major depressive disorder, other mood disorder, no mood disorder), the “no mood disorder” group might be selected as the reference category. Three dummy variables would then be constructed: “BIPOLAR” would be scored 1 for persons with this diagnosis and 0 for those in the other three diagnostic groups; similarly, “MDD” would be scored 1 for persons with a diagnosis of major depressive disorder and 0 for those with another diagnosis; and “OTHMOOD” would be scored 1 for persons with a diagnosis of an “other mood disorder” and 0 for those diagnosed with bipolar disorder, major depressive disorder, or no mood disorder. Each dummy variable would have 1 df.
In regression analyses, the coefficients for each of the k − 1 dummy variables quantify the estimated effect on the outcome variable of membership in the group in question versus membership in the reference group. For example, in a logistic regression analysis predicting diagnosis of bloodborne infection, a coefficient (b) for the “BIPOLAR” variable of 1.12 would represent the difference in the log-odds of infection between persons with bipolar disorder and those with no mood disorder. Expressed in terms of an odds ratio (eb = 3.06), persons with bipolar disorder would have slightly more than three times the odds of infection compared with persons in the reference group. Dummy variables are used in a wide variety of regression analyses, including, but not limited to, ordinary least-squares, logistic, Cox, and Poisson regression.
The choice of reference category should be made based on substantive scientific considerations, and the category should be selected to support meaningful contrasts. In the case of race, this often results in the selection of “White” as the reference category. In experimental studies, the control group is often an appropriate choice. Miscellaneous or “other” categories containing a heterogeneous mix of observations (e.g., “other race”) usually do not provide researchers with the ability to make useful inferences and are therefore rarely suitable as reference categories. To ensure stable estimates, the number of observations in the reference category should not be too small relative to the size of the other categories.
...
- Acculturation
- Bioterrorism
- Community-Based Participatory Research
- Community Health
- Community Trial
- Cultural Sensitivity
- Demography
- Determinants of Health Model
- Ecological Fallacy
- Epidemiology in Developing Countries
- EuroQoL EQ-5D Questionnaire
- Functional Status
- Genocide
- Geographical and Social Influences on Health
- Health, Definitions of
- Health Behavior
- Health Belief Model
- Health Communication
- Health Communication in Developing Countries
- Health Disparities
- Health Literacy
- Life Course Approach
- Locus of Control
- Medical Anthropology
- Network Analysis
- Participatory Action Research
- Poverty and Health
- Quality of Life, Quantification of
- Quality of Well-Being Scale (QWB)
- Race and Ethnicity, Measurement Issues With
- Race Bridging
- Rural Health Issues
- Self-Efficacy
- SF-36® Health Survey
- Social Capital and Health
- Social-Cognitive Theory
- Social Epidemiology
- Social Hierarchy and Health
- Social Marketing
- Socioeconomic Classification
- Spirituality and Health
- Targeting and Tailoring
- Theory of Planned Behavior
- Transtheoretical Model
- Urban Health Issues
- Urban Sprawl
- Applied Epidemiology
- Chronic Disease Epidemiology
- Clinical Epidemiology
- Descriptive and Analytic Epidemiology
- Disability Epidemiology
- Disaster Epidemiology
- Eco-Epidemiology
- Environmental and Occupational Epidemiology
- Field Epidemiology
- Genetic Epidemiology
- Injury Epidemiology
- Maternal and Child Health Epidemiology
- Molecular Epidemiology
- Neuroepidemiology
- Nutritional Epidemiology
- Pharmacoepidemiology
- Psychiatric Epidemiology
- Reproductive Epidemiology
- Social Epidemiology
- Veterinary Epidemiology
- Alzheimer's Disease
- Anxiety Disorders
- Arthritis
- Asthma
- Autism
- Avian Flu
- Bipolar Disorder
- Bloodborne Diseases
- Cancer
- Cardiovascular Disease
- Diabetes
- Foodborne Diseases
- Gulf War Syndrome
- Hepatitis
- HIV/AIDS
- Hypertension
- Influenza
- Insect-Borne Disease
- Malaria
- Measles
- Oral Health
- Osteoporosis
- Parasitic Diseases
- Plague
- Polio
- Post-Traumatic Stress Disorder
- Schizophrenia
- Severe Acute Respiratory Syndrome (SARS)
- Sexually Transmitted Diseases
- Sick Building Syndrome
- Sleep Disorders
- Smallpox
- Suicide
- Toxic Shock Syndrome
- Tuberculosis
- Vector-Borne Disease
- Vehicle-Related Injuries
- Vitamin Deficiency Diseases
- Waterborne Diseases
- Yellow Fever
- Zoonotic Disease
- Attack Rate
- Attributable Fractions
- Biomarkers
- Birth Cohort Analysis
- Birth Defects
- Case-Cohort Studies
- Case Definition
- Case-Fatality Rate
- Case Reports and Case Series
- Cohort Effects
- Community Trial
- Competencies in Applied Epidemiology for Public Health Agencies
- Cumulative Incidence
- Direct Standardization
- Disease Eradication
- Effectiveness
- Effect Modification and Interaction
- Efficacy
- Emerging Infections
- Epidemic
- Etiology of Disease
- Exposure Assessment
- Fertility, Measures of
- Fetal Death, Measures of
- Gestational Age
- Health, Definitions of
- Herd Immunity
- Hill's Considerations for Causal Inference
- Incidence
- Indirect Standardization
- Koch's Postulates
- Life Course Approach
- Life Expectancy
- Life Tables
- Malnutrition, Measurement of
- Mediating Variable
- Migrant Studies
- Mortality Rates
- Natural Experiment
- Notifiable Disease
- Outbreak Investigation
- Population Pyramid
- Preclinical Phase of Disease
- Preterm Birth
- Prevalence
- Prevention: Primary, Secondary, and Tertiary
- Public Health Surveillance
- Qualitative Methods in Epidemiology
- Quarantine and Isolation
- Screening
- Sensitivity and Specificity
- Sentinel Health Event
- Syndemics
- Administrative Data
- American Cancer Society Cohort Studies
- Behavioral Risk Factor Surveillance System
- Biomedical Informatics
- Birth Certificate
- Cancer Registries
- Death Certificate
- Framingham Heart Study
- Global Burden of Disease Project
- Harvard Six Cities Study
- Healthcare Cost and Utilization Project
- Health Plan Employer Data and Information Set
- Healthy People 2010
- Honolulu Heart Program
- Illicit Drug Use, Acquiring Information on
- Medical Expenditure Panel Survey
- Monitoring the Future Survey
- National Ambulatory Medical Care Survey
- National Death Index
- National Health and Nutrition Examination Survey
- National Health Care Survey
- National Health Interview Survey
- National Immunization Survey
- National Maternal and Infant Health Survey
- National Mortality Followback Survey
- National Survey of Family Growth
- Physicians' Health Study
- Pregnancy Risk Assessment and Monitoring System
- Relational Database
- Rochester Epidemiology Project
- Sampling Techniques
- Secondary Data
- Spreadsheet
- Youth Risk Behavior Surveillance System
- Epidemiology in Developing Countries
- Ethics in Health Care
- Ethics in Human Subjects Research
- Ethics in Public Health
- Evidence, Legal Admissibility of Scientific
- Genocide
- Harm Reduction
- Health, Definitions of
- Health Disparities
- Informed Consent
- Institutional Review Board
- Tuskegee Study
- War
- Association, Genetic
- Chromosome
- Epigenetics
- Family Studies in Genetics
- Gene
- Gene-Environment Interaction
- Genetic Counseling
- Genetic Disorders
- Genetic Epidemiology
- Genetic Markers
- Genomics
- Genotype
- Hardy-Weinberg Law
- Heritability
- Human Genome Project
- Icelandic Genetics Database
- Linkage Analysis
- Molecular Epidemiology
- Multifactorial Inheritance
- Mutation
- Newborn Screening Programs
- Phenotype
- Teratogen
- Twin Studies
- Biomedical Informatics
- EuroQoL EQ-5D Questionnaire
- Evidence-Based Medicine
- Formulary, Drug
- Functional Status
- Health Care Delivery
- Health Care Services Utilization
- Health Economics
- International Classification of Diseases
- International Classification of Functioning, Disability, and Health
- Managed Care
- Medicaid
- Medicare
- Partner Notification
- Quality of Life, Quantification of
- Quality of Well-Being Scale (QWB)
- SF-36® Health Survey
- Agent Orange
- Alcohol Use
- Allergen
- Asbestos
- Bioterrorism
- Child Abuse
- Cholesterol
- Circumcision, Male
- Diabetes
- Drug Abuse and Dependence, Epidemiology of
- Eating Disorders
- Emerging Infections
- Escherichia coli
- Firearms
- Foodborne Diseases
- Harm Reduction
- Hormone Replacement Therapy
- Intimate Partner Violence
- Lead
- Love Canal
- Malnutrition, Measurement of
- Mercury
- Obesity
- Oral Contraceptives
- Pain
- Physical Activity and Health
- Pollution
- Poverty and Health
- Radiation
- Sexual Risk Behavior
- Sick Building Syndrome
- Social Capital and Health
- Social Hierarchy and Health
- Socioeconomic Classification
- Spirituality and Health
- Stress
- Teratogen
- Thalidomide
- Tobacco
- Urban Health Issues
- Urban Sprawl
- Vehicle-Related Injuries
- Violence as a Public Health Issue
- Vitamin Deficiency Diseases
- War
- Waterborne Diseases
- Zoonotic Disease
- Budd, William
- Doll, Richard
- Ehrlich, Paul
- Epidemiology, History of
- Eugenics
- Farr, William
- Frost, Wade Hampton
- Genocide
- Goldberger, Joseph
- Graunt, John
- Hamilton, Alice
- Hill, Austin Bradford
- Jenner, Edward
- Keys, Ancel
- Koch, Robert
- Lind, James
- Lister, Joseph
- Nightingale, Florence
- Pasteur, Louis
- Public Health, History of
- Reed, Walter
- Ricketts, Howard
- Rush, Benjamin
- Snow, John
- Tukey, John
- Tuskegee Study
- American College of Epidemiology
- American Public Health Association
- Association of Schools of Public Health
- Centers for Disease Control and Prevention
- Council of State and Territorial Epidemiologists
- European Public Health Alliance
- European Union Public Health Programs
- Food and Drug Administration
- Governmental Role in Public Health
- Healthy People 2010
- Institutional Review Board
- Journals, Epidemiological
- Journals, Public Health
- National Center for Health Statistics
- National Institutes of Health
- Pan American Health Organization
- Peer Review Process
- Publication Bias
- Public Health Agency of Canada
- Society for Epidemiologic Research
- Surgeon General, U.S.
- United Nations Children's Fund
- U.S. Public Health Service
- World Health Organization
- Allergen
- Apgar Score
- Barker Hypothesis
- Birth Defects
- Body Mass Index (BMI)
- Carcinogen
- Case Reports and Case Series
- Clinical Epidemiology
- Clinical Trials
- Community Health
- Comorbidity
- Complementary and Alternative Medicine
- Effectiveness
- Efficacy
- Emerging Infections
- Escherichia coli
- Etiology of Disease
- Evidence-Based Medicine
- Gestational Age
- Intent-to-Treat Analysis
- International Classification of Diseases
- International Classification of Functioning, Disability, and Health
- Latency and Incubation Periods
- Life Course Approach
- Malnutrition, Measurement of
- Medical Anthropology
- Organ Donation
- Pain
- Placebo Effect
- Preclinical Phase of Disease
- Preterm Birth
- Public Health Nursing
- Quarantine and Isolation
- Screening
- Vaccination
- African American Health Issues
- Aging, Epidemiology of
- American Indian Health Issues
- Asian American/Pacific Islander Health Issues
- Breastfeeding
- Child and Adolescent Health
- Epidemiology in Developing Countries
- Hormone Replacement Therapy
- Immigrant and Refugee Health Issues
- Latino Health Issues
- Maternal and Child Health Epidemiology
- Men's Health Issues
- Oral Contraceptives
- Race and Ethnicity, Measurement Issues With
- Race Bridging
- Rural Health Issues
- Sexual Minorities, Health Issues of
- Urban Health Issues
- Women's Health Issues
- Additive and Multiplicative Models
- Analysis of Covariance
- Analysis of Variance
- Bar Chart
- Bayesian Approach to Statistics
- Bayes's Theorem
- Bias
- Binomial Variable
- Birth Cohort Analysis
- Box-and-Whisker Plot
- Capture-Recapture Method
- Categorical Data, Analysis of
- Causal Diagrams
- Causation and Causal Inference
- Censored Data
- Central Limit Theorem
- Chi-Square Test
- Classification and Regression Tree Models
- Cluster Analysis
- Coefficient of Determination
- Cohort Effects
- Collinearity
- Community-Based Participatory Research
- Community Trial
- Confidence Interval
- Confounding
- Control Group
- Control Variable
- Convenience Sample
- Cox Model
- Critical Value
- Cumulative Incidence
- Data Management
- Data Transformations
- Decision Analysis
- Degrees of Freedom
- Dependent and Independent Variables
- Diffusion of Innovations
- Discriminant Analysis
- Dose-Response Relationship
- Doubling Time
- Dummy Coding
- Dummy Variable
- Ecological Fallacy
- Economic Evaluation
- Effect Modification and Interaction
- Factor Analysis
- Fisher's Exact Test
- F Test
- Geographical and Spatial Analysis
- Graphical Presentation of Data
- Halo Effect
- Hawthorne Effect
- Hazard Rate
- Healthy Worker Effect
- Hill's Considerations for Causal Inference
- Histogram
- Hypothesis Testing
- Inferential and Descriptive Statistics
- Intent-to-Treat Analysis
- Internet Data Collection
- Interquartile Range
- Interrater Reliability
- Intervention Studies
- Interview Techniques
- Kaplan-Meier Method
- Kappa
- Kurtosis
- Latent Class Models
- Life Tables
- Likelihood Ratio
- Likert Scale
- Logistic Regression
- Log-Rank Test
- Longitudinal Research Design
- Matching
- Measurement
- Measures of Association
- Measures of Central Tendency
- Measures of Variability
- Meta-Analysis
- Missing Data Methods
- Multilevel Modeling
- Multiple Comparison Procedures
- Multivariate Analysis of Variance
- Natural Experiment
- Network Analysis
- Nonparametric Statistics
- Normal Distribution
- Null and Alternative Hypotheses
- Observational Studies
- Overmatching
- Panel Data
- Participatory Action Research
- Pearson Correlation Coefficient
- Percentiles
- Person-Time Units
- Pie Chart
- Placebo Effect
- Point Estimate
- Probability Sample
- Program Evaluation
- Propensity Score
- Proportion
- p Value
- Qualitative Methods in Epidemiology
- Quasi Experiments
- Questionnaire Design
- Race Bridging
- Random-Digit Dialing
- Randomization
- Random Variable
- Rate
- Ratio
- Receiver Operating Characteristic (ROC) Curve
- Regression
- Relational Database
- Reliability
- Response Rate
- Robust Statistics
- Sample Size Calculations and Statistical Power
- Sampling Distribution
- Sampling Techniques
- Scatterplot
- Secondary Data
- Sensitivity and Specificity
- Sequential Analysis
- Simpson's Paradox
- Skewness
- Spreadsheet
- Stem-and-Leaf Plot
- Stratified Methods
- Structural Equation Modeling
- Study Design
- Survival Analysis
- Target Population
- Time Series
- Type I and Type II Errors
- Unit of Analysis
- Validity
- Volunteer Effect
- Z Score
- Loading...
Get a 30 day FREE TRIAL
-
Watch videos from a variety of sources bringing classroom topics to life
-
Read modern, diverse business cases
-
Explore hundreds of books and reference titles
Sage Recommends
We found other relevant content for you on other Sage platforms.
Have you created a personal profile? Login or create a profile so that you can save clips, playlists and searches