Data dictionary: CDC Diabetes Health Indicators

The Diabetes Health Indicators Dataset contains healthcare statistics and lifestyle survey information about people in general along with their diagnosis of diabetes. The 35 features consist of some demographics, lab test results, and answers to survey questions for each patient. The target variable for classification is whether a patient has diabetes, is pre-diabetic, or healthy.

Dataset contains three files:

Each file contains 23 variables

Variable NameRoleTypeDemographicDescriptionUnitsMissing Values
IDIDIntegerPatient IDno
Diabetes_binaryTargetBinary0 = no diabetes 1 = prediabetes or diabetesno
HighBPFeatureBinary0 = no high BP 1 = high BPno
HighCholFeatureBinary0 = no high cholesterol 1 = high cholesterolno
CholCheckFeatureBinary0 = no cholesterol check in 5 years 1 = yes cholesterol check in 5 yearsno
BMIFeatureIntegerBody Mass Indexno
SmokerFeatureBinaryHave you smoked at least 100 cigarettes in your entire life? [Note: 5 packs = 100 cigarettes] 0 = no 1 = yesno
StrokeFeatureBinary(Ever told) you had a stroke. 0 = no 1 = yesno
HeartDiseaseorAttackFeatureBinarycoronary heart disease (CHD) or myocardial infarction (MI) 0 = no 1 = yesno
PhysActivityFeatureBinaryphysical activity in past 30 days - not including job 0 = no 1 = yesno
FruitsFeatureBinaryConsume Fruit 1 or more times per day 0 = no 1 = yesno
VeggiesFeatureBinaryConsume Vegetables 1 or more times per day 0 = no 1 = yesno
HvyAlcoholConsumpFeatureBinaryHeavy drinkers (adult men having more than 14 drinks per week and adult women having more than 7 drinks per week) 0 = no 1 = yesno
AnyHealthcareFeatureBinaryHave any kind of health care coverage, including health insurance, prepaid plans such as HMO, etc. 0 = no 1 = yesno
NoDocbcCostFeatureBinaryWas there a time in the past 12 months when you needed to see a doctor but could not because of cost? 0 = no 1 = yesno
GenHlthFeatureIntegerWould you say that in general your health is: scale 1-5 1 = excellent 2 = very good 3 = good 4 = fair 5 = poorno
MentHlthFeatureIntegerNow thinking about your mental health, which includes stress, depression, and problems with emotions, for how many days during the past 30 days was your mental health not good? scale 1-30 daysno
PhysHlthFeatureIntegerNow thinking about your physical health, which includes physical illness and injury, for how many days during the past 30 days was your physical health not good? scale 1-30 daysno
DiffWalkFeatureBinaryDo you have serious difficulty walking or climbing stairs? 0 = no 1 = yesno
SexFeatureBinarySex0 = female 1 = maleno
AgeFeatureIntegerAge13-level age category (_AGEG5YR see codebook) 1 = 18-24 9 = 60-64 13 = 80 or olderno
EducationFeatureIntegerEducation LevelEducation level (EDUCA see codebook) scale 1-6 1 = Never attended school or only kindergarten 2 = Grades 1 through 8 (Elementary) 3 = Grades 9 through 11 (Some high school) 4 = Grade 12 or GED (High school graduate) 5 = College 1 year to 3 years (Some college or technical school) 6 = College 4 years or more (College graduate)no
IncomeFeatureIntegerIncomeIncome scale (INCOME2 see codebook) scale 1-8 1 = less than $10,000 5 = less than $35,000 8 = $75,000 or moreno

Dataset is available at https://archive.ics.uci.edu/dataset/891/cdc+diabetes+health+indicators.