Chi Square in Stata: The Basics

Dec 18

Introduction

The chi square statistic and its underlying p value are most often calculated in order to determine whether some observed distribution is normal. In this post, we’ll show you how to run and interpret a chi square analysis in Stata.

Load the Dataset

Let’s load Stata’s prebuilt auto dataset using the following code:

sysuse auto

Two of the variables in the dataset are as follows:

rep78: How many times the car was repaired
foreign: Whether or not the car is foreign

Create a Table and Hypothesis

Try the following code to visualize your data before running the Chi square:

tab2 rep78 foreign

You get the following:

You want to know whether there is an effect on the manufacturing location of a car (domestic vs. foreign) on the number of times it was repaired.

Run and Interpret the Chi Square

Now you can enter the following code:

tab rep78 foreign, chi2

You get the following:

If the p value for the Chi square is less than .05, you could justifiably conclude that there is some relationship between the manufacturing location of a car and the number of times it was repaired. Here, the p value is < .001, so there is indeed a relationship between the manufacturing location of a car and the number of times it was repaired.

BridgeText can help you with all of your statistical analysis needs.

Have any questions?

Our support team is ready to answer your questions.

Help Center FAQ