# Writing about descriptive statistics in stata

However, looking at these summary statistics is a good start investigating patterns in the data.

## How to interpret descriptive statistics in stata

The wizard layout should look like this. In general, female students have an average SAT score in this sample of The statistics available are listed in the help tabstat: Table The table command calculates and displays tables of statistics. The main advantage of writing a do-file is that you can always reuse most of it on different projects, with only a few tweaks; if you use Stata by point and click commands, you will be condemned to start from scratch every time. Indicates how close the data is to the mean. The tabstat command allows more flexibility in terms of the statistics presented and the format of the table. In this workshop, you will learn to use Stata to create basic summary statistics, cross-tabulations, and increasingly rich tables of summary statistics. The summarize command returns mean, standard deviation, minimum, maximum and frequency. Overall econ major students have an average SAT score of B7. The first part of the command tabulate will split your data according to a categorical variable here we will use sex.

Up to four variables may be specified in the byso with the three row, column, and supercolumn variables, seven-way tables may be displayed. The tabstat command allows more flexibility in terms of the statistics presented and the format of the table. First we look at the summary statistics for the whole sample, and then we look at the statistics for subsamples each province.

For example, if you wanted to look at patterns of daily fruit and vegetable consumption for men and women with different smoking habits, you could create a table for that: The result seems to show a certain pattern: smokers look like they eat less fruit and vegetables than non-smokers, and women seem to eat more fruit and vegetable than men, on average [3].

This workshop is designed to teach you syntax, rather than point and click commands. By age there are more students 19 years old in the sample than any other group.

The example is built the same way the tabulate example was.

## How to store summary statistics in stata

The statistics available are listed in the help tabstat: Table The table command calculates and displays tables of statistics. For example a female student with an econ major has an average SAT score of cell B5 in the picture while a male student also with an econ major has B6. The sample variance measures the dispersion of the data from the mean. For example, if you wanted to look at patterns of daily fruit and vegetable consumption for men and women with different smoking habits, you could create a table for that: The result seems to show a certain pattern: smokers look like they eat less fruit and vegetables than non-smokers, and women seem to eat more fruit and vegetable than men, on average [3]. Technically speaking, kurtosis focuses more on the tails for the distribution than the peak, so positive kurtosis indicates too few cases in the tails or a tall distribution leptokurtic , negative kurtosis too many cases in the tails or a flat distribution platykurtic. Tabstat The tabstat command displays summary statistics for a series of numeric variables in one table, possibly broken down on conditioned by another variable. Overall econ major students have an average SAT score of B7. To get the median you have to order the data from lowest to highest. The tabstat command allows more flexibility in terms of the statistics presented and the format of the table.

According to Peter Westfall, that view is not quite correct. This is a crosstabulation between gender and major.

There are many good interenet sources for supplementary readings on creating summary statistics in Stata. In these examples we have focused on splitting the sample by province, but any categorical variable can be used.

If the number of cases is odd the median is the single value, for an even number of cases the median is the average of the two numbers in the middle. Without the by option, tabstat is a useful alternative to summarize because it allows you to specify the list of statistics to be displayed.

### Writing about descriptive statistics in stata

In subsequent examples, we will look at men and women, smokers and non-smokers, physically active or not. Yes, we can. The summarize command returns mean, standard deviation, minimum, maximum and frequency. The main advantage of writing a do-file is that you can always reuse most of it on different projects, with only a few tweaks; if you use Stata by point and click commands, you will be condemned to start from scratch every time. According to Peter Westfall, that view is not quite correct. The wizard layout should look like this. The current view of kurtosis argues that it measures the peak of a distribution. The way to read this table is simple: a female respondent who does not engage in more than 15 minutes of daily activity and has never smoked a whole cigarette eats on average 5. The first part of the command tabulate will split your data according to a categorical variable here we will use sex. You can also use the tabulate, summarize command to create a quick four-way summary statistics table. The answer will vary based on your level of sophistication, your research question, or your supervisor research agenda… For some, tabulate, summarize and maybe tabulate, summarize will be more than enough. Each cell represents the average SAT score for a student according to gender and major. For others, tabstat and table might be very useful tools indeed.

Rated 6/10
based on 33 review

Download