Descriptive Statistics
keywords: ALFA, summary statistics, descriptive statistics, groupwise
OUTLINE
1. Basic statistics
You can find usual statistics such as mean, median and standard deviation of any of the arrays by just giving the command name and the array name. For example, you can compute mean of an array as
load tumor data
mean of id
Alfa will figure out what you are trying to do as long as the instructions are unambiguous
If you ask for the mean of non-numeric data, Alfa will fail to execute the command. All the statistics commands can be listed using list statistics commands.
2. Summary Statistics
Alfa also provides a summary
command that computes the basic statistics of a data file.
This can either take a bunch of arrays or the complete dataframe/dataset as an input. The summary command calculates the basic statistics for each of the variables provided and displays the result in a tabular format.
Example:
load tumor data
summary statistics of tumor data
Alternatively, you can also provide the following command:
summary statistics of area perimeter and texture of the tumor data
3. Groupwise Statistics
Groupwise statistics are useful to look at statistics of a variable based on a certain reference. For example, I want to look at the tumor area statistics for benign and malignant tumors, I can use groupwise statistics as follows:
load tumor data
set reference to tumor type [benign or malignant]
groupwise mean of area
This will give me a tabular result with mean area values corresponding to each tumor type.
Groupwise statistics can also be done to summarize the data.
Example:
load tumor data
set reference to tumor type [benign or malignant]
groupwise summary of tumor data