Statistics

Range

  • [a,b) - inclusive a, exclusive b

Descriptive Statistics

Characteristics

  • discrete characteristics: number of dots on a die
  • continuous characteristics: body height

Absolute frequency is the number of occurrences.

Relative frequency:

  • without classes:
  • with classes:

Marginal frequencies (in contingency tables):

  • absolute MF:
  • relative MF:

Data Representation

  • Table
  • Histogram
  • Boxplot
  • Contingency table (joint occurrence of characteristics in the sample)

Measures of Central Tendency

  • Mean
  • Median
  • Variance / Standard deviation
  • Quantiles / Interquartile range
  • Marginal frequencies for classes: and

Graphical Analysis

Determine the linear relationship between two characteristics and :

  • Covariance
  • Cross-correlation coefficient ( indicates correlation)
  • Empirical coefficient of determination

Scale Transformation

When scaling central tendency & dispersion parameters linearly:

  • Mean:
  • Variance:

Regression

If a dataset has a high coefficient of determination , and the scatter plot shows a linear relationship, you can use linear regression to calculate the optimal parameters for and $b`.