Member-only story

Machine learning basics (part 10): Probabilistic models

Hang Nguyen
8 min readMay 17, 2022

--

Some useful statistical formulas

Expectation (expected values)

In practice we calculate — for random variable X — the mean taking into account probabilities of different values. For instance, for a dice with probabilities pj the expected value is calculated:

Variance

It measures how spread out the values are, with its equation as:

The square root of variance is known as standard deviation, σj .

Data matrix

Covariance

The variance looks at the variation in one variable compared to its mean. By generalizing to two variables we get covariance which is a measure of how dependent the two variables are in statistical sense:

If two variables X_j and X_l are independent, then covariance is 0 (they are uncorrelated), while if both increase and decrease at the same time, covariance is positive, and if one goes up while the other goes down, covariance is negative.

The covariance matrix presents how the data vary along with each dimension (variable).

--

--

Hang Nguyen
Hang Nguyen

Written by Hang Nguyen

Just sharing (data) knowledge

No responses yet