Member-only story

Multivariate Normal distribution and Cholesky decomposition in Stan

3 min readFeb 2, 2021

Figure 1: Simulated data in a Multivariate Normal distribution

This post provides an example of simulating data in a Multivariate Normal distribution with given parameters, and estimating the parameters based on the simulated data via Cholesky decomposition in stan. Multivariate Normal distribution is a commonly used distribution in various regression models and machine learning tasks. It generalizes the Normal distribution into multidimensional space. Its PDF can be expressed as:

where mu is a vector of k elements for the location parameters and Sigma is the Variance-Covariance matrix. The |Sigma| calculates the absolute determinant of Sigma in the equation above.

Cholesky decomposition is used to decompose a positive-definite matrix into the product of a lower triangular matrix and its transpose. In our case, we will decompose the correlation matrix (R) into the product of two triangular matrices (L*L’). Note: even though we can directly perform Cholesky decomposition on the Variance-Covariance matrix (Sigma), it is not recommended, since we may fail to estimate the standard deviations (sigma) of each variable.

Matrix representation of Cholesky decomposition

1. Data simulation

The variance-covariance matrix (Sigma) for data simulation and its analytical solution via Cholesky decomposition are provided below. For example, the standard deviation for the first variable sd_1 is 0.5, and the correlation coefficient (rho) between first and second variables is 0.7 (see Figure 1 for more intuitive illustration).

Decomposing an example correlation matrix (R) via Cholesky factorization

To generate the data, we are using the mvrnorm function from the package MASS by specifying a vector of means (mu) and variance-covariance matrix (Sigma).

mu = c(1, 2, -5) # means
R = matrix(c(1, 0.7, 0.2, # correlation matrix (R)
             0.7, 1, -0.5,
             0.2, -0.5, 1), 3)
sigmas = c(0.5, 1.2, 2.3) # sd1=0.5, sd2=1.2, sd3=2.3
Sigma = diag(sigmas) %*% R %*% diag(sigmas) # VCV matrix
data = mvrnorm(1000, mu = mu, Sigma = Sigma) #…

Multivariate Normal distribution and Cholesky decomposition in Stan

1. Data simulation

Create an account to read the full story.

Written by Jake Jing

No responses yet

More from Jake Jing

R Markdown Notebook in VS code

I am looking for a general-purpose editor that can integrate and customize different features across all programming languages that I…

Cholesky factors of covariance and correlation matrices in Stan

In this blog, I would like to give a quick overview of different types of matrices and their transformations (e.g., Cholesky decomposition)…

A Complete Tutorial for Kitty to Fish with vifm

A good terminal tool can speed up your workflow, and make your life much easier. Here I give a complete tutorial of installing the kitty…

Preview Pictures, PDFs and Videos in vifm

It takes a while for me to figure out how you can preview pictures and pdfs in vifm on Kitty terminal. I tried many different utilities…

Recommended from Medium

Outlier Detection & Treatment: Z-score, IQR, and Robust Methods

Learn to detect and treat outliers in datasets using Z-score, IQR, and robust statistical methods.

This new IDE from Google is an absolute game changer

This new IDE from Google is seriously revolutionary.

Fired From Meta After 1 Week: Here’s All The Dirt I Got

This is not just another story of a disgruntled ex-employee. I’m not shying away from the serious corporate espionage or the ethical…

Simple Ways to Tell if Python Code Was Written by an LLM

Yes, We Can Tell

Jeff Bezos Says the 1-Hour Rule Makes Him Smarter. New Neuroscience Says He’s Right

Jeff Bezos’s morning routine has long included the one-hour rule. New neuroscience says yours probably should too.

Gen Z Are Getting Fired Left and Right

The reasons are obvious, yet troublesome for most companies