9  A Math Reference: Sample Modeling Equations to Borrow

Author

Luke Miratrix

9.1 Introduction

This document has a bunch of mathematical equations we use in the class. It is a good reference for how to write your own math equations in your life moving forward. Generally, people write math equations using something called Latex. Latex (or Tex) is a way of writing documents where mixed in with the writing of what you want to say are commands (editorial markup, if you will) describing how you want your document to look. This is a very powerful thing: there are Tex editors that allow you to write entire articles, books, reports, poetry, or whatever with extreme control over the typesetting used. It creates beautifully typeset documents that are easy to distinguish from those written in, say, MS Word due to how they adjust whitespace on the page. That being said, it can be a lot to jump in to.

Enter R Markdown. R Markdown is a useful and easy way to take advantage of this syntax without the overhead of writing entire documents in Latex, even if you don’t have any R code in your document. Inside R Markdown you can write math equations, and then when you render the report, it not only runs all the R code, but it formats all the math for you as well! You can even have R Markdown render to MS Word to give you a word doc with all your math equations ready to go.

9.1.1 Using this document

You are probably reading the PDF version of this document. But really you should open the .Rmd file that generated this document, so you can cut and paste the relevant equations into your own work, and then modify as necessary. The link to this file here.

9.2 Overview of Using Latex

For math in your writing, you denote the beginning and the end of a math equation in your text using “$”s—one at the start and one at the stop. E.g., “$ math stuff $”. Most greek letters are written as their names with a backslash “\” just before it. E.g., “\alpha”.

So if I want to write an alpha, I write “$\alpha$” and get \(\alpha\).

I can do subscripts by using an underscore. E.g., “$\alpha_j$” gives \(\alpha_j\). I can also do superscripts by using a hat. E.g., “$\alpha^2$” gives \(\alpha^2\). To put more than one character in a subscript (or superscript), put the stuff to be subscripted in curly braces, e.g., “$\alpha_{ij}$” gives \(\alpha_{ij}\).

9.2.1 Some useful greek letters

Here are some useful greek letters and symbols

Letter Name
\(\alpha\) \alpha
\(\beta\) \beta
\(\delta, \Delta\) \delta, \Delta
\(\epsilon\) \epsilon
\(\sigma, \Sigma\) \sigma, \Sigma
\(\rho\) \rho
\(\mu\) \mu (Meew!)
\(\tau\) \tau
\(\times\) \times
\(\sim\) \sim

See many more symbols at, e.g., https://www.caam.rice.edu/~heinken/latex/symbols.pdf. This was found by searching “tex symbols” on Google.

9.2.2 Equations on lines by themselves

To write an equation on a line by itself, put the math stuff in between a pair of double “$”. E.g., if we write:

$$ Y = a X + b $$

We get \[ Y = a X + b .\]

If we want multiple lines, we have to put our equation between a \begin{aligned} and \end{aligned} command and use a double backslash (“\\”) to denote each line break (even if we have a line break we have to do this—we have to explicitly tell the program converting our raw text to nice formatted text where the line breaks are). Finally, inside the begin-end block of math, line things up with & symbols on each row of our equation. The & symbols will be lined up vertically.

So if we write

$$
\begin{aligned}
Y &= 10 X + 2 \\
Y - 5 &= 3 X^2 + 5
\end{aligned}
$$

we get \[ \begin{aligned} Y &= 10 X + 2 \\ Y - 5 &= 3 X^2 + 5 \end{aligned} \] Also consider:

$$
\begin{aligned}
a + b + c + d &= c \\
 d &= e + f + g + h 
\end{aligned}
$$

giving \[ \begin{aligned} a + b + c + d &= c \\ d &= e + f + g + h \end{aligned} \]

9.2.3 Normal text in equations

If you put words in your equations, they get all italliced and weird, without their spaces:

$$
5 + my dog = 10
$$

\[ 5 + my dog = 10 \]

You can fix using the “\mbox{}” command as so:

$$
5 + \mbox{my dog} = 10
$$

\[ 5 + \mbox{my dog} = 10 \]

We next walk through some latex code for the models you will most see.

9.3 Sample code: Random Intercept Model

Our canonical Random Intercept model is as follows. First, our Level 1 model:

$$
\begin{aligned}
y_{ij} &= \alpha_{j} + \beta_{1} ses_{ij} +  \epsilon_{ij} \\
\epsilon_{ij} &\sim N( 0, \sigma^2_y ) \\
\end{aligned}
$$

\[ \begin{aligned} y_{ij} &= \alpha_{j} + \beta_{1} ses_{ij} + \epsilon_{ij} \\ \epsilon_{ij} &\sim N( 0, \sigma^2_y ) \\ \end{aligned} \]

Our Level 2 model:

$$
\begin{aligned}
\alpha_{j} &= \gamma_{0} + \gamma_{1} sector_j + u_{j} \\
u_{j} &\sim N( 0, \sigma^2_\alpha ) \\
\end{aligned}
$$

\[ \begin{aligned} \alpha_{j} &= \gamma_{0} + \gamma_{1} sector_j + u_{j} \\ u_{j} &\sim N( 0, \sigma^2_\alpha ) \\ \end{aligned} \]

The Gelman and Hill bracket notation looks like this:

$$
\begin{aligned}
y_{i} &= \alpha_{j[i]} + \beta_{1} ses_{i} +  \epsilon_{i} \\
\epsilon_i &\sim N( 0, \sigma^2_y ) \\
\alpha_{j} &= \gamma_{0} + \gamma_{1} sector_j + u_{j} \\
u_{j} &\sim N( 0, \sigma^2_\alpha ) \\
\end{aligned}
$$

\[ \begin{aligned} y_{i} &= \alpha_{j[i]} + \beta_{1} ses_{i} + \epsilon_{i} \\ \epsilon_i &\sim N( 0, \sigma^2_y ) \\ \alpha_{j} &= \gamma_{0} + \gamma_{1} sector_j + u_{j} \\ u_{j} &\sim N( 0, \sigma^2_\alpha ) \\ \end{aligned} \]

The reduced form would look like this:

$$
y_{i} = \gamma_{0} + \gamma_{1} sector_{j[i]} + \beta_{1} ses_{i} + u_{j[i]} + \epsilon_{i}
$$

\[ y_{i} = \gamma_{0} + \gamma_{1} sector_{j[i]} + \beta_{1} ses_{i} + u_{j[i]} + \epsilon_{i} \]

with

$$
\epsilon_i \sim N( 0, \sigma^2_y ), \mbox{ and } u_{j} \sim N( 0, \sigma^2_\alpha )
$$

\[ \epsilon_i \sim N( 0, \sigma^2_y ), \mbox{ and } u_{j} \sim N( 0, \sigma^2_\alpha ) \]

If we want to be really prissy, we can write down the i.i.d. aspect of our random effects like this

$$
\epsilon_i \stackrel{i.i.d}{\sim} N( 0, \sigma^2_y ), 
\mbox{ and } u_{j} \stackrel{i.i.d}{\sim} N( 0, \sigma^2_\alpha )
$$

\[ \epsilon_i \stackrel{i.i.d}{\sim} N( 0, \sigma^2_y ), \\ \mbox{ and } u_{j} \stackrel{i.i.d}{\sim} N( 0, \sigma^2_\alpha ) \] The \stackrel{}{} command takes two bits of latex, each in the curly braces, and stacks them on top of each other.

9.4 Sample code: Random Slope Model

The canonical random slope model for HS&B with ses at level 1 and sector at level 2 involves a matrix for the pair of random effects. We have to get a bit fancier with our TeX, therefore!

Level 1 models:

$$
\begin{aligned}
y_{ij} &= \beta_{0j} + \beta_{1j} ses_{ij} +  \epsilon_{ij} \\
\epsilon_{ij} &\sim N( 0, \sigma^2_y ) \\
\end{aligned}
$$

\[ \begin{aligned} y_{ij} &= \beta_{0j} + \beta_{1j} ses_{ij} + \epsilon_{ij} \\ \epsilon_{ij} &\sim N( 0, \sigma^2_y ) \\ \end{aligned} \]

Level 2 models:

$$
\begin{aligned}
\beta_{0j} &= \gamma_{00} + \gamma_{01} sector_j + u_{0j} \\
\beta_{1j} &= \gamma_{10} + \gamma_{11} sector_j + u_{1j} 
\end{aligned}
$$

\[ \begin{aligned} \beta_{0j} &= \gamma_{00} + \gamma_{01} sector_j + u_{0j} \\ \beta_{1j} &= \gamma_{10} + \gamma_{11} sector_j + u_{1j} \end{aligned} \]

with

$$
\begin{pmatrix} u_{0j} \\
u_{1j}
\end{pmatrix} \sim  N
\begin{bmatrix}
\begin{pmatrix}
0 \\
0
\end{pmatrix}\!\!,&
\begin{pmatrix}
\tau_{00} & \tau_{01}\\
 & \tau_{11} \\
\end{pmatrix}
\end{bmatrix}
$$

\[ \begin{pmatrix} u_{0j} \\ u_{1j} \end{pmatrix} \sim N \begin{bmatrix} \begin{pmatrix} 0 \\ 0 \end{pmatrix}\!\!,& \begin{pmatrix} \tau_{00} & \tau_{01}\\ & \tau_{11} \\ \end{pmatrix} \end{bmatrix} \]

The TeX for the derivation of the reduced form is:

$$
\begin{aligned}
y_{ij} &= \beta_{0j} + \beta_{1j} ses_{ij} + \epsilon_{ij}\\
&= \left( \gamma_{00} + \gamma_{01} sector_j + u_{0j} \right)+ \\\
  (\gamma_{10} + \gamma_{11} sector_j + u_{1j}) ses_{ij} +  \epsilon_{ij} \\
&= \gamma_{00} + \gamma_{01} sector_j + u_{0j}  + \gamma_{10}ses_{ij} + \\
  \gamma_{11} sector_j ses_{ij} + u_{1j} ses_{ij} +  \epsilon_{ij}  \\
&= \gamma_{00} + \gamma_{01} sector_j +  \gamma_{10}ses_{ij} + \\
  \gamma_{11} sector_j ses_{ij} + \left(u_{0j} + u_{1j} ses_{ij} + \epsilon_{ij} \right) 
\end{aligned}
$$

\[ \begin{aligned} y_{ij} &= \beta_{0j} + \beta_{1j} ses_{ij} + \epsilon_{ij}\\ &= \left( \gamma_{00} + \gamma_{01} sector_j + u_{0j} \right)+ (\gamma_{10} + \gamma_{11} sector_j + u_{1j}) ses_{ij} + \epsilon_{ij} \\ &= \gamma_{00} + \gamma_{01} sector_j + u_{0j} + \gamma_{10}ses_{ij} + \gamma_{11} sector_j ses_{ij} + u_{1j} ses_{ij} + \epsilon_{ij} \\ &= \gamma_{00} + \gamma_{01} sector_j + \gamma_{10}ses_{ij} + \gamma_{11} sector_j ses_{ij} + \left(u_{0j} + u_{1j} ses_{ij} + \epsilon_{ij} \right) \end{aligned} \]

Commentary: There are various and competing ways of writing the covariance matrix for the random effects. The \(\tau_{**}\) notation is easy and expands to any sized matrix (if we, for example, have more than one random slope). But all the \(\tau_{**}\) are variances, not standard deviations, and we often like to talk about random effect variation in terms of standard deviations. We can thus use something like this instead:

$$
\begin{pmatrix} u_{0j} \\
u_{1j}
\end{pmatrix} \sim  N
\begin{bmatrix}
\begin{pmatrix}
0 \\
0
\end{pmatrix}\!\!,&
\begin{pmatrix}
\sigma^2_{0} & \rho \sigma_0 \sigma_1 \\
 & \sigma^2_{1} \\
\end{pmatrix}
\end{bmatrix}
$$

\[ \begin{pmatrix} u_{0j} \\ u_{1j} \end{pmatrix} \sim N \begin{bmatrix} \begin{pmatrix} 0 \\ 0 \end{pmatrix}\!\!,& \begin{pmatrix} \sigma^2_{0} & \rho \sigma_0 \sigma_1 \\ & \sigma^2_{1} \\ \end{pmatrix} \end{bmatrix} \]

Here we have a correlation of random effects, \(\rho\), instead of a covariance, \(\tau_{01}\). And we can talk about the standard deviation of, e.g., the random intercepts, as \(\sigma_0\) rather than \(\sqrt{ \tau_{00} }\). Different ways of writing the same mathematical thing are called different parameterizations; they are equivalent, but are more or less clear for different contexts.
Unfortunately, this means there is no one right answer for how to write down a mathematical equation!

9.5 Summations and fancy stuff

Fractions are as follows:

$$
cor( A, B ) = \frac{ cov( A, B ) }{ \sigma_A \sigma_B }
$$

\[ cor( A, B ) = \frac{ cov( A, B ) }{ \sigma_A \sigma_B } \]

For reference, you can do summations and whatnot as follows:

$$
Var( Y_{i} ) = \frac{1}{n-1} \sum_{i=1}^n \left( Y_{i} - \bar{Y} \right)^2 
$$

\[ Var( Y_{i} ) = \frac{1}{n-1} \sum_{i=1}^n \left( Y_{i} - \bar{Y} \right)^2 \]

And if you have fractions you can have big brackets with “\left(” and “\right)” as follows:

$$
X = \left( \frac{1}{2} + y \right)
$$

\[ X = \left( \frac{1}{2} + y \right) \]

Annoyingly, you always need a pair of these big brackets. If you really don’t want one, you use a backslash and a dot, like so:

$$
X = \left( \frac{1}{2} + y \right.
$$

\[ X = \left( \frac{1}{2} + y \right. \]

The rest you can find on StackOverflow or similar. Or perhaps have ChatGPT help you write your code!