Applied exercises

This section provides some exercises that are meant to deepen your knowledge in the topics covered in this section and to gain experience solving real-world problems.

1. Computing the OLS estimator

In this exercise you will compute the OLS estimator on a simulated data set using basic MATLAB commands. Please refer to the theory section below for the necessary formulas.

Import the simulated data from olsdata.m and compute the OLS estimator $\hat{\beta}$ using matrix expressions. Create a results matrix which stacks the estimated parameters and the values supplied in the vector beta_true side by side. Are the estimated and the true values close?

Next, read up on MATLABs regress function on the MATLAB documentation page on regress. Estimate the OLS coefficient using this function and compare the results to the ones you computed manually.

Create a new folder for this exercise and copy the olsdata.mat file into it
In MATLAB, create a new script and save it into the same folder
Start your script with the following commands

  clear all; close all; clc;
  load olsdata.mat

Check the data has been loaded into your workspace. You should see a matrix X, a vector y as well as a vector beta_true which contains the true values of $\beta$ that were used to generate the data
Add a line which computes the OLS estimator and saves it into a new variable beta_hat

beta_hat = ...formula goes here...

Note that you can code up the formula in two ways. Either by computing $(X'X)^{-1}$ separately from $X'y$ and then multiplying them or by directly writing the formula into one line.

% Compute OLS estimator
clear all; close all; clc;

load olsdata

% Compute OLS estimator
beta_hat = (X'*X)\(X'*y);

% Compare to true value
[beta_hat, beta_true]

% Extension: Compare to MATLABs regress function
beta_regress = regress(y,X);
[beta_hat, beta_regress]

Theoretical Background

Let $y$ be a $N \times 1$ vector of data on the dependent variable and let $X$ be a $N \times K$ matrix with data on the regressors where the first column is a vector of ones.

The OLS estimator of the regression coefficients is defined as $\hat{\beta} = (X'X)^{-1} (X'y)$ .

2. Computing the log-likelihood of a logit model

In this exercise you will compute the log-likelihood of a logit model on a simulated data set using basic MATLAB commands. Please refer to the theory section below for the necessary formulas.

Import the simulated data from logitdata.m and calculate the value of the log-likelihood for different values of the parameter vector $\beta$ using matrix expressions.

Approximately, for which value of $\beta$ is the log-likelihood maximal?

Create a new folder for this exercise and copy the logitdata.mat file into it
In MATLAB, create a new script and save it into the same folder
Start your script with the following commands

  clear all; close all; clc;
  load logitdata.m

Check the data has been loaded into your workspace. You should see a matrix X, and a vector y
Fix a value for $\beta$ by setting it to e.g. 0.5

beta = 0.5

Create a variable Land assign it the value of the formula for the likelihood from the theory section

L = ...formula goes here...

Hint: When coding up the formula, write it as a function of a vector. Start with the inner parts of the formula i.e. first think about how $x_i \beta$ looks like for different $i$ and how you can write it as a vector. Then think about what applying functions like exp() and ln (which in MATLAB is the log function) does to this vector. Finally think about how to evaluate the sum. It might be easiest to split up the formula into two separate parts (the one starting with $y_i$ and the one starting with $(1-y_i)$ that you save in different variables, then evaluate the sum operator over the sum of these variables.

Theoretical Background

Consider the following discrete choice logit model with no constant and one regressor

3. Estimating a factor model using Principal Components (Advanced)

In this exercise you will estimate a factor model on a simulated data set using basic MATLAB commands. Please refer to the theory section below for the necessary formulas.

Theoretical Background

We will use the following factor model

PreviousSelf-Assessment NextIntro to MATLAB Programming

Last updated 4 years ago

Was this helpful?