Bayesian Ridge Regression

Given an input vector xt, the online Bayesian Ridge Regression predicts at each step T the normal distribution N(γT,σT2) with the mean and variance given by $\gamma_T = Y'_{T-1} X_{T-1} A_{T-1}^{-1} x_T , \quad \sigma_T^2 = \sigma^2 x_T' A_{T-1}^{-1} x_T + \sigma^2$ for some a>0 and the known noise variance σ2. Here Xt is the t×n matrix of row vectors x1ʹ,,xtʹ and Yt be the column vector of outcomes y1,,yt. Here also At=XʹtXt+aI.