
Equal covariances

Especially interesting is the case of $j$-independent ${{\bf K}}_j(\theta)$ = ${{\bf K}}_0 (\theta )$ and $\theta$-independent $\det {{\bf K}}_0 (\theta)$. In that case the determinants of ${{\bf K}}_j$, which are often difficult to obtain, need not be calculated at all.

For $j$-independent inverse covariances the high temperature solution is according to Eqs.(555,561) a linear combination of the (potential) low temperature solutions

\begin{displaymath}
\bar t = \sum_j^m a^0_j \bar t_j
.
\end{displaymath} (566)

It is worth emphasizing that, because the solution $\bar t$ is a mixture of the component solutions $\bar t_j$ and not of the component templates $t_j$, even poor choices for the template functions $t_j$ can lead to good solutions, provided enough data are available. That is indeed the reason why the most common choice $t_0\equiv 0$ for a Gaussian prior can be successful.

Eq. (565) simplifies to

\begin{displaymath}
{h}
= \frac{\sum_j^m \bar t_j \, e^{-\beta E_{{h},j}({h})-E_{\theta,\beta,j}}}
       {\sum_k^m e^{-\beta E_{{h},k}({h})-E_{\theta,\beta,k}}}
= \sum_j^m a_j \bar t_j
= \bar t + \sum_j^m (a_j-a_j^0) \, \bar t_j
,
\end{displaymath} (567)

where
\begin{displaymath}
\bar t_j
= \left( {{\bf K}}_D + {{\bf K}}_0 \right)^{-1}
\left(
{{\bf K}}_D t_D + {{\bf K}}_0 t_j
\right)
,
\end{displaymath} (568)
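To get a concrete feel for Eq. (568), here is a minimal numerical sketch. All values are illustrative assumptions (a 5-point grid with diagonal covariances, templates $t_1=1$, $t_2=-1$, data template $t_D=0.1$, loosely following the one-dimensional model of Fig. 9), not taken from the derivation itself:

```python
import numpy as np

# Toy discretization (illustrative values): diagonal covariances on n points.
n = 5
K_D = np.eye(n)               # inverse covariance of the data term
K_0 = 2.0 * np.eye(n)         # j-independent prior inverse covariance K_0
t_D = 0.1 * np.ones(n)        # data template
templates = [np.ones(n), -np.ones(n)]   # component templates t_1, t_2

# Eq. (568): t_bar_j = (K_D + K_0)^{-1} (K_D t_D + K_0 t_j)
t_bar = [np.linalg.solve(K_D + K_0, K_D @ t_D + K_0 @ t) for t in templates]
# With these diagonal choices t_bar_1 = (0.1 + 2)/3 = 0.7 at every point,
# and t_bar_2 = (0.1 - 2)/3, i.e. each component solution interpolates
# between the data template t_D and its component template t_j.
```

Each $\bar t_j$ is thus a template-specific compromise between data and prior, which is what makes the convex combinations in Eq. (567) plausible candidates for solutions.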

and (for $j$-independent $d$)
\begin{displaymath}
a_j
= \frac{e^{-E_j}}
       {\sum_k e^{-E_k}}
= \frac{e^{-\frac{\beta}{2} a {B}_j a+d_j}}
       {\sum_k e^{-\frac{\beta}{2} a {B}_k a+d_k}}
,
\end{displaymath} (569)

introducing the vector $a$ with components $a_j$, the $m\times m$ matrices
\begin{displaymath}
{B}_j (k,l) =
\Big(\bar t_k-\bar t_j,\,\left( {{\bf K}}_D + {{\bf K}}_0\right)
\,(\bar t_l-\bar t_j)\Big)
\end{displaymath} (570)

and constants
\begin{displaymath}
d_j=
-\beta V_j-E_{\theta,\beta,j}
,
\end{displaymath} (571)

with $V_j$ given in (564). Eq. (567) is still a nonlinear equation for ${h}$; it shows, however, that the solutions must be convex combinations of the ${h}$-independent $\bar t_j$. Thus, it is sufficient to solve Eq. (569) for the $m$ mixture coefficients $a_j$ instead of Eq. (548) for the function ${h}$.
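The reduction to the coefficients $a_j$ can be sketched numerically by iterating Eq. (569) as a fixed-point map. The setup below reuses the illustrative toy model (5-point grid, diagonal covariances, templates $\pm 1$), and for simplicity sets $d_j = 0$, which corresponds to a hyperprior uniform in $j$ with $j$-independent $V_j$; these are assumptions for the sketch, not part of the derivation:

```python
import numpy as np

# Toy discretized model (illustrative values): n grid points, diagonal
# covariances, two component templates t_1 = 1, t_2 = -1.
n = 5
K_D, K_0 = np.eye(n), 2.0 * np.eye(n)
t_D = 0.1 * np.ones(n)
templates = [np.ones(n), -np.ones(n)]
A = K_D + K_0
# Eq. (568): component solutions t_bar_j
t_bar = [np.linalg.solve(A, K_D @ t_D + K_0 @ t) for t in templates]
m = len(t_bar)

# Eq. (570): B_j(k,l) = ( t_bar_k - t_bar_j , (K_D + K_0)(t_bar_l - t_bar_j) )
B = np.array([[[(t_bar[k] - t_bar[j]) @ (A @ (t_bar[l] - t_bar[j]))
                for l in range(m)] for k in range(m)] for j in range(m)])

def solve_coefficients(beta, d, a=None, n_iter=500):
    """Fixed-point iteration of the stationarity Eq. (569) for the a_j."""
    a = np.full(m, 1.0 / m) if a is None else np.asarray(a, float)
    for _ in range(n_iter):
        expo = np.array([-0.5 * beta * (a @ B[j] @ a) + d[j] for j in range(m)])
        expo -= expo.max()              # numerical stabilization of the softmax
        a = np.exp(expo) / np.exp(expo).sum()
    return a
```

At small $\beta$ the iteration stays at the high-temperature solution $a_j = 1/m$; at larger $\beta$ the solution reached depends on the starting point, as expected for a nonlinear stationarity equation with several stable solutions.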

The high temperature relation Eq. (553) becomes

\begin{displaymath}
a_j
\stackrel{\beta\rightarrow 0}{\longrightarrow}
a^0_j =
\frac{e^{-E_{\theta,\beta,j}}}
     {\sum_k^m e^{-E_{\theta,\beta,k}}}
,
\end{displaymath} (572)

or $a^0_j = 1/m$ for a hyperprior $p(\theta,\beta,j)$ uniform with respect to $j$. The low temperature relation Eq. (559) remains unchanged.

For $m=2$ Eq. (567) becomes

\begin{displaymath}
{h}
=\sum_j^2 a_j \bar t_j
=\frac{\bar t_1 + \bar t_2}{2}
+ \left(\tanh \Delta\right) \frac{\bar t_1 - \bar t_2}{2}
,
\end{displaymath} (573)

with $(\bar t_1+\bar t_2)/2$ = $\bar t$ in case $E_{\theta,\beta,j}$ is uniform in $j$ so that $a_j^0$ = $0.5$, and
\begin{displaymath}
\Delta
= \frac{E_2-E_1}{2}
\; = \; \beta\, \frac{E_{{h},2}-E_{{h},1}}{2}
+\frac{E_{\theta,\beta,2}-E_{\theta,\beta,1}}{2}
\; = \; -\frac{\beta}{4}\, a(B_1-B_2)a +\frac{d_1-d_2}{2}
\; = \; \frac{\beta}{4}\, b\,(2 a_1-1) + \frac{d_1-d_2}{2}
,
\end{displaymath} (574)

because the matrices $B_j$ are in this case zero except $B_1(2,2) = B_2(1,1) = b$. The stationarity Eq. (569) can be solved graphically (see Figs. 7, 8), the solution being given by the point where $a_1 e^{-\frac{\beta}{2} b a_1^2 + d_2} = (1-a_1) e^{-\frac{\beta}{2} b (1-a_1)^2+d_1}$, or, alternatively,
\begin{displaymath}
a_1 = \frac{1}{2} \left(\tanh \Delta + 1\right)
.
\end{displaymath} (575)

That equation is analogous to the celebrated mean field equation of the ferromagnet.
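Besides the graphical solution, one simple way to locate the stable solutions of Eq. (575) numerically is fixed-point iteration (an assumed solution technique for this sketch, not prescribed by the text), using the parameter values quoted in the caption of Fig. 7 ($b=2$, $d_1 = -0.2025\beta$, $d_2 = -0.3025\beta$):

```python
import numpy as np

# Iterating the self-consistency Eq. (575), a_1 = (tanh(Delta) + 1)/2, with
# Delta = (beta/4) b (2 a_1 - 1) + (d_1 - d_2)/2, cf. Eq. (574).
# Parameter values are those quoted in the caption of Fig. 7:
# b = 2, d_1 = -0.2025 beta, d_2 = -0.3025 beta.
def fixed_point(beta, a1, b=2.0, n_iter=500):
    """Iterate a_1 -> (tanh(Delta(a_1)) + 1)/2 from the start value a1."""
    d1, d2 = -0.2025 * beta, -0.3025 * beta
    for _ in range(n_iter):
        delta = 0.25 * beta * b * (2.0 * a1 - 1.0) + 0.5 * (d1 - d2)
        a1 = 0.5 * (np.tanh(delta) + 1.0)
    return a1
```

At high temperature ($\beta = 2$) every starting value reaches the same, unique stable solution; at low temperature ($\beta = 4$) starting points on the two sides of the unstable fixed point converge to the two different stable branches, in accordance with the bifurcation shown in Fig. 7.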

We conclude that in the case of equal component covariances, in addition to the linear low-temperature equations, only an $(m-1)$-dimensional nonlinear equation has to be solved to determine the `mixing coefficients' $a_1,\cdots , a_{m-1}$.

Figure 7: The solution of the stationarity Eq. (569) is given by the point where $a_1 e^{-\frac{\beta}{2} b a_1^2 + d_2}$ = $(1-a_1) e^{-\frac{\beta}{2} b (1-a_1)^2+d_1}$ (upper row) or, equivalently, $a_1$ = $\frac{1}{2} \left(\tanh \Delta + 1\right)$ (lower row). Shown are, from left to right, a situation at high temperature with one stable solution ($\beta $ = $2$), at a temperature ($\beta $ = $2.75$) near the bifurcation, and at low temperature with two stable and one unstable solution ($\beta $ = $4$). The values $b$ = $2$, $d_1 = -0.2025 \beta $ and $d_2 = -0.3025 \beta $ used for the plots correspond, for example, to the one-dimensional model of Fig. 9 with $t_1=1$, $t_2=-1$, $t_D = 0.1$. Notice, however, that the shown relation is valid for $m=2$ at arbitrary dimension.
\begin{figure}\vspace{-5cm}
\begin{center}
\epsfig{file=ps/b2pic.ps, width=110mm}\end{center}\vspace{-6.0cm}
\end{figure}

Figure 8: As in Fig. 7, the plots of $f_1(a_1)=a_1$ and $f_2(a_1)=\frac{1}{2} \left(\tanh \Delta + 1\right)$ are shown, here within the inverse temperature range $0\le \beta \le 4$.
\begin{figure}\vspace{-5cm}
\begin{center}
$\!\!\!\!\!\!\!\!\!\!$\epsfig{file=ps/tpic.ps, width=95mm}\end{center}\vspace{-3.5cm}
\end{figure}

Figure 9: Shown is the joint posterior density of $h$ and $\beta $, i.e., $p({h},\beta \vert D,D_0)$ $\propto p(y_D\vert{h},\beta )p({h}\vert\beta ,D_0)p(\beta )$ for a zero-dimensional example of a Gaussian prior mixture model with training data $y_D=0.1$ and prior data $y_{D_0}=\pm 1$ and inverse temperature $\beta $. L.h.s.: For uniform prior (middle) $p(\beta ) \propto 1$ with joint posterior $p \propto $ $e^{-\frac{\beta}{2} {h}^2 + \ln \beta}$ $\left(e^{-\frac{\beta}{2} ({h}-1)^2}
+ e^{-\frac{\beta}{2} ({h}+1)^2}\right)$ the maximum appears at finite $\beta $. (Here no factor $1/2$ appears in front of $\ln\beta$ because normalization constants for prior and likelihood term have to be included.) R.h.s.: For compensating hyperprior $p(\beta ) \propto 1/\sqrt {\beta }$ with $p \propto $ $e^{-\frac{\beta}{2} {h}^2 -\frac{\beta}{2} ({h}-1)^2}$ $+$ $e^{-\frac{\beta}{2} {h}^2 -\frac{\beta}{2} ({h}+1)^2}$ the maximum is at $\beta $ = $0$.
\begin{figure}\begin{center}
\epsfig{file=ps/betaMU.eps, width= 65mm}\epsfig{file=ps/betaMC.eps, width= 65mm}\end{center}\end{figure}

Figure 10: Same zero-dimensional prior mixture model for uniform hyperprior on $\beta $ as in Fig.9, but for varying data $x_d=0.3$ (left), $x_d=0.5$ (right).
\begin{figure}\begin{center}
\epsfig{file=ps/betaMUa.eps, width= 65mm}\epsfig{file=ps/betaMUb.eps, width= 65mm}\end{center}\end{figure}


Joerg_Lemm 2001-01-21