Explore_the_Formulation_of

In the EM algorithm, both complete data and missing data are defined:

$\{Y_i, \theta_i, z_i\}$ defines the complete data, where $Y_i$ denotes the observations for the i^th individual.

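$\{\theta_i, z_i\}$ is the missing data, where $z_i$ is a K-dimensional vector whose k^th component, $z_i(k)$, is 1 or 0 depending on whether $\theta_i$ belongs to the k^th mixing component in the equation:

$$\theta_i \sim \sum_{k=1}^{K} w^{(k)} N(\mu^{(k)}, \Omega^{(k)})$$

where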

$w^{(k)}$ is the weight for the k^th Gaussian distribution $N(\mu^{(k)}, \Omega^{(k)})$ (a nonnegative number, normalized so that $\sum_{k=1}^{K} w^{(k)} = 1$)

$\mu^{(k)}$ is the mean vector ($\mu^{(k)} \in \mathbb{R}^{K_\theta}$)

$\Omega^{(k)}$ is the positive definite covariance matrix ($\Omega^{(k)} \in \mathbb{R}^{p \times p}$)

The purpose of the EM algorithm is to start with $\phi^{(0)}$ and iterate from $\phi^{(r)}$ to $\phi^{(r+1)}$ at the r^th iteration, continuing the process until the desired parameters are identified, such that

Eqtn_2_2_7d_DesiredParams

where $Q(\phi, \phi^{(r)})$ is defined and calculated by the equations discussed below. This process guarantees convergence to a stationary point of the likelihood [3] [11] [12]. Because a stationary point need not be the global maximum, a number of starting positions are typically suggested in an effort to reach the global maximum [3].

The E-Step

During the Expectation Step, the function $Q(\phi, \phi^{(r)})$ is defined as Eqtn_2_2_8_EstepFunctionQ, where the complete data log-likelihood $\log L_c(\phi)$ is given by

Eqtn_2_2_9_CompleteDataLikelihood

Using Bayes' theorem, the function $Q(\phi, \phi^{(r)})$ can be written as [3]

Eqtn_2_2_10_QFunctionRewrite

where

Eqtn_2_2_11_gikEquation

and, for some constant C,

Eqtn_2_2_12_logPEquation

Note that the probability that the i^th individual belongs to the k^th mixing component can be defined as

Eqtn_2_2_13_ProbabilityEquation

The M-Step

In the Maximization Step, it is sufficient to find the unique solution $\phi^{(r+1)}$ such that

Eqtn_2_2_14_UniqueSolution

where Eqtn_2_2_14a_Where. This leads to unique solutions [3] of Eqtn_2_2_14_UniqueSolutionParams. (See the “Solutions” section for details.)

The updated weight $w^{(k)}$ can be calculated as the average of the contributions from each subject to the k^th mixing component [3], i.e.,

Eqtn_2_2_15_WeightEquation
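The averaging described above can be sketched in a few lines. This is a minimal illustration, assuming the per-subject membership probabilities are already available as a table `tau[i][k]` (subject i, component k); the function name `update_weights` and the numbers are hypothetical and are not taken from the source.

```python
def update_weights(tau):
    """Return one weight per component as the mean membership probability.

    tau[i][k] is assumed to hold the probability that subject i belongs
    to mixing component k (each row sums to 1).
    """
    n_subjects = len(tau)
    n_components = len(tau[0])
    return [
        sum(row[k] for row in tau) / n_subjects
        for k in range(n_components)
    ]


# Hypothetical membership probabilities for 3 subjects and 2 components.
tau = [
    [0.9, 0.1],
    [0.2, 0.8],
    [0.6, 0.4],
]
weights = update_weights(tau)
```

Because every row of `tau` sums to 1, the updated weights are nonnegative and sum to 1 without any extra normalization step.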

To calculate the log of the likelihood function $L(\phi)$ in

Eqtn_2_2_4_OverallLikelihood

(discussed in “Two-Stage Nonlinear Random Effects Mixture Model”), first evaluate the denominator of $g_{ik}(\theta_i, \phi^{(r)})$, which does not depend on $k$. Define it as $N_i$ such that

Eqtn_2_2_16_Ni

where

Eqtn_2_2_17_nIntegral

Once $n_{ik}$ and $N_i$ are obtained, the earlier equation:

Eqtn_2_2_13_ProbabilityEquation

can be immediately evaluated as the ratio $n_{ik} / N_i$.

The log of the likelihood function Eqtn_2_2_4_OverallLikelihood is Eqtn_2_2_19_LogLikelihood.
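The bookkeeping in this passage can be illustrated with a short sketch. It assumes that the component integrals $n_{ik}$ have already been computed by some numerical method, that $N_i$ is their sum over the mixing components, that each membership probability is the ratio $n_{ik} / N_i$, and that the log-likelihood is the sum of $\log N_i$ over subjects. These relationships are assumptions inferred from the surrounding text, and the name `membership_and_loglik` and the input values are hypothetical.

```python
import math


def membership_and_loglik(n):
    """Derive N_i, membership probabilities, and the log-likelihood.

    n[i][k] is assumed to be the precomputed integral n_ik for
    subject i and mixing component k (all values positive).
    """
    # Assumed relationship: N_i is the sum of n_ik over components.
    N = [sum(row) for row in n]
    # Assumed relationship: membership probability is n_ik / N_i.
    tau = [[n_ik / N_i for n_ik in row] for row, N_i in zip(n, N)]
    # Assumed relationship: log-likelihood is the sum of log N_i.
    loglik = sum(math.log(N_i) for N_i in N)
    return N, tau, loglik


# Hypothetical integrals for 2 subjects and 2 components.
n = [
    [0.03, 0.01],
    [0.002, 0.018],
]
N, tau, loglik = membership_and_loglik(n)
```

Computing $N_i$ once per subject and reusing it for both the membership probabilities and the log-likelihood avoids evaluating the same integrals twice in each iteration.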

The EM iterates $\phi^{(r)}$ have the important property that the corresponding likelihoods $L(\phi^{(r)})$ are non-decreasing, i.e., $L(\phi^{(r+1)}) \ge L(\phi^{(r)})$ for all r [11] [3].
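The non-decreasing property can be checked numerically. The sketch below does so in a deliberately simplified setting: a scalar Gaussian mixture in which each value is observed directly, so no integration over individual parameters is needed. It is not the two-stage model described in this section, and the names `normal_pdf`, `em_step`, the data, and the starting values are all hypothetical.

```python
import math


def normal_pdf(x, mu, var):
    """Density of a scalar normal distribution with mean mu and variance var."""
    return math.exp(-0.5 * (x - mu) ** 2 / var) / math.sqrt(2.0 * math.pi * var)


def em_step(x, w, mu, var):
    """Run one EM iteration for a scalar Gaussian mixture.

    Returns the updated (w, mu, var) and the log-likelihood evaluated
    at the parameters that were passed in.
    """
    K = len(w)
    # E-step: weighted densities, their per-point sums, and memberships.
    n = [[w[k] * normal_pdf(xi, mu[k], var[k]) for k in range(K)] for xi in x]
    N = [sum(row) for row in n]
    tau = [[n_ik / N_i for n_ik in row] for row, N_i in zip(n, N)]
    loglik = sum(math.log(N_i) for N_i in N)
    # M-step: closed-form updates for weights, means, and variances.
    new_w, new_mu, new_var = [], [], []
    for k in range(K):
        s = sum(t[k] for t in tau)
        m = sum(t[k] * xi for t, xi in zip(tau, x)) / s
        v = sum(t[k] * (xi - m) ** 2 for t, xi in zip(tau, x)) / s
        new_w.append(s / len(x))
        new_mu.append(m)
        new_var.append(v)
    return new_w, new_mu, new_var, loglik


# Hypothetical data with two well-separated groups, and a rough starting point.
x = [-2.1, -1.9, -2.4, -1.6, -2.0, 1.8, 2.2, 2.5, 1.7, 2.0]
w, mu, var = [0.5, 0.5], [-1.0, 1.0], [1.0, 1.0]

logliks = []
for _ in range(20):
    w, mu, var, loglik = em_step(x, w, mu, var)
    logliks.append(loglik)

# Each entry should be at least as large as the one before it,
# up to floating-point rounding.
non_decreasing = all(b >= a - 1e-9 for a, b in zip(logliks, logliks[1:]))
```

Tracking the log-likelihood at every iteration in this way also gives a simple convergence check: the loop can stop once successive values differ by less than a chosen tolerance.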