Explore the Formulation of EM

In the EM algorithm, both complete data and missing data are defined:

Ye_Yithetaizi_i1NSUB defines the complete data.

{Thetai,zi_vector} is missing data, where zi_vector is a K-dimensional vector whose kth component, zik, is 1 or 0 depending on whether Thetai belongs to the kth mixing in the equation:
     Eqtn_2_2_3_GaussianKMixEM
where

w_superk is the weight for the kth Gaussian distribution Nmusuperkomegasuperk (nonnegative number, normalized by Eqtn_2_2_3b_Normalized)

mu_superk is the mean vector (musuperk_RsuperKtheta)

omega_superk is the positive definite covariance matrix (omegak_Rsuperpxp)

The purpose of the EM algorithm is to start with phi_super0 and iterate from phi_superr to phi_superrplus1 at the rth iteration, continuing the process until the desired parameters phi_superrplus1 are identified, such that

Eqtn_2_2_7d_DesiredParams

where Qpsipsi_superr is defined and calculated by equations discussed below. This process guarantees convergence to a stationary point of the likelihood [3] [11] [12], and typically, a number of starting positions are suggested in an effort to ensure convergence to a global maximum [3].

The E-Step 

During the Expectation Step, the function is defined as Eqtn_2_2_8_EstepFunctionQ, where the complete data likelihood logLc_psi is given by

Eqtn_2_2_9_CompleteDataLikelihood

By using Bayes Theorem, the function can be written as [3] 

Eqtn_2_2_10_QFunctionRewrite

where

Eqtn_2_2_11_gikEquation

and, for some constant C,

Eqtn_2_2_12_logPEquation

Note that the probability that the ith individual belongs to the kth mixing component can be defined as

Eqtn_2_2_13_ProbabilityEquation

The M-Step 

In the Maximization Step, it is sufficient to find the unique solution of phi_superrplus1 such that

Eqtn_2_2_14_UniqueSolution

where Eqtn_2_2_14a_Where. This leads to unique solutions [3] of Eqtn_2_2_14_UniqueSolutionParams. (See the “Solutions” section for details.)

The updating of w_superK can be calculated as the average of the contributions from each subject to the kth mixing [3], i.e.,

Eqtn_2_2_15_WeightEquation

To calculate the log of the likelihood function Log_phi in

Eqtn_2_2_4_OverallLikelihood

(discussed in “Two-Stage Nonlinear Random Effects Mixture Model”) first evaluate the denominator of gik_thetai_psisuperr, which does not depend on k. Define it as Ni such that

Eqtn_2_2_16_Ni

where

Eqtn_2_2_17_nIntegral

Once nik and Ni are obtained, the earlier equation:

Eqtn_2_2_13_ProbabilityEquation

can be immediately evaluated by

The log of the likelihood function Eqtn_2_2_4_OverallLikelihood is Eqtn_2_2_19_LogLikelihood.

The EM iterates phi_superr have the important property that the corresponding likelihoods Log_phi_superr are non-decreasing, i.e., Eqtn_2_2_19a_NondecreasingLikelihood for all r [11] [3].

Solutions 

Looking at gikthetaiphi_superr (from the E-step)

Eqtn_2_2_11_gikEquation

leads to the conclusions that

Eqtn_2_2_20_1_Summations

Therefore, the unique solutions [3] of Eqtn_2_2_14_UniqueSolution (from the M-step) can be written as

Eqtn_2_2_26_7_8_UniqueSolutions

In a case described in reference [3], the parameter Thetai can be partitioned into two components thetai_alphaizetai, where

alphai is from a mixture of multivariate Gaussians.

zetai is from one single multivariate Gaussian.

The EM updates from Eqtn_2_2_14_UniqueSolution are given by

Eqtn_2_2_28d_EMUpdates


Legal Notice | Contact Certara
© Certara USA, Inc. All rights reserved.