Mathematical_Difference_of

Mathematical Difference of Methods

Mathematically, the difference between parametric and nonparametric method is the probability density function (PDF) of the parameters. For the parametric method, the PDF is a Gaussian, such as

Eqtn_1_1_3_PDF_Theta
(discussed in “Review of Laplacian-Approximation Formulation”)

and

Eqtn_2_2_7_Multvariate_PDF_Theta
(discussed in “Two-Stage Nonlinear Random Effects Mixture Model”)

For nonparametric methods, the PDF is simply the sum of Dirac delta functions (discussed below).

Exploring the formulas

The population analysis problem can be stated as follows:

Let

Y1Y2Yn be a sequence of independent but not necessarily identically distributed random vectors constructed from one or more observations from each of N_SUBsubjects in the population. The {} are observed.

theta12NSUB be a sequence of independent and identically distributed random vectors belonging to a compact subset ThetaCapital of Euclidean space with common but unknown distribution f. The { Thetai } are not observed.

Assume that the conditional densities pYibarbetathetai are known, for i = 1,..., n, where Beta is an unknown vector in set B.

Then, the probability of given Beta and f, can be stated as Eqtn_3_0_0_Prob_Yi .

Because of the independence of {}, the probability of {}, given Beta and f, is given by the likelihood

Eqtn_3_0_1_Likelihood_Yi

where phibetaf . In the context of “mixed-effects” problems, the vector Beta describes the fixed effects and { Thetai } describes the random effects.

Thus, the population analysis problem is to maximize the likelihood function Log_phi with respect to all parameters Beta in B and all density functions f on ThetaCapital .

The maximum likelihood problem, as stated above, is infinite dimensional. The theorems of Mallet [6] and Lindsay [5] reduce this problem to finite dimensions. It was proved under simple hypotheses on the conditional densities pYibarbetathetai , that for fixed Beta , the optimal density f could be found in the space of discrete densities with no more than N_SUB support points, where n is the number of subjects. This means K is a bounded value, making all the numerical computations feasible. Therefore, f can be written as a weighted sum of delta functions.

Eqtn_3_0_3_OptimalDensity_Theta

where

Eqtn_3_0_3a_Where .

Eqtn_3_0_3b_Sum .

The weights and the vector muk (also known as a support point [4]) are implicit functions of Beta .

The likelihood equation now becomes

Eqtn_3_0_4_Likelihood

where Eqtn_3_0_5_phi .

The quantity pYibarbetamuk can be taken as the (i, k) element of the N_SUB by the K likelihood matrix:

Eqtn_3_0_6_KLikelihoodMatrix

Typically, the log likelihood function for L(ϕ) is calculated as:

Eqtn_3_0_7_lnL

where

Eqtn_3_0_8_Ni

n_ik is the (i, k) element of the N_SUB by K likelihood matrix, nik_pYibarbetamuk .

In most cases, the -dimensional observation vector for the i^th individual Yi_Y1iYmii is sampled from a Gaussian distribution such as Eqtn_2_2_1_GaussianDist (discussed in “Two-Stage Nonlinear Random Effects Mixture Model”). Thus, pYibarbetamuk is simply the corresponding Gaussian.

The maximum likelihood approaches for parametric and nonparametric methods are the same. It is only due to the PDF of nonparametric approaches being the sum of delta functions, such as

Eqtn_3_0_3_OptimalDensity_Theta

instead of Gaussian that the corresponding log-likelihood function, such as

Eqtn_3_0_7_lnL

does not involve any integrals. Therefore, the calculation of log-likelihood is much faster than that in parametric methods. However, the time-consuming part for nonparametric methods, is to search for the , Beta , the support points muk , and the optimum number of support points K. Note that there is no covariance matrix OMEGA in nonparametric methods.