Theory¶
In the problem of fitting a theoretical model \(f\left(t_m, \vec{\theta}\right)\) to the \(M\) experimentally determined data points \(y_m\) at times \(t_m\), by assuming that the experimental errors for the data points are independent and Gaussian distributed with standard deviation of \(\sigma\), the probability that a given model produced the observed data points is
The likelihood function of this model, \(L\left(\vec{\theta}\middle|\vec{y}\right)\), is the probability of the occurrence of the outcomes \(\vec{y}\) given a set of parameters \(\vec{\theta}\) of the model, \(P\left(\vec{y}\middle|\vec{\theta}\right)\). Using the equation above, we can write
where \(C\left(\vec{\theta}\right)\) is the cost function, given by
Suppose the model \(f\left(t_m, \vec{\theta}\right)\) has \(N\) parameters, written as \(\{ \theta_1, \cdots, \theta_N \}\). The profile likelihood of the model for parameter \(\theta_j\) is the possible maximum likelihood given the parameter \(\theta_j\). The profile likelihood for parameter \(\theta_j\) is calculated by setting \(\theta_j\) to a fixed value, then maximizing the likelihood function (by minimizing the cost function) over the other parameters of the model. We repeat this computation across a range of \(\theta_j\), \(\left(\theta_j^{\min}, \theta_j^{\max}\right)\).
Computation process¶
Assuming that the best-fit parameter is the maximum of the likelihood function, the computation starts at the best-fit parameter. Then the profile likelihood computation will continue for the parameters to the left of the best-fit. After this is done, the computation returns to the best-fit and continue for the parameters to the right. This process is done for each parameter.