Bayesian Analysis | The multivariate Newton-Raphson algorithm

The Newton-Raphson algorithm can also be used to solve several equations simultaneously, say
$$g_k\left(x_1, \ldots, x_K\right)=0, \quad k=1, \ldots, K.$$
Let $x=\left(\begin{array}{c}x_1 \\ \vdots \\ x_K\end{array}\right)$, $g(x)=\left(\begin{array}{c}g_1(x) \\ \vdots \\ g_K(x)\end{array}\right)$ and $0=\left(\begin{array}{c}0 \\ \vdots \\ 0\end{array}\right)$ (each a column vector of length $K$).
Then the system of $K$ equations may be expressed as
$$g(x)=0,$$
and the NR algorithm involves iterating according to
$$x^{(j+1)}=x^{(j)}-g^{\prime}\left(x^{(j)}\right)^{-1} g\left(x^{(j)}\right),$$
where $x^{(j)}=\left(\begin{array}{c}x_1^{(j)} \\ \vdots \\ x_K^{(j)}\end{array}\right)$ is the value of $x$ at the $j$th iteration, and
$$
\begin{aligned}
& x^{(j+1)}=\left(\begin{array}{c}
x_1^{(j+1)} \\
\vdots \\
x_K^{(j+1)}
\end{array}\right), \quad g\left(x^{(j)}\right)=\left(\begin{array}{c}
g_1\left(x^{(j)}\right) \\
\vdots \\
g_K\left(x^{(j)}\right)
\end{array}\right)=\left[\left.\left(\begin{array}{c}
g_1(x) \\
\vdots \\
g_K(x)
\end{array}\right)\right|_{x=x^{(j)}}\right], \\
& g^{\prime}\left(x^{(j)}\right)=\left[\left.g^{\prime}(x)\right|_{x=x^{(j)}}\right], \\
& g^{\prime}(x)=\left(\begin{array}{c}
\partial g_1(x) / \partial x^T \\
\vdots \\
\partial g_K(x) / \partial x^T
\end{array}\right)=\left(\begin{array}{ccc}
\partial g_1(x) / \partial x_1 & \cdots & \partial g_1(x) / \partial x_K \\
\vdots & \ddots & \vdots \\
\partial g_K(x) / \partial x_1 & \cdots & \partial g_K(x) / \partial x_K
\end{array}\right).
\end{aligned}
$$
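The iteration can be sketched in a few lines of Python. This is a minimal illustration, not a reference implementation: the function names `solve_linear` and `newton_raphson`, the tolerance, and the two-equation test system ($x_1^2+x_2^2-4=0$, $x_1-x_2=0$) are all assumptions chosen for the example. Rather than forming the inverse $g^{\prime}(x^{(j)})^{-1}$ explicitly, the sketch solves the linear system $g^{\prime}(x^{(j)})\, d=g(x^{(j)})$ for the step $d$, which gives the same update.

```python
import math


def solve_linear(a, b):
    """Solve the K x K system a d = b by Gaussian elimination with partial pivoting."""
    k = len(b)
    # Augmented matrix [a | b], copied so the inputs are not modified.
    m = [list(a[i]) + [b[i]] for i in range(k)]
    for col in range(k):
        pivot = max(range(col, k), key=lambda r: abs(m[r][col]))
        if abs(m[pivot][col]) < 1e-14:
            raise ValueError("Jacobian is (numerically) singular")
        m[col], m[pivot] = m[pivot], m[col]
        for r in range(col + 1, k):
            factor = m[r][col] / m[col][col]
            for c in range(col, k + 1):
                m[r][c] -= factor * m[col][c]
    # Back substitution on the upper-triangular system.
    d = [0.0] * k
    for i in reversed(range(k)):
        s = sum(m[i][c] * d[c] for c in range(i + 1, k))
        d[i] = (m[i][k] - s) / m[i][i]
    return d


def newton_raphson(g, g_prime, x0, tol=1e-12, max_iter=50):
    """Iterate x <- x - g'(x)^{-1} g(x) until the step is smaller than tol."""
    x = list(x0)
    for _ in range(max_iter):
        step = solve_linear(g_prime(x), g(x))  # step = g'(x)^{-1} g(x)
        x = [xi - si for xi, si in zip(x, step)]
        if max(abs(si) for si in step) < tol:
            return x
    raise RuntimeError("Newton-Raphson did not converge")


# Hypothetical example system (K = 2), chosen only for illustration.
def g(x):
    x1, x2 = x
    return [x1 ** 2 + x2 ** 2 - 4.0, x1 - x2]


def g_prime(x):
    x1, x2 = x
    # Row k holds the partial derivatives of g_k with respect to x_1, x_2.
    return [[2.0 * x1, 2.0 * x2], [1.0, -1.0]]


root = newton_raphson(g, g_prime, [1.0, 2.0])
print(root)
```

Started from $(1, 2)$, the iterates approach the solution $x_1=x_2=\sqrt{2}$. For larger systems a library routine such as `numpy.linalg.solve` would normally replace the hand-written solver.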

Bayesian Analysis | The Expectation-Maximisation (EM) algorithm

We have shown how the Newton-Raphson algorithm for solving $g(x)=0$ numerically can be useful for finding the posterior median and the HPDR. That algorithm can also be used for finding the posterior mode, when this is the solution of
$$\frac{\partial f(\theta \mid y)}{\partial \theta}=0$$
or equivalently
$$\frac{\partial \log f(\theta \mid y)}{\partial \theta}=0.$$
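To make this concrete, take $g(\theta)=\partial \log f(\theta \mid y) / \partial \theta$ in the multivariate NR iteration of the previous section. Writing $\theta_j$ for the value of $\theta$ at the $j$th iteration (notation assumed here), the update becomes

$$\theta_{j+1}=\theta_j-\left[\left.\frac{\partial^2 \log f(\theta \mid y)}{\partial \theta\, \partial \theta^T}\right|_{\theta=\theta_j}\right]^{-1}\left[\left.\frac{\partial \log f(\theta \mid y)}{\partial \theta}\right|_{\theta=\theta_j}\right].$$

For a scalar $\theta$ the matrix inverse reduces to division by the second derivative of $\log f(\theta \mid y)$.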
In some situations, finding the posterior mode either analytically or via the NR algorithm may be problematic because the posterior density $f(\theta \mid y)$ has a very complicated form. In that case, one may consider applying the Expectation-Maximisation (EM) algorithm.
This algorithm first requires the specification (i.e. definition by the user) of some suitable latent data, which we will denote by $z$, and then the application of the following two steps iteratively until convergence.
Note: The choice of the latent data $z$ will depend on the particular application.
Step 1. The Expectation Step (E-Step)
Determine the $Q$-function, defined as
$$
\begin{aligned}
Q_j(\theta) & =E_z\left\{\log f(\theta \mid y, z) \mid y, \theta_j\right\} \\
& =\int \log f(\theta \mid y, z) f\left(z \mid y, \theta_j\right) d z,
\end{aligned} \tag{4.3}
$$
or, in words, as
the expectation of the log-augmented posterior density with respect to the distribution of the latent data given the observed data and current parameter estimates.
Step 2. The Maximisation Step (M-Step)
Find the value of $\theta$ which maximises the Q-function, for example using the Newton-Raphson algorithm.
This value becomes the current parameter estimate in the next iteration.
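The two steps can be illustrated on a small example, sketched below under stated assumptions. The model, the counts and the function name `em_posterior_mode` are not from the text; they are chosen only because both steps have closed forms. Suppose $y=(y_1, y_2, y_3, y_4)$ is multinomial with cell probabilities $\left(\tfrac{1}{2}+\tfrac{\theta}{4}, \tfrac{1-\theta}{4}, \tfrac{1-\theta}{4}, \tfrac{\theta}{4}\right)$ and $\theta$ has a flat prior on $(0,1)$. Take the latent data $z$ to be the number of the $y_1$ observations that fall in the $\theta / 4$ part of the first cell. Then $\log f(\theta \mid y, z)=\left(z+y_4\right) \log \theta+\left(y_2+y_3\right) \log (1-\theta)$ plus a constant, which is linear in $z$, so the E-Step only needs $E\left(z \mid y, \theta_j\right)=y_1 \theta_j /\left(2+\theta_j\right)$, and the M-Step maximiser is available in closed form.

```python
def em_posterior_mode(y, theta0=0.5, tol=1e-10, max_iter=1000):
    """EM iterations for the posterior mode of theta in the assumed multinomial model.

    y is a tuple of four cell counts; a flat prior on (0, 1) is assumed.
    Returns the final estimate and the number of iterations used.
    """
    y1, y2, y3, y4 = y
    theta = theta0
    for iteration in range(1, max_iter + 1):
        # E-Step: expected latent count given y and the current estimate theta_j.
        e_z = y1 * theta / (2.0 + theta)
        # M-Step: maximise Q_j(theta) = (e_z + y4) log(theta) + (y2 + y3) log(1 - theta).
        theta_new = (e_z + y4) / (e_z + y2 + y3 + y4)
        if abs(theta_new - theta) < tol:
            return theta_new, iteration
        theta = theta_new
    raise RuntimeError("EM did not converge")


# Illustrative counts (an assumption, not data from the text).
counts = (125, 18, 20, 34)
theta_hat, n_iter = em_posterior_mode(counts)
print(theta_hat, n_iter)
```

In this sketch the M-Step needs no numerical optimiser. When the maximiser of $Q_j(\theta)$ has no closed form, that line would be replaced by an inner NR loop.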
Note 1: For mathematical convenience, the Q-function may also be defined as in (4.3) but with constants that do not depend on the parameter $\theta$ added to it and/or multiplying it. This extended definition allows us to ignore terms which have no impact on the final results. If (4.3) is multiplied by a negative constant, the resulting Q-function should be minimised at Step 2 rather than maximised.
Note 2: If there is a choice between using the NR algorithm or the EM algorithm, one should consider the fact that the EM algorithm is typically slower to converge but far more stable. In fact, under certain regularity conditions, the EM algorithm is guaranteed to move closer to the required solution at each iteration. By contrast, the NR algorithm may not converge at all if started at a value far away from the required solution. Thus, one plausible strategy is to use the EM algorithm to obtain an approximate solution which is sufficiently close to the correct answer, and then to obtain a very high precision using just a few iterations of the NR algorithm.
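The strategy in Note 2 might look as follows in Python. This is a hedged sketch on an assumed example, not a general recipe: the counts, the multinomial model with cell probabilities $\left(\tfrac{1}{2}+\tfrac{\theta}{4}, \tfrac{1-\theta}{4}, \tfrac{1-\theta}{4}, \tfrac{\theta}{4}\right)$, the flat prior, and the function names are all illustrative choices. For that model the log posterior is $y_1 \log (2+\theta)+\left(y_2+y_3\right) \log (1-\theta)+y_4 \log \theta$ plus a constant, and one EM iteration has a closed form.

```python
# Illustrative counts for the assumed four-cell multinomial model.
y1, y2, y3, y4 = 125, 18, 20, 34


def em_update(theta):
    """One EM iteration: closed-form E-Step followed by closed-form M-Step."""
    e_z = y1 * theta / (2.0 + theta)  # expected latent count
    return (e_z + y4) / (e_z + y2 + y3 + y4)


def score(theta):
    """First derivative of the log posterior density (flat prior assumed)."""
    return y1 / (2.0 + theta) - (y2 + y3) / (1.0 - theta) + y4 / theta


def score_derivative(theta):
    """Second derivative of the log posterior density."""
    return (-y1 / (2.0 + theta) ** 2
            - (y2 + y3) / (1.0 - theta) ** 2
            - y4 / theta ** 2)


theta = 0.01  # deliberately poor starting value

# Stage 1: a few EM iterations to get near the mode.
for _ in range(5):
    theta = em_update(theta)
theta_em = theta

# Stage 2: a few NR iterations on score(theta) = 0 for high precision.
for _ in range(4):
    theta = theta - score(theta) / score_derivative(theta)
theta_nr = theta

print(theta_em, theta_nr, score(theta_nr))
```

The numbers of EM and NR iterations (five and four) are arbitrary here; in practice each stage would stop on a convergence criterion.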




