28. Derive equation (8.100).
Solution: By the definition
Jn(θ) = 1
n
n
X
k=1
L(yk,xk,θ).
29. Consider the online version of PDMb in (8.64), i.e.,
θn=(PCθn−1−µn
J(θn−1)
||J′(θn−1)||2J′(θn−1),If J′(θn−1)6=0,
PC(θn−1),If J′(θn−1) = 0,(26)
where we have assumed that J∗= 0. If this is not the case, a shift
can accommodate for the difference. Thus we assume that we know the
minimum. For example, this is the case for a number tasks, such as the
hinge loss function, assuming linearly separable classes, or the linear ǫ–
insensitive loss function, for bounded noise. Assume that
Ln(θ) =
n
X
k=n−q+1
ωkdCk(θn−1)
Pn
k=n−q+1 ωkdCk(θn−1)dCk(θ)
Then derive that APSM algorithm of (8.39).
Solution: Let the loss function be
Ln(θ) =
n
X
k=n−q+1
ωkdCk(θn−1)
Pn
k=n−q+1 ωkdCk(θn−1)dCk(θ) =
n
X
k=n−q+1
βkdCk(θ),