The Gradient Descent algorithm is used not only for minimizing the cost function of linear regression but also for minimizing many other functions.
Algorithm Outline Steps:
Step 1: Start with some Θ0, Θ1.
Step 2: Keep changing Θ0, Θ1 to reduce J(Θ0, Θ1) until the algorithm ends up at a minimum.
Algorithm:
Repeat until convergence
{
    Θj := Θj - α · ∂/∂Θj J(Θ0, Θ1)    (simultaneously for j = 0 and j = 1)
}
Here α is the learning rate.
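As a rough sketch, this is what one batch gradient descent loop for linear regression (hypothesis h(x) = Θ0 + Θ1·x, squared-error cost J) can look like in Python; the toy data, starting point, and fixed iteration count here are assumptions for illustration, not part of the original algorithm:

def gradient_descent(x, y, alpha=0.01, num_iters=1000):
    m = len(x)                    # number of training examples
    theta0, theta1 = 0.0, 0.0     # Step 1: start with some theta0, theta1
    for _ in range(num_iters):    # Step 2: keep changing them to reduce J
        # Partial derivatives of J(theta0, theta1), each summed over
        # ALL m training examples (this is what makes it "batch").
        grad0 = sum((theta0 + theta1 * x[i]) - y[i] for i in range(m)) / m
        grad1 = sum(((theta0 + theta1 * x[i]) - y[i]) * x[i] for i in range(m)) / m
        # Simultaneous update: both gradients are computed before
        # either parameter changes.
        theta0 = theta0 - alpha * grad0
        theta1 = theta1 - alpha * grad1
    return theta0, theta1

# Toy usage: points that roughly follow y = 2x.
x = [1.0, 2.0, 3.0, 4.0]
y = [2.1, 3.9, 6.2, 8.0]
print(gradient_descent(x, y))   # theta1 should come out near 2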
How Θ0, Θ1 are updated is shown below.
The correct way to update simultaneously is:
    temp0 := Θ0 - α · ∂/∂Θ0 J(Θ0, Θ1)
    temp1 := Θ1 - α · ∂/∂Θ1 J(Θ0, Θ1)
    Θ0 := temp0
    Θ1 := temp1
The incorrect way (not simultaneous, because the already-updated Θ0 is used when computing Θ1's update) is:
    temp0 := Θ0 - α · ∂/∂Θ0 J(Θ0, Θ1)
    Θ0 := temp0
    temp1 := Θ1 - α · ∂/∂Θ1 J(Θ0, Θ1)
    Θ1 := temp1
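To make the difference concrete, here is a one-step comparison in Python on an assumed toy cost J(Θ0, Θ1) = (Θ0 + Θ1 - 2)², chosen only so the two orderings give visibly different results:

def dJ_dtheta0(t0, t1):
    return 2 * (t0 + t1 - 2)    # partial derivative w.r.t. theta0

def dJ_dtheta1(t0, t1):
    return 2 * (t0 + t1 - 2)    # partial derivative w.r.t. theta1

alpha = 0.1

# Correct: both gradients are evaluated at the OLD (theta0, theta1).
theta0, theta1 = 0.0, 0.0
temp0 = theta0 - alpha * dJ_dtheta0(theta0, theta1)
temp1 = theta1 - alpha * dJ_dtheta1(theta0, theta1)
theta0, theta1 = temp0, temp1
print(theta0, theta1)           # 0.4 0.4

# Incorrect: theta0 is overwritten first, so theta1's gradient is
# evaluated at the NEW theta0 instead of the old one.
theta0, theta1 = 0.0, 0.0
theta0 = theta0 - alpha * dJ_dtheta0(theta0, theta1)
theta1 = theta1 - alpha * dJ_dtheta1(theta0, theta1)
print(theta0, theta1)           # 0.4 0.32 -- lands at a different point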
How Θ1 is updated when the slope is positive/negative:
If the slope (derivative) at Θ1 is positive, then Θ1 = Θ1 - α · (+ve number), so Θ1 decreases and moves left toward the minimum.
If the slope at Θ1 is negative, then Θ1 = Θ1 - α · (-ve number), so Θ1 increases and moves right toward the minimum.
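A quick numeric check of both cases, using an assumed toy cost J(Θ1) = Θ1², whose minimum is at Θ1 = 0 and whose slope is 2·Θ1:

alpha = 0.1

# Start to the RIGHT of the minimum: the slope 2 * theta1 is positive,
# so subtracting alpha * (+ve number) moves theta1 left, toward 0.
theta1 = 3.0
theta1 = theta1 - alpha * (2 * theta1)
print(theta1)    # 2.4 -- closer to the minimum

# Start to the LEFT of the minimum: the slope is negative,
# so subtracting alpha * (-ve number) moves theta1 right, toward 0.
theta1 = -3.0
theta1 = theta1 - alpha * (2 * theta1)
print(theta1)    # -2.4 -- also closer to the minimum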
The learning rate α controls how big a step gradient descent takes when minimizing the cost function.
Keep two things in mind when choosing the learning rate α:
1) If α is too small, gradient descent can be very slow.
2) If α is too large, gradient descent can overshoot the minimum; it may fail to converge, or even diverge. Both behaviours are illustrated in the sketch below.
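Both effects can be seen on the same assumed toy cost J(Θ1) = Θ1² (slope 2·Θ1); the specific α values below are illustrative only:

def run(alpha, steps=10, theta1=1.0):
    # Repeatedly apply theta1 := theta1 - alpha * slope.
    for _ in range(steps):
        theta1 = theta1 - alpha * (2 * theta1)
    return theta1

print(run(alpha=0.001))   # too small: ~0.98, barely moved (very slow)
print(run(alpha=0.4))     # reasonable: ~1e-7, essentially at the minimum
print(run(alpha=1.1))     # too large: ~6.19, overshoots and diverges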
The Gradient Descent algorithm is also called the Batch Gradient Descent algorithm, because at each step it uses all of the training examples (note how the gradients in the first sketch above sum over all m examples).