Logistic Regression is a classification algorithm: its target / output takes values from a discrete set.

Binary Classification

In binary classification, the output y has only two possible values, usually 0 and 1.

  • 0: the negative case, the absence of the observed outcome
  • 1: the positive case, the presence of the observed outcome

The logistic function used for binary classification is called the Sigmoid Function. Its characteristics include:

  • x ranges from -∞ to +∞
  • y ranges from 0 to 1
  • ‘S’ shaped

(figure: sigmoid curve)
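The properties above can be checked numerically. A minimal sketch in plain Python (the function name `sigmoid` is my choice, not from the notes):

```python
import math

def sigmoid(z):
    """Logistic (sigmoid) function: maps any real z into (0, 1)."""
    return 1.0 / (1.0 + math.exp(-z))

# The 'S' shape: outputs rise smoothly from near 0 to near 1,
# passing through 0.5 at z = 0.
for z in (-10, -1, 0, 1, 10):
    print(z, round(sigmoid(z), 4))
```

Note that the output never actually reaches 0 or 1; it only approaches them asymptotically.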

Logistic Function Equation

hθ(x) = g(θᵀx) = 1 / (1 + e^(−θᵀx)), where g(z) = 1 / (1 + e^(−z))

hθ(x) = P(y=1|x;θ) = 1 − P(y=0|x;θ); in other words, hθ(x) gives us the probability that the output is 1.

P(y=1|x;θ) + P(y=0|x;θ) = 1

Translating probability into discrete values

Assume this is how we translate the probability into 0 and 1:

  • y = 1 if hθ(x) >= 0.5
  • y = 0 if hθ(x) < 0.5

Since hθ(x) = g(θᵀx), and g(z) >= 0.5 exactly when z >= 0, hθ(x) >= 0.5 means θᵀx >= 0, thus:

  • y = 1 if θTx >= 0
  • y = 0 if θTx < 0
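This equivalence means we can classify by checking the sign of θᵀx directly, without computing the sigmoid. A sketch (function names `h` and `predict` are my choosing):

```python
import math

def h(theta, x):
    """Hypothesis h_theta(x) = sigmoid(theta^T x)."""
    z = sum(t * xi for t, xi in zip(theta, x))
    return 1.0 / (1.0 + math.exp(-z))

def predict(theta, x):
    """y = 1 iff theta^T x >= 0, which is equivalent to h_theta(x) >= 0.5."""
    z = sum(t * xi for t, xi in zip(theta, x))
    return 1 if z >= 0 else 0
```

Checking the sign of θᵀx is cheaper than evaluating the exponential, and it makes the linear decision boundary explicit.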

Solving the equation θᵀx = 0 gives the Decision Boundary.

Decision Boundary is not a property of the data set, but a property of the hypothesis and parameters.
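To make this concrete, here is a toy parameter choice (mine, not from the notes): with features (1, x1, x2) and θ = (−3, 1, 1), the boundary θᵀx = 0 is the line x1 + x2 = 3.

```python
# Toy parameters: theta = (-3, 1, 1) over features (1, x1, x2).
# theta^T x = -3 + x1 + x2, so the decision boundary is the line x1 + x2 = 3.
theta = (-3.0, 1.0, 1.0)

def predict(x1, x2):
    z = theta[0] + theta[1] * x1 + theta[2] * x2
    return 1 if z >= 0 else 0

print(predict(2, 2))  # x1 + x2 = 4 >= 3 -> 1
print(predict(1, 1))  # x1 + x2 = 2 <  3 -> 0
```

Changing θ moves the line: the boundary is determined entirely by the hypothesis and its parameters, not by the training points themselves.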

Cost Function

If we plug the sigmoid hypothesis into the squared-error cost, the resulting cost function is non-convex, so gradient descent is not guaranteed to reach the global minimum. Using a log-based cost makes the cost function convex and solves the problem.

(figure: logistic regression cost function)
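Written out, the standard per-example cost and the overall cost function J(θ) from the course are:

```latex
\mathrm{Cost}\big(h_\theta(x), y\big) =
\begin{cases}
  -\log\big(h_\theta(x)\big)     & \text{if } y = 1,\\
  -\log\big(1 - h_\theta(x)\big) & \text{if } y = 0,
\end{cases}
\qquad
J(\theta) = -\frac{1}{m}\sum_{i=1}^{m}\Big[\,y^{(i)}\log h_\theta\big(x^{(i)}\big)
  + \big(1-y^{(i)}\big)\log\big(1-h_\theta\big(x^{(i)}\big)\big)\Big]
```

The two cases collapse into the single sum in J(θ) because exactly one of the terms y⁽ⁱ⁾ and (1 − y⁽ⁱ⁾) is nonzero for each example.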

Intuition

If we predict the outcome correctly, the cost will be nearly 0.

If we predict it wrongly, the cost will be close to infinity.

(figures: cost curve for y=1 (logistic-cost-pos) and for y=0 (logistic-cost-neg))
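This intuition can be verified directly with the log cost (the helper name `cost` is my choosing):

```python
import math

def cost(h, y):
    """Per-example logistic cost: -log(h) if y == 1, -log(1 - h) if y == 0."""
    return -math.log(h) if y == 1 else -math.log(1.0 - h)

# Confident, correct prediction (h near the true label) -> cost near 0.
print(cost(0.99, 1))
# Confident, wrong prediction -> cost grows toward infinity as h -> 0.
print(cost(0.01, 1))
```

As h approaches 0 while the true label is 1 (or vice versa), −log drives the cost without bound, which is exactly the "close to infinity" penalty described above.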

Gradient Descent for the Cost Function of Logistic Regression

(figure: gradient descent update rule for logistic regression)
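The update rule shown in the course is θⱼ := θⱼ − (α/m) Σᵢ (hθ(x⁽ⁱ⁾) − y⁽ⁱ⁾) xⱼ⁽ⁱ⁾, applied simultaneously for all j. A minimal batch sketch in plain Python (the tiny dataset and hyperparameters below are made up for illustration):

```python
import math

def sigmoid(z):
    return 1.0 / (1.0 + math.exp(-z))

def gradient_descent(X, y, alpha=0.1, iters=1000):
    """Batch gradient descent for logistic regression.

    Update: theta_j := theta_j - (alpha/m) * sum_i (h(x_i) - y_i) * x_ij
    """
    m, n = len(X), len(X[0])
    theta = [0.0] * n
    for _ in range(iters):
        grad = [0.0] * n
        for xi, yi in zip(X, y):
            h = sigmoid(sum(t * v for t, v in zip(theta, xi)))
            for j in range(n):
                grad[j] += (h - yi) * xi[j]
        # Simultaneous update of every theta_j.
        theta = [t - alpha * g / m for t, g in zip(theta, grad)]
    return theta

# Tiny made-up 1-D dataset; the first feature is the bias term (always 1).
X = [[1.0, 0.0], [1.0, 1.0], [1.0, 3.0], [1.0, 4.0]]
y = [0, 0, 1, 1]
theta = gradient_descent(X, y)
```

The update has the same form as linear regression's, but hθ is the sigmoid of θᵀx rather than θᵀx itself.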

Reference: Machine Learning by Andrew Ng