# Linear Regression

Given a set of points, find a line that fits the data best! Even this simple form of regression allows us to predict future points.

The size of each step that gradient descent takes is called the *learning rate*. Choosing an adequate learning rate is key to achieving convergence. If the value is too large, the algorithm can overshoot on every step and never reach the optimum; if it is too small, it will take too long to get there.
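
To make the effect of the step size concrete, here is a minimal sketch of batch gradient descent fitting a line `y = w*x + b` by minimizing mean squared error. The function names (`fit_line`, `mse`), the toy data, and the three step sizes are assumptions chosen for illustration, not taken from any particular implementation.

```python
def mse(xs, ys, w, b):
    """Mean squared error of the line y = w*x + b on the given points."""
    n = len(xs)
    return sum((w * x + b - y) ** 2 for x, y in zip(xs, ys)) / n


def fit_line(xs, ys, learning_rate, steps=1000):
    """Fit y = w*x + b with batch gradient descent (illustrative sketch)."""
    w, b = 0.0, 0.0  # start from a flat line through the origin
    n = len(xs)
    for _ in range(steps):
        # Residuals of the current line on every point.
        errors = [w * x + b - y for x, y in zip(xs, ys)]
        # Partial derivatives of the mean squared error w.r.t. w and b.
        grad_w = (2.0 / n) * sum(e * x for e, x in zip(errors, xs))
        grad_b = (2.0 / n) * sum(errors)
        # Move against the gradient; learning_rate scales the step.
        w -= learning_rate * grad_w
        b -= learning_rate * grad_b
    return w, b


# Toy data lying exactly on y = 2x + 1 (an assumption for this example).
xs = [0.0, 1.0, 2.0, 3.0, 4.0]
ys = [1.0, 3.0, 5.0, 7.0, 9.0]

for lr, steps in [(0.05, 1000), (0.5, 20), (1e-5, 1000)]:
    w, b = fit_line(xs, ys, learning_rate=lr, steps=steps)
    print(f"lr={lr}: w={w:.4g}, b={b:.4g}, mse={mse(xs, ys, w, b):.4g}")
```

On this toy data, the moderate step size should settle close to `w = 2`, `b = 1`, the large one should make the error grow with every step, and the tiny one should barely move away from the starting point in the same number of iterations. Which values count as "too large" or "too small" depends on the scale of the data, so the learning rate usually has to be tuned for each problem.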

