Chain Rule
The chain rule can be pictured as a physical chain, in which small objects are linked together one after another.
Mathematically, if one property is related to a second property, and that second property is in turn related to a third, then there is also a relation between the first and the third.
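The statement above can be written as a formula. This is only a sketch: the symbols x, y and z are placeholders chosen here (they do not appear in the original notes), with y depending on x and z depending on y.

```latex
\frac{dz}{dx} = \frac{dz}{dy} \cdot \frac{dy}{dx}
```

In words: the rate at which the last link changes with respect to the first is the product of the rates of the individual links.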
The images below illustrate the concept of the chain rule:
Things to know before studying the chain rule in maths are:
a. Derivative
b. Slope
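Since the derivative is the slope of a curve at a point, the chain rule can be checked numerically. The sketch below is illustrative only: the two functions `y_of_x` and `z_of_y`, the point `x0` and the helper `slope` are made-up examples, not taken from the referenced video.

```python
def slope(f, x, h=1e-6):
    """Central-difference estimate of the derivative (slope) of f at x."""
    return (f(x + h) - f(x - h)) / (2 * h)

# Hypothetical two-link chain: x -> y -> z
def y_of_x(x):
    return 3 * x + 1      # exact slope dy/dx = 3

def z_of_y(y):
    return y ** 2         # exact slope dz/dy = 2 * y

def z_of_x(x):
    return z_of_y(y_of_x(x))

x0 = 2.0
# Slope of the whole chain, measured directly.
direct = slope(z_of_x, x0)
# Slope of the whole chain, built by multiplying the slopes of the links.
chained = slope(z_of_y, y_of_x(x0)) * slope(y_of_x, x0)
print(direct, chained)
```

Both numbers should agree up to rounding error, which is what the chain rule predicts.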
Finding the best-fit model
Step 1:
For a particular model (linear regression in this case, with weight on the x-axis and height on the y-axis), plot each residual against the intercept.
Step 2:
Plot the squared residuals against the residuals.
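The values behind the plots in Steps 1 and 2 could be computed as in the sketch below. Everything specific here is an assumption made for illustration: the three (weight, height) data points are invented, the slope of the line is held fixed at 1, and the helper name `residuals` is hypothetical.

```python
# Made-up (weight, height) observations, for illustration only.
weights = [1.0, 2.0, 3.0]
heights = [1.5, 2.0, 3.5]
SLOPE = 1.0  # assumed fixed, so only the intercept varies

def residuals(intercept):
    """Observed height minus predicted height for each data point."""
    return [h - (intercept + SLOPE * w) for w, h in zip(weights, heights)]

# Step 1 data: residuals for several candidate intercepts.
# Step 2 data: the square of each of those residuals.
for intercept in [-1.0, 0.0, 1.0]:
    res = residuals(intercept)
    squared = [r ** 2 for r in res]
    print(intercept, res, squared)
```

Plotting `res` against `intercept` gives the Step 1 picture, and plotting `squared` against `res` gives the Step 2 picture.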
Step 3:
Find the relation. The relation we get is:
Weight -> Height -> Intercept (the y-axis intercept, i.e. the predicted height at zero weight) -> Residuals -> Squared residuals
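The last two links of this chain can be written as formulas. This is a sketch under the usual linear-regression definition of a residual (observed minus predicted value); the name "slope" for the line's slope is an assumption, since the notes do not name it.

```latex
\text{Residual} = \text{Height}_{\text{observed}} - \left(\text{Intercept} + \text{slope} \cdot \text{Weight}\right)

\text{Squared residual} = \left(\text{Residual}\right)^{2}
```

The intercept determines the residual, and the residual determines the squared residual, so the squared residual depends on the intercept through the residual.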
Step 4:
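Set the derivative of the squared residual with respect to the intercept to zero. By the chain rule, this derivative is the derivative of the squared residual with respect to the residual, multiplied by the derivative of the residual with respect to the intercept. The intercept at which it equals zero minimizes the squared residual, which means the least error.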
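Step 4 might be sketched in code as follows. The data points are invented, the slope is assumed fixed at 1, the squared residuals are summed over all points (a common choice that the notes do not state explicitly), and the function name `d_ssr_d_intercept` is hypothetical.

```python
# Made-up (weight, height) observations, for illustration only.
weights = [1.0, 2.0, 3.0]
heights = [1.5, 2.0, 3.5]
SLOPE = 1.0  # assumed fixed, so only the intercept varies

def d_ssr_d_intercept(intercept):
    """Derivative of the sum of squared residuals w.r.t. the intercept,
    assembled link by link with the chain rule."""
    total = 0.0
    for w, h in zip(weights, heights):
        residual = h - (intercept + SLOPE * w)
        d_sq_d_res = 2 * residual   # d(residual^2) / d(residual)
        d_res_d_int = -1.0          # d(residual) / d(intercept)
        total += d_sq_d_res * d_res_d_int
    return total

# The derivative is linear in the intercept, so setting it to zero can be
# solved directly: intercept = mean(height - SLOPE * weight).
best = sum(h - SLOPE * w for w, h in zip(weights, heights)) / len(weights)
print(best, d_ssr_d_intercept(best))
```

At `best` the derivative is zero up to rounding error; for smaller intercepts it is negative and for larger ones positive, so this point is a minimum of the summed squared residuals.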
References
- https://www.youtube.com/watch?v=wl1myxrtQHQ&list=PLblh5JKOoLUICTaGLRoHQDuF_7q2GfuJF&index=55 | The Chain Rule, Clearly Explained!!! | StatQuest with Josh Starmer