Univariate Regression - Derivation of Equations

Fundamentals of Statistics contains material of various lectures and courses of H. Lohninger on statistics, data analysis and chemometrics......click here for more.

Home Bivariate Data Regression Derivation of a Univariate Regression Formula	Index
See also: regression, Curvilinear Regression, Regression - Confidence Interval, Regression after Linearisation
Univariate Regression - Derivation of Equations The principle of this derivation is quite simple: the least squares regression curve is one that minimizes the sum of squared differences between the estimated and the actual y values for given x values (residuals). Therefore you first have to define the equation of the sum of squares, calculate the partial derivatives (with respect to each parameter), and equate them to zero. The rest is just plain algebra to obtain an expression for the parameters. Let us conduct this procedure for a particular example: y = ax + bx² This formula is to be estimated from a series of data points [x_i,y_i], where the x_i are the independent values, and the y_i are to be estimated. By substituting the y_i values with their estimates ax_i+bx_i² we obtain the following series of data points: [xi, ax_i+bx_i²]. The actual values of the y values are, however, the y_i. Thus the sum of squared errors S for n data points is defined by S = (ax₁+bx₁²-y₁)² + (ax₂+bx₂²-y₂)² + (ax₃+bx₃²-y₃)² + ...... + (ax_n+bx_n²-y_n)² Now we have to calculate the partial derivatives with respect to the parameters a and b, and equate them to zero: dS/da = 0 = 2(ax₁+bx₁²-y₁)x₁+ 2(ax₂+bx₂²-y₂)x₂ + 2(ax₃+bx₃²-y₃)x₃ + ...... + 2(ax_n+bx_n²-y_n)x_n dS/db = 0 = 2(ax₁+bx₁²-y₁)x₁²+ 2(ax₂+bx₂²-y₂)x₂² + 2(ax₃+bx₃²-y₃)x₃² + ...... + 2(ax_n+bx_n²-y_n)x_n² These two equations can easily be reduced by introducing the sums of the individual terms: Now, solve these equations for the coefficients a and b: And then substitute the expressions for a and b into their counterparts, with the following final results:
Home Bivariate Data Regression Derivation of a Univariate Regression Formula