Copyright © Philip M. Parker, INSEAD. Terms of Use.

Synonym: Linear RegressionSynonym: rectilinear regression (n). (additional references) |
(From Wikipedia, the free Encyclopedia)
The notion of an independent variable often (but not always) implies the ability to choose the levels of the independent variable and that the dependent variable will respond naturally as in the stimulus-response model. The independent variable x may be a scalar or a vector. In the former case we may write one of the simplest linear-regression models as follows:
Historically, in applications to measurements in astronomy, the "error" was actually a random measurement error, but in many applications, ε is merely the amount by which the individual -value differs from the average -value among individuals having the same -value. The average value of the random "error" is zero. Often in linear regression problems statisticians rely on the Gauss-Markov assumptions:
Sometimes stronger assumptions are relied on:
It is often erroneously thought that the reason the technique is called "linear regression" is that the graph of is a line. But in fact, if the model is
A statistician will usually estimate the unobservable values of the parameters α and β by the method of least squares, which consists of finding the values of and that minimize the sum of squares of the residuals
Notice that, whereas the errors are independent, the residuals cannot be independent because the use of least-squares estimates implies that the sum of the residuals must be 0, and the dot-product of the vector of residuals with the vector of -values must be 0, i.e., we must have
These facts make it possible to use Student's t-distribution with degrees of freedom (so named in honor of the pseudonymous "Student") to find confidence intervals for and .
Denote by capital Y the column vector whose ith entry is yi, and by capital X the n x 2 matrix whose second column contains the xi as its ith entry, and whose first column contains n 1s. Let ε be the column vector containing the errors εi. Let δ and d be respectively the 2x1 column vector containing α and β and the 2x1 column vector containing the estimates a and b. Then the model can be written as
Then it can be shown that
The matrix In - X (X' X)-1 X' that appears above is a symmetric idempotent matrix of rank n - 2. Here is an example of the use of that fact in the theory of linear regression. The finite-dimensional spectral theorem of linear algebra says that any real symmetric matrix M can be diagonalized by an orthogonal matrix G, i.e., the matrix G'MG is a diagonal matrix. If the matrix M is also idempotent, then the diagonal entries in G'MG must be idempotent numbers. Only two real numbers are idempotent: 0 and 1. So In-X(X'X)-1X', after diagonalization, has n-2 0s and two 1s on the diagonal. That is most of the work in showing that the sum of squares of residuals has a chi-square distribution with n-2 degrees of freedom.
The first method of displaying the residuals use the histogram or cumulative distribution to depict the similarity (or lack thereof) to a normal distribution. Non-normality suggests that the model may not be a good summary description of the data.
We plot the residuals, against the independent variable, X. There should be no discernible trend or pattern if the model is satisfactory for this data. Some of the possible problems are:
The correlation coefficient, r, can be calculated by
Source: adapted by the editor from Wikipedia, the free encyclopedia under a copyleft GNU Free Documentation License (GFDL) from the article "Linear regression."
Crosswords: Linear Regression |
| English words defined with "linear regression": regression coefficient, regression curve, regression line. (references) |
| Specialty definitions using "linear regression": Linear Models ♦ multicollinearity. (references) |
| Domain | Title |
Books |
|
Source: compiled by the editor from various references; see credits. | |
| The following statistics estimate the number of searches per day across the major English-language search engines as identified by various trade publications. Hyperlinks lead to commercial use of the expression at Amazon.com. |
| Language | Translations for "linear regression"; alternative meanings/domain in parentheses. | ||||||||||||||||||||||
Danish | lineaer regression, lineær regression. (various references) | ||||||||||||||||||||||
Dutch | lineaire regressie. (various references) | ||||||||||||||||||||||
Finnish | lineaarinen regressio. (various references) | ||||||||||||||||||||||
French | régression linéaire. (various references) | ||||||||||||||||||||||
German | lineare Regression, lineare Progression. (various references) | ||||||||||||||||||||||
Greek | γραμμική παλινδρόμηση. (various references) | ||||||||||||||||||||||
Italian | regressione lineare. (various references) | ||||||||||||||||||||||
Pig Latin | inearlay egressionray regressão linear. (various references) regresión lineal. (various references) linjär regression. (various references) | ||||||||||||||||||||||
Misspellings | |
"Linear Regression" is suggested in spellcheckers for the following: linear reggression. (additional references) | |
| Source: compiled by the editor, based on several corpora (additional references). | |
Scrabble® Enable2K-Verified Anagrams | |
| Words within the letters "a-e-e-e-g-i-i-l-n-n-o-r-r-r-s-s" | |
-4 letters: legionnaires. | |
-5 letters: generalises, legionaries, legionnaire, rereleasing, reseasoning. | |
| Source: compiled by the editor from various references; see credits. SCRABBLE® is a registered trademark. All intellectual property rights in and to the game are owned in the U.S.A and Canada by Hasbro Inc., and throughout the rest of the world by J.W. Spear & Sons Limited of Maidenhead, Berkshire, England, a subsidiary of Mattel Inc. Mattel and Spear are not affiliated with Hasbro. | |
Hexadecimal (or equivalents, 770AD-1900s) (references)4C 69 6E 65 61 72      52 65 67 72 65 73 73 69 6F 6E |
| Leonardo da Vinci (1452-1519; backwards) (references)
|
Binary Code (1918-1938, probably earlier) (references)01001100 01101001 01101110 01100101 01100001 01110010 00100000 01010010 01100101 01100111 01110010 01100101 01110011 01110011 01101001 01101111 01101110 |
HTML Code (1990) (references)L i n e a r   R e g r e s s i o n |
ISO 10646 (1991-1993) (references)004C 0069 006E 0065 0061 0072      0052 0065 0067 0072 0065 0073 0073 0069 006F 006E |
Encryption (beginner's substitution cypher): (references)467580716784252717384718585758180 |
| Amazon.com BOOKS: Search for: "linear regression" |