Given any polynomial
equation
a
0 + a
1X + a
2X
2
+ ... a
n-1X
n-1 + a
nX
n
= 0, where a
n is
not 0, we can divide through by a
n to
get a polynomial in the form
p(x) = a
0 + a
1X + a
2X
2
+ ... a
n-1X
n-1
+ X
n = 0. We can
therefore without loss of generality restrict ourselves to polynmials
with leading coefficient equal to 1.
The polynomial can be factored in terms of its roots to get
(X - r
1)(X - r
2)...(X - r
n)=
0 so
(X - r
1)(X - r
2)...(X - r
n)
= a
0 + a
1X + a
2X
2
+ ... a
n-1X
n-1
+ Xn
Equating the coefficients on both sides we get:
Sum(r
i) = (r
1 + r
2
+ ... r
n) = a
n-1
Sum(r
ir
j) = a
n-2
By this is meant that to find the coefficient of X
n-2
on the left side, find all of the ways of choosing n-2
roots and two X factors from the terms in parentheses. If,
for example, n was equal to 3 we would have a
1 =
(r
1r
2 + r
1r
3
+ r
2r
3).
Similarly, Sum(r
ir
jr
k)
= a
n-3.
If we continue in this way, the left side terms will include all the
possible expressions of the form
Sum(r
1r
2...r
k)
where k ≤ n and each of the r
i are
distinct. These expressions are polynomials in the r
i
and are in fact symmetric polynomials. They are referred to as
elementary symmetric polynomials. Each one equates to one of the
coefficients of p(x). Therefore showing that a every symmetric polynomial in the r
i
is a polynomial of the coefficients is equivalent to showing that every
symmetric polynomial can be expressed as a polynomial of the
elementary symmetic polynomials. The exercise showed that r
12
+ r
22 =
(r
1 + r
2 )
2
- 2r
1r
2, which we noww see as a polynomial in the two elementary symmetric polynomials (r
1 + r
2 ) and r
1r
2.
Proof Outline
In computing r
12
+ r
22, a first approach is to use
(r
1 + r
2 )
2 to compute the squared terms. We are left with the symmetric polynomial 2r
1r
2,
which we can view as a simpler problem to solve since it does not
contain any square terms. In this case what we are left with is
just twice the elementary symmetric polynomial r
1r
2.
We can handle the general case in the same way - at each step
eliminate the highest order term with any additional terms created
being of lower order. In order to do this we need a way of
specifying what is meant by higher order terms. We start by
writing the terms such that their exponents are in descending order.
Because the polynomials are symmetric we can always select a term
listed with r
1 first, r
2 second and so on. Consider the term r
110r
28r
36. This term will be greater than any term whose leading exponent is less than 10 like r
19r
28r
37r
46.
It will also be greater than any term whose leading exponent is
10 but whose second order exponent is either less than 8 or is
non-existent like r
110r
27r
36r
45 or r
110. It should be clear how to continue.
We will now see how to replace the terms r
110r
28r
36 with smaller order terms. r
1, r
2 and r
3 all have exponents of at least 6, so we start by factoring out (r
1r
2r
3)
6. r
110r
28r
36 = (r
1r
2r
3)
6 r
14r
22. In the portion of the term remaing after factoring, r
14r
22, r1 and r2 both have exponents of at least two, so we can factor out (r
1r
2)
2, giving us
r
110r
28r
36 = (r
1r
2r
3)
6 (r
1r
2)
2r
12.
How should this be interpreted? Suppose that the original
polynomial was third order so there are only three roots. We
would get rid of the terms like r
i10r
j8rk
6 with the following polynomial in elementary symmetric polynomials:
(r
1r
2r
3)
6(r
1r
2 + r
2r
3 + r
1r
3)
2(r
1 + r
2 + r
3)
2. The highest order terms from (r
1 + r
2 + r
3)
2 are the terms r
i2. The highest order terms of (r
1r
2 + r
2r
3 + r
1r
3)
2 are the terms (r
ir
j)
2 . We therefore have succeeded in producing terms like ri
10rj
8rk
6 and
introducing only lower order terms. The method can now be applied to
the remaining terms. We will eventually come to a halt because at
each stage the remaining terms are of lower order.
General Symmetric Polynomials
In order to motivate the discussion of
symmetric polynomials, they were introduced as being symmetric
polynomials in the roots of some other polynomial. The above
discussion shows how any symmetric polynomial in x1, x2, ..., xn. can be expressed in terms of the elementary symmetric polynomials sum(xixj...xk).