In this appendix we will describe a concrete algorithm for solving the general quintic equation. This algorithm is based on Klein's theory of the connection between the general quintic and the icosahedral equation, described in his famous lectures on the icosahedron [8]. See also [4] (from which we take the illustration below), and [2]. We begin by reviewing this theory.
Associated with the icosahedron
(normalized as in §5)
is a tiling of the
Riemann sphere by 120 spherical triangles,
60 black and 60 white (Figure 5).
This configuration is invariant under the icosahedral group,
represented as a group
of Möbius transformations.
Each triangle has angles
corresponding to the 30 edge midpoints, 20 face centers, and 12 vertices of
the icosahedron.
We will refer to these special points as 2-, 3-, and 5-vertices.
Figure 5: The icosahedral tiling.
Map each white triangle conformally to the upper half-plane,
and map each black triangle conformally to the lower half-plane,
so that the 3-, 5-, and 2-vertices map to .
These 120 separate mappings piece together
to give a rational function of degree 60,
the icosahedral function.
This function, denoted by
, is
right-invariant under the
icosahedral group
:
it gives the quotient map .
To write down the icosahedral function explicitly,
recall that every homogeneous polynomial invariant under the
binary icosahedral group is a polynomial in
,
, and
, where
The polynomials ,
, and
vanish at the 5-, 3-, and 2-vertices respectively.
They satisfy the identity
The icosahedral function is
To check this, note that the top and bottom are homogeneous
of degree 60 (so the ratio is a rational function of
),
the zeros and poles occur at the 3- and 2- vertices, and
by the identity
the 5-vertices of the icosahedron are mapped to 1.
The equation
is called the icosahedral equation.
Solving the icosahedral equation amounts to finding one of the 60
points that map to Z under the icosahedral function.
Given one such point,
the 59 others can be found by determining the images of the first
under the group .
Please note that our normalization of the icosahedral function differs from the normalizations of [8] and [2]:
In this section we give a brief account of the classical reduction of the general quintic equation
to the icosahedral equation, following [8]. As Klein emphasized, this reduction is best understood geometrically.
The first step in the reduction dates back to 1683, when Tschirnhaus showed that by making a substitution of the form
the general quintic can be reduced to
a quintic for which .
Here a and b are determined by solving an
auxiliary quadratic equation.
Such a quintic is called a principal quintic.
Equivalently, a principal quintic is one normalized
so its roots satisfy .
These homogeneous equations determine a quadric surface in
the projective space of roots.
Viewed geometrically, the Tschirnhaus transformation
moves an ordered set of roots to one of the two
points of intersection of this quadric with the line
determined by allowing a and b to vary.
Which point depends on the
choice of auxiliary root.
The symmetric group acts
on the quadric by permuting the roots.
An odd permutation interchanges the two rulings
of the quadric by lines;
adjoining with square-root of the discriminant
reduces the action to the alternating group
,
which preserves the rulings.
The space of lines in a given ruling is isomorphic to
the Riemann sphere , and in appropriate
coordinates the action of
(on the space of lines) is none other than the
icosahedral action.
From the original principal quintic and the square-root
of its discriminant, we may determine a point Z on
the quotient such that a solution to
corresponds to a line containing the point
for some ordering
of the roots. Then the roots themselves can be
found by elimination.
Perhaps the most intriguing part of this whole story is the square root
used in the Tschirnhaus transformation to obtain a
principal quintic.
This square root is an accessory irrationality,
as it does not diminish the Galois group of the equation,
and as such is not expressible in terms of the roots of the equation.
Rather, its function (as pointed out in [15])
is to eliminate the cohomological obstruction described in
.
The culmination of Klein's lectures on the icosahedron
is the result,
which Klein calls Kronecker's theorem,
that without the introduction of such an accessory irrationality
the general quintic equation cannot be reduced to a resolvent equation
that depends--like the icosahedral equation--on a single parameter.
While this result was stated by Kronecker,
the first correct proof was given by Klein.
Apparently, Kronecker felt that accessory irrationalities
were `algebraically worthless',
and proposed what he called the `Abelian Postulate',
requiring that such accessory irrationalities be avoided at all costs.
According to this view,
the reduction of the quintic to the icosahedral equation is inadmissible.
Arguing against this point of view,
Klein (on page 504 of [9])
writes:
Soll man, wo sich neue Erscheinungen (oder hier die Leistungsfähigkeit der akzessorischen Irrationalitäten) darbieten, zugunsten einer einmal gefassten systematischen Ideenbildung die Weiterentwicklung abschneiden, oder vielmehr das systematische Denken als zu eng zurückschieben und den neuen Problemen unbefangen nachgehen? Soll man Dogmatiker sein oder wie ein Naturforscher bemüht sein, aus den Dingen selbst immer neu zu lernen?(When new phenomena appear, like the efficacy of the accessory irrationality, should we halt our investigations because the facts fail to agree with our preconceived notions, or should we cast aside those preconceived notions as being too narrow, and pursue the new problems wherever they lead? Should we be dogmatists, or should we--like experimental scientists--try always to learn from the facts themselves?)
The algorithm we are going to develop to solve the general quintic proceeds by computing a root, not of the icosahedral equation itself, but of a certain quintic resolvent.
Algebraically, the icosahedral equation determines an
extension of function fields k'/k,
where
and
.
A quintic resolvent is the irreducible polynomial
satisfied by an element of k' of degree 5 over k.
In this section, we will derive formulas for the tetrahedral and Brioschi resolvents, again following [8]. The Brioschi resolvent is a one parameter family of quintics, to which the general quintic may be reduced; it is this equation we will actually solve. The tetrahedral resolvent is used to determine a root from the limit point of an iteration.
Figure 6: A cube inscribed in the icosahedron.
The root of a quintic resolvent is stabilized by
an subgroup of
.
There are five such tetrahedral subgroups
in
, all conjugate.
One tetrahedral subgroup, which we denote
, is distinguished
because it leads to a resolvent defined over
.
can be described geometrically as follows.
There are five cubes whose vertices lie on the vertices
of a regular dodecahedron. Of these, exactly one is
symmetric with respect to reflection through the real axis;
the intersection of its symmetry group with
is
.
The vertices of this cube, and the one-skeleton of its
dual octahedral (which includes the real axis),
appear in Figure 6.
permutes the 12 pentagons that correspond to faces of the dodecahedron,
and any one of them is a fundamental domain for
.
preserves the 6 vertices of the dual octahedron,
and the 4 vertices of each tetrahedron inscribed in the cube;
the stabilizers of all other points are trivial.
Note that only half of the symmetries of the cube (and octahedron)
are symmetries of the icosahedron; otherwise
would have a subgroup of order 24.
Besides the special orbits of ,
we need to pay attention to two orbits of order 12:
the face centers of the dodecahedron,
i.e., the 5-vertices,
and the 20 - 8 = 12 complementary 3-vertices--the
vertices of the icosahedron which do not lie on the cube.
There is a tetrahedral function , analogous to the
icosahedral function
,
which gives
the quotient map
.
By composing with a Möbius transformation,
this function can be normalized to take specified values
on any three orbits of
.
We choose the normalization so that the 5-vertices
map to
,
the vertices of the octahedron map to 0,
and the complementary 3-vertices map to 3.
To write down a formula for ,
we call forth some of the invariant forms for the
binary tetrahedral group
.
Fortunately,
all the forms that we need to work with are
absolute invariants (no character of
appears).
Those we use,
vanish at the vertices of the octahedron, the cube, and the complementary 3-vertices respectively.
Any invariant form of degree 12 is a linear combination of the forms
,
, and
,
which satisfy the identity
Thus
since this expression has zeros and poles in the right places, and the identity
shows the complementary 3-vertices are mapped to 3 as desired.
Under , the 60 roots of the icosahedral equation
map in groups of 12 to 5 distinct points. In terms of a single root z, these 5 images are
where
and is a fifth root of unity.
(The rotation
is an element of
).
The quintic resolvent for turns out to be
We will call this equation the tetrahedral resolvent.
Algebraically, the functions are just the roots
of the tetrahedral resolvent in the function field setting.
This equation can be derived entirely geometrically,
without recourse to the explicit formulas for
.
(See pages 100-102 of [8].)
The related function given by
satisfies the Brioschi resolvent
where C = (1-Z)/1728; the roots of this equation are:
Any principal quintic can be reduced to the Brioschi resolvent for some particular choice of C, determined rationally in terms of the original coefficients and the square-root of the discriminant. This reduction appears in detail in [2].
We are now ready to concoct a generally convergent
algorithm for the icosahedral field extension
k'/k.
The ingredients for such an algorithm are given
in Theorem 4.1; note that the Galois group,
, is tautologically identified with
a group of Möbius transformations.
The algorithm itself is specified by
for all in
.
by (a) and (b) and so T
only depends upon
.
To make the formulas as simple as possible, we will choose
, the unique lowest degree rational map with
icosahedral symmetry and a non-trivial attractor (see §5).
(The attractor of
is periodic of order 2, so we
will actually iterate
.)
As for , note that for each fixed w the map
is a rational map with icosahedral symmetry.
As mentioned in Remark 1 of §5, there is a one-parameter
family of symmetric maps of degree 31 (and none of smaller degree);
this provides the simplest candidate for
.
There are three points at which this family degenerates to
maps of lower degree
,
, and
; we arrange
that these degenerations occur at
, 0 and 1.
To derive a formula for in terms of Z, we begin
by expressing
in homogeneous coordinates
then
To check this formula, we just need to verify that it
degenerates as described above.
Clearly this is true for w=0 and .
For w=1 the rational map we get is
which agrees with by virtue of the identity
To get the formula for , we note
is canonically associated to the
12 vertices of the icosahedron, so T is
canonically associated to their images under
.
By remark 2 at the end of §5, all we must do
to specify
is to give a polynomial g(Z,w)
having these 12 points as its roots.
This leads us to look at the form
,
where
is the homogeneous version of
given above.
The form G is homogeneous of degree
in
and of degree 12 in
.
This polynomial is symmetric under the action of
on
.
Because the ring of
-symmetric forms is generated by
and
,
and because
,
it follows on numerological grounds that G is divisible by
,
and that the quotient
can be written
as a homogeneous polynomial of degree 6 in
and of degree 12 in
.
This polynomial can be found by solving a large system of linear equations.
Dividing the resulting expression
for
through by
and using the fact that
,
we get
where g is a polynomial with integer coefficients, exhibited at the end of this Appendix. We found the coefficients of g by solving the relevant system of equations with the aid of a computer.
The map is now given by
where g' denotes the derivative of g with respect to w.
Under the iteration
almost every starting guess is attracted to a cycle
of period 2
consisting of one of the 10 pairs of antipodal 3-vertices.
If instead iterating
we iterate
,
then almost every starting guess
is attracted to a single one of the 20 3-vertices.
The map is just
transported to new coordinates by
.
For almost every Z, almost every starting guess converges under
iteration of
to
where e is one of the 20 3-vertices of the icosahedron in its standard location.
Of course to be able to write
we have to select some particular root z of the icosahedral equation, for we could equally well write
Turning this around,
we see that if we choose some particular 3-vertex ,
there will be exactly three choices for the root z
for which
These three choices differ from one another by the action
of the stabilizer of the 3-vertex
.
Therefore from
we can
determine the values
of two of the functions
,
and hence two roots
,
of the Brioschi resolvent.
These two values
correspond to the two tetrahedral (
) subgroups of
that contain the stabilizer of
.
As ranges over the 20 attractors of
,
the pair
ranges over the 20 ordered pairs of
roots of the resolvent.
In particular,
going from
to the `antipodal point'
,
we get the same pair of roots in the opposite order.
To determine and
explicitly in terms of
,
we introduce the function
While expressed in terms of z,
this function really only depends on Z,
because the action of permutes the two sets of
factors in the same way.
The idea behind
is that
the first factor
acts as a `selector function' for the second:
Recall that the value of function
is 3 at the complementary 3-vertices;
at the vertices of the tetrahedron and the dual tetrahedron its
values are
which are the other two roots of
Thus the factor
vanishes for three values of k and takes on the values
for the remaining two values of k. Consequently
where are two roots of the Brioschi resolvent.
Replacing
with the `antipodal' fixed point
exchanges the roles of
and
,
so we have
Thus we get a pair of linear equations
from which we can determine and
.
All that remains is to express in terms of Z and w.
Let
be defined analogously to
.
Then
The denominator here is our old friend g(Z,w). The numerator can be expressed as a polynomial in Z and w, by the same technique used to determine g. We find
where h(Z,w) is a polynomial with integer coefficients, exhibited below.
To solve the Brioschi resolvent
we proceed in five steps.
where g(Z,w) is the polynomial in Z and w given below, and g' denotes the derivative of g with respect to w.
for i = 1,2, where h is the polynomial in Z and w given below.
for i = 1,2. These are two roots of the Brioschi resolvent.
The key ingredients g(Z,w) and h(Z,w) are given by:
Remarks.
1. A quintic with real coefficients always has at least one real root. Curiously, when applied to a real quintic with real initial guess for step 2, our method always returns a pair of conjugate roots.
2. To find the remaining roots of the quintic, we can apply del Ferro's formula or Example 3 of section 3 to solve the quotient cubic. We could also construct a single iteration that would find all five roots at once, but the formulas might be rather more complicated.
3. Remarkably, one can also derive the formulas for g and h
by hand, without even knowing
the basic invariants ,
and
of the icosahedral group.
This alternate approach exploits
the large number of coefficients that vanish,
and is based on a study of degenerations
of g and h and at Z = 0, 1
and
.