next up previous
Next: References Up: Solving the quintic by Previous: Rational Maps with Symmetry.

Appendix.

In this appendix we will describe a concrete algorithm for solving the general quintic equation. This algorithm is based on Klein's theory of the connection between the general quintic and the icosahedral equation, described in his famous lectures on the icosahedron [8]. See also [4] (from which we take the illustration below), and [2]. We begin by reviewing this theory.

The icosahedral equation.

Associated with the icosahedron (normalized as in §5) is a tiling of the Riemann sphere by 120 spherical triangles, 60 black and 60 white (Figure 5). This configuration is invariant under the icosahedral group, represented as a group tex2html_wrap_inline2873 of Möbius transformations. Each triangle has angles tex2html_wrap_inline2875 corresponding to the 30 edge midpoints, 20 face centers, and 12 vertices of the icosahedron. We will refer to these special points as 2-, 3-, and 5-vertices.

   figure1183
Figure 5: The icosahedral tiling.

Map each white triangle conformally to the upper half-plane, and map each black triangle conformally to the lower half-plane, so that the 3-, 5-, and 2-vertices map to tex2html_wrap_inline2877 . These 120 separate mappings piece together to give a rational function of degree 60, the icosahedral function. This function, denoted by tex2html_wrap_inline2879 , is right-invariant under the icosahedral group tex2html_wrap_inline2881 :

displaymath2853

it gives the quotient map tex2html_wrap_inline2883 .

To write down the icosahedral function explicitly, recall that every homogeneous polynomial invariant under the binary icosahedral group tex2html_wrap_inline2885 is a polynomial in tex2html_wrap_inline2887 , tex2html_wrap_inline2889 , and tex2html_wrap_inline2891 , where

displaymath2854

displaymath2855

displaymath2856

The polynomials tex2html_wrap_inline2893 , tex2html_wrap_inline2895 , and tex2html_wrap_inline2897 vanish at the 5-, 3-, and 2-vertices respectively. They satisfy the identity

displaymath2857

The icosahedral function tex2html_wrap_inline2899 is

displaymath2858

To check this, note that the top and bottom are homogeneous of degree 60 (so the ratio is a rational function of tex2html_wrap_inline2901 ), the zeros and poles occur at the 3- and 2- vertices, and by the identity

displaymath2859

the 5-vertices of the icosahedron are mapped to 1.

The equation

displaymath2860

is called the icosahedral equation. Solving the icosahedral equation amounts to finding one of the 60 points that map to Z under the icosahedral function. Given one such point, the 59 others can be found by determining the images of the first under the group tex2html_wrap_inline2905 .

Please note that our normalization of the icosahedral function differs from the normalizations of [8] and [2]:

displaymath2861

displaymath2862

From the general quintic to the icosahedral equation.

In this section we give a brief account of the classical reduction of the general quintic equation

displaymath2909

to the icosahedral equation, following [8]. As Klein emphasized, this reduction is best understood geometrically.

The first step in the reduction dates back to 1683, when Tschirnhaus showed that by making a substitution of the form

displaymath2910

the general quintic can be reduced to a quintic for which tex2html_wrap_inline2913 . Here a and b are determined by solving an auxiliary quadratic equation. Such a quintic is called a principal quintic.

Equivalently, a principal quintic is one normalized so its roots satisfy tex2html_wrap_inline2919 . These homogeneous equations determine a quadric surface in the projective space of roots. Viewed geometrically, the Tschirnhaus transformation moves an ordered set of roots to one of the two points of intersection of this quadric with the line determined by allowing a and b to vary. Which point depends on the choice of auxiliary root.

The symmetric group tex2html_wrap_inline2925 acts on the quadric by permuting the roots. An odd permutation interchanges the two rulings of the quadric by lines; adjoining with square-root of the discriminant reduces the action to the alternating group tex2html_wrap_inline2927 , which preserves the rulings.

The space of lines in a given ruling is isomorphic to the Riemann sphere tex2html_wrap_inline2929 , and in appropriate coordinates the action of tex2html_wrap_inline2931 (on the space of lines) is none other than the icosahedral action. From the original principal quintic and the square-root of its discriminant, we may determine a point Z on the quotient such that a solution to

displaymath595

corresponds to a line containing the point tex2html_wrap_inline2935 for some ordering of the roots. Then the roots themselves can be found by elimination.

Perhaps the most intriguing part of this whole story is the square root used in the Tschirnhaus transformation to obtain a principal quintic. This square root is an accessory irrationality, as it does not diminish the Galois group of the equation, and as such is not expressible in terms of the roots of the equation. Rather, its function (as pointed out in [15]) is to eliminate the cohomological obstruction described in tex2html_wrap_inline2937 . The culmination of Klein's lectures on the icosahedron is the result, which Klein calls Kronecker's theorem, that without the introduction of such an accessory irrationality the general quintic equation cannot be reduced to a resolvent equation that depends--like the icosahedral equation--on a single parameter. While this result was stated by Kronecker, the first correct proof was given by Klein. Apparently, Kronecker felt that accessory irrationalities were `algebraically worthless', and proposed what he called the `Abelian Postulate', requiring that such accessory irrationalities be avoided at all costs. According to this view, the reduction of the quintic to the icosahedral equation is inadmissible. Arguing against this point of view, Klein (on page 504 of [9]) writes:

Soll man, wo sich neue Erscheinungen (oder hier die Leistungsfähigkeit der akzessorischen Irrationalitäten) darbieten, zugunsten einer einmal gefassten systematischen Ideenbildung die Weiterentwicklung abschneiden, oder vielmehr das systematische Denken als zu eng zurückschieben und den neuen Problemen unbefangen nachgehen? Soll man Dogmatiker sein oder wie ein Naturforscher bemüht sein, aus den Dingen selbst immer neu zu lernen?

(When new phenomena appear, like the efficacy of the accessory irrationality, should we halt our investigations because the facts fail to agree with our preconceived notions, or should we cast aside those preconceived notions as being too narrow, and pursue the new problems wherever they lead? Should we be dogmatists, or should we--like experimental scientists--try always to learn from the facts themselves?)

Quintic resolvents of the icosahedral equation.

The algorithm we are going to develop to solve the general quintic proceeds by computing a root, not of the icosahedral equation itself, but of a certain quintic resolvent.

Algebraically, the icosahedral equation determines an tex2html_wrap_inline2963 extension of function fields k'/k, where tex2html_wrap_inline2967 and tex2html_wrap_inline2969 . A quintic resolvent is the irreducible polynomial satisfied by an element of k' of degree 5 over k.

In this section, we will derive formulas for the tetrahedral and Brioschi resolvents, again following [8]. The Brioschi resolvent is a one parameter family of quintics, to which the general quintic may be reduced; it is this equation we will actually solve. The tetrahedral resolvent is used to determine a root from the limit point of an iteration.

   figure1250
Figure 6: A cube inscribed in the icosahedron.

The root of a quintic resolvent is stabilized by an tex2html_wrap_inline2975 subgroup of tex2html_wrap_inline2977 . There are five such tetrahedral subgroups in tex2html_wrap_inline2979 , all conjugate. One tetrahedral subgroup, which we denote tex2html_wrap_inline2981 , is distinguished because it leads to a resolvent defined over tex2html_wrap_inline2983 .

tex2html_wrap_inline2985 can be described geometrically as follows. There are five cubes whose vertices lie on the vertices of a regular dodecahedron. Of these, exactly one is symmetric with respect to reflection through the real axis; the intersection of its symmetry group with tex2html_wrap_inline2987 is tex2html_wrap_inline2989 . The vertices of this cube, and the one-skeleton of its dual octahedral (which includes the real axis), appear in Figure 6.

tex2html_wrap_inline2991 permutes the 12 pentagons that correspond to faces of the dodecahedron, and any one of them is a fundamental domain for tex2html_wrap_inline2993 .

tex2html_wrap_inline2995 preserves the 6 vertices of the dual octahedron, and the 4 vertices of each tetrahedron inscribed in the cube; the stabilizers of all other points are trivial. Note that only half of the symmetries of the cube (and octahedron) are symmetries of the icosahedron; otherwise tex2html_wrap_inline2997 would have a subgroup of order 24.

Besides the special orbits of tex2html_wrap_inline2999 , we need to pay attention to two orbits of order 12: the face centers of the dodecahedron, i.e., the 5-vertices, and the 20 - 8 = 12 complementary 3-vertices--the vertices of the icosahedron which do not lie on the cube.

There is a tetrahedral function tex2html_wrap_inline3003 , analogous to the icosahedral function tex2html_wrap_inline3005 , which gives the quotient map tex2html_wrap_inline3007 . By composing with a Möbius transformation, this function can be normalized to take specified values on any three orbits of tex2html_wrap_inline3009 . We choose the normalization so that the 5-vertices map to tex2html_wrap_inline3011 , the vertices of the octahedron map to 0, and the complementary 3-vertices map to 3.

To write down a formula for tex2html_wrap_inline3017 , we call forth some of the invariant forms for the binary tetrahedral group tex2html_wrap_inline3019 . Fortunately, all the forms that we need to work with are absolute invariants (no character of tex2html_wrap_inline3021 appears). Those we use,

displaymath2939

displaymath2940

eqnarray615

vanish at the vertices of the octahedron, the cube, and the complementary 3-vertices respectively.

Any invariant form of degree 12 is a linear combination of the forms tex2html_wrap_inline3023 , tex2html_wrap_inline3025 , and tex2html_wrap_inline3027 , which satisfy the identity

displaymath2941

Thus

displaymath2942

since this expression has zeros and poles in the right places, and the identity

displaymath2943

shows the complementary 3-vertices are mapped to 3 as desired.

Under tex2html_wrap_inline3029 , the 60 roots of the icosahedral equation

displaymath2944

map in groups of 12 to 5 distinct points. In terms of a single root z, these 5 images are

displaymath2945

where

displaymath2946

and tex2html_wrap_inline3033 is a fifth root of unity. (The rotation tex2html_wrap_inline3035 is an element of tex2html_wrap_inline3037 ).

The quintic resolvent for tex2html_wrap_inline3039 turns out to be

displaymath2947

We will call this equation the tetrahedral resolvent. Algebraically, the functions tex2html_wrap_inline3041 are just the roots of the tetrahedral resolvent in the function field setting. This equation can be derived entirely geometrically, without recourse to the explicit formulas for tex2html_wrap_inline3043 . (See pages 100-102 of [8].)

The related function tex2html_wrap_inline3045 given by

displaymath2948

satisfies the Brioschi resolvent

displaymath2949

where C = (1-Z)/1728; the roots of this equation are:

displaymath2950

Any principal quintic can be reduced to the Brioschi resolvent for some particular choice of C, determined rationally in terms of the original coefficients and the square-root of the discriminant. This reduction appears in detail in [2].

The icosahedral iteration.

We are now ready to concoct a generally convergent algorithm for the icosahedral field extension k'/k. The ingredients for such an algorithm are given in Theorem 4.1; note that the Galois group, tex2html_wrap_inline3065 , is tautologically identified with a group of Möbius transformations.

The algorithm itself is specified by

(a)
a rational map f(w) commuting with tex2html_wrap_inline3069 , and
(b)
a Möbius transformation tex2html_wrap_inline3071 , depending on a root z of the icosahedral equation, such that

displaymath650

for all tex2html_wrap_inline3075 in tex2html_wrap_inline3077 .

The coordinate w can be thought of as residing on a separate Riemann sphere where the iteration is performed. The algorithm is given by

displaymath654

by (a) and (b) tex2html_wrap_inline3081 and so T only depends upon tex2html_wrap_inline3085 .

To make the formulas as simple as possible, we will choose tex2html_wrap_inline3087 , the unique lowest degree rational map with icosahedral symmetry and a non-trivial attractor (see §5). (The attractor of tex2html_wrap_inline3089 is periodic of order 2, so we will actually iterate tex2html_wrap_inline3091 .)

As for tex2html_wrap_inline3093 , note that for each fixed w the map tex2html_wrap_inline3097 is a rational map with icosahedral symmetry. As mentioned in Remark 1 of §5, there is a one-parameter family of symmetric maps of degree 31 (and none of smaller degree); this provides the simplest candidate for tex2html_wrap_inline3099 . There are three points at which this family degenerates to maps of lower degree tex2html_wrap_inline3101 , tex2html_wrap_inline3103 , and tex2html_wrap_inline3105 ; we arrange that these degenerations occur at tex2html_wrap_inline3107 , 0 and 1.

To derive a formula for tex2html_wrap_inline3109 in terms of Z, we begin by expressing tex2html_wrap_inline3113 in homogeneous coordinates

displaymath3053

then

displaymath3054

To check this formula, we just need to verify that it degenerates as described above. Clearly this is true for w=0 and tex2html_wrap_inline3117 . For w=1 the rational map we get is

displaymath3055

which agrees with tex2html_wrap_inline3121 by virtue of the identity

displaymath3056

To get the formula for tex2html_wrap_inline3123 , we note tex2html_wrap_inline3125 is canonically associated to the 12 vertices of the icosahedron, so T is canonically associated to their images under tex2html_wrap_inline3129 . By remark 2 at the end of §5, all we must do to specify tex2html_wrap_inline3131 is to give a polynomial g(Z,w) having these 12 points as its roots.

This leads us to look at the form tex2html_wrap_inline3135 , where tex2html_wrap_inline3137 is the homogeneous version of tex2html_wrap_inline3139 given above. The form G is homogeneous of degree tex2html_wrap_inline3143 in tex2html_wrap_inline3145 and of degree 12 in tex2html_wrap_inline3147 . This polynomial is symmetric under the action of tex2html_wrap_inline3149 on tex2html_wrap_inline3151 . Because the ring of tex2html_wrap_inline3153 -symmetric forms is generated by tex2html_wrap_inline3155 and tex2html_wrap_inline3157 , and because tex2html_wrap_inline3159 , it follows on numerological grounds that G is divisible by tex2html_wrap_inline3163 , and that the quotient tex2html_wrap_inline3165 can be written as a homogeneous polynomial of degree 6 in tex2html_wrap_inline3167 and of degree 12 in tex2html_wrap_inline3169 . This polynomial can be found by solving a large system of linear equations. Dividing the resulting expression for tex2html_wrap_inline3171 through by tex2html_wrap_inline3173 and using the fact that tex2html_wrap_inline3175 , we get

displaymath3057

where g is a polynomial with integer coefficients, exhibited at the end of this Appendix. We found the coefficients of g by solving the relevant system of equations with the aid of a computer.

The map tex2html_wrap_inline3181 is now given by

displaymath675

where g' denotes the derivative of g with respect to w.

From the iteration to a root.

Under the iteration tex2html_wrap_inline3211 almost every starting guess is attracted to a cycle of period 2 consisting of one of the 10 pairs of antipodal 3-vertices. If instead iterating tex2html_wrap_inline3215 we iterate tex2html_wrap_inline3217 , then almost every starting guess is attracted to a single one of the 20 3-vertices.

The map tex2html_wrap_inline3219 is just tex2html_wrap_inline3221 transported to new coordinates by tex2html_wrap_inline3223 . For almost every Z, almost every starting guess converges under iteration of tex2html_wrap_inline3227 to

displaymath3189

where e is one of the 20 3-vertices of the icosahedron in its standard location.

Of course to be able to write

displaymath3190

we have to select some particular root z of the icosahedral equation, for we could equally well write

displaymath3191

Turning this around, we see that if we choose some particular 3-vertex tex2html_wrap_inline3233 , there will be exactly three choices for the root z for which

displaymath3192

These three choices differ from one another by the action of the stabilizer tex2html_wrap_inline3237 of the 3-vertex tex2html_wrap_inline3239 . Therefore from tex2html_wrap_inline3241 we can determine the values of two of the functions tex2html_wrap_inline3243 , and hence two roots tex2html_wrap_inline3245 , tex2html_wrap_inline3247 of the Brioschi resolvent. These two values correspond to the two tetrahedral ( tex2html_wrap_inline3249 ) subgroups of tex2html_wrap_inline3251 that contain the stabilizer of tex2html_wrap_inline3253 .

As tex2html_wrap_inline3255 ranges over the 20 attractors of tex2html_wrap_inline3257 , the pair tex2html_wrap_inline3259 ranges over the 20 ordered pairs of roots of the resolvent. In particular, going from tex2html_wrap_inline3261 to the `antipodal point' tex2html_wrap_inline3263 , we get the same pair of roots in the opposite order.

To determine tex2html_wrap_inline3265 and tex2html_wrap_inline3267 explicitly in terms of tex2html_wrap_inline3269 , we introduce the function

displaymath3193

While expressed in terms of z, this function really only depends on Z, because the action of tex2html_wrap_inline3275 permutes the two sets of factors in the same way. The idea behind tex2html_wrap_inline3277 is that the first factor acts as a `selector function' for the second: Recall that the value of function tex2html_wrap_inline3279 is 3 at the complementary 3-vertices; at the vertices of the tetrahedron and the dual tetrahedron its values are

displaymath3194

which are the other two roots of

displaymath3195

Thus the factor tex2html_wrap_inline3281 vanishes for three values of k and takes on the values

displaymath3196

for the remaining two values of k. Consequently

displaymath3197

where tex2html_wrap_inline3287 are two roots of the Brioschi resolvent. Replacing tex2html_wrap_inline3289 with the `antipodal' fixed point tex2html_wrap_inline3291 exchanges the roles of tex2html_wrap_inline3293 and tex2html_wrap_inline3295 , so we have

displaymath3198

Thus we get a pair of linear equations from which we can determine tex2html_wrap_inline3297 and tex2html_wrap_inline3299 .

All that remains is to express tex2html_wrap_inline3301 in terms of Z and w. Let tex2html_wrap_inline3307 be defined analogously to tex2html_wrap_inline3309 . Then

eqnarray713

The denominator here is our old friend g(Z,w). The numerator can be expressed as a polynomial in Z and w, by the same technique used to determine g. We find

displaymath3199

where h(Z,w) is a polynomial with integer coefficients, exhibited below.

The algorithm.

To solve the Brioschi resolvent

displaymath3321

we proceed in five steps.

  1. Set Z = 1 - 1728C.
  2. Compute the rational function

    displaymath3322

    where g(Z,w) is the polynomial in Z and w given below, and g' denotes the derivative of g with respect to w.

  3. Iterate tex2html_wrap_inline3337 on a random starting guess until it converges. Call the limit point tex2html_wrap_inline3339 , and set tex2html_wrap_inline3341 .
  4. Compute

    displaymath3323

    for i = 1,2, where h is the polynomial in Z and w given below.

  5. Finally compute

    displaymath3324

    for i = 1,2. These are two roots of the Brioschi resolvent.

The key ingredients g(Z,w) and h(Z,w) are given by:

eqnarray735

Remarks.

1. A quintic with real coefficients always has at least one real root. Curiously, when applied to a real quintic with real initial guess for step 2, our method always returns a pair of conjugate roots.

2. To find the remaining roots of the quintic, we can apply del Ferro's formula or Example 3 of section 3 to solve the quotient cubic. We could also construct a single iteration that would find all five roots at once, but the formulas might be rather more complicated.

3. Remarkably, one can also derive the formulas for g and h by hand, without even knowing the basic invariants tex2html_wrap_inline3359 , tex2html_wrap_inline3361 and tex2html_wrap_inline3363 of the icosahedral group. This alternate approach exploits the large number of coefficients that vanish, and is based on a study of degenerations of g and h and at Z = 0, 1 and tex2html_wrap_inline3371 .


next up previous
Next: References Up: Solving the quintic by Previous: Rational Maps with Symmetry.

Peter Doyle