Note: Quadratics for Grown Ups

Steve Trettel

|

We study the quadratic formula (b±b24ac)/2a geometrically, as a map from the space of coefficients to the space of roots for polynomials of degree 2. Our main goal is the following theorem: restricted to real polynomials with complex roots, the quadratic formula realizes an isometry from the projective model to the conformal model of the hyperbolic plane.

Geometry of the map Polynomials Roots

Our goal is to understand in detail the geometry of the quadratic formula as a map from coefficients to roots, and from this understanding extract a cool fact: namely that restricted to real coefficients, the relevant geometry includes both a projective and a conformal model of the hyperbolic plane. Here we start the road to viewing the relationship between a polynomial and its roots as a geometric problem by first casting it as a topological one. As our interest lies in the quadratics everything is quite explicit: however just here at the beginning I will talk about general n, as it’s my hope that some of these ideas will (with much more work) continue to tell a good story in higher degree.

The fundamental theorem of algebra guarantees every complex nonconstant polynomial has at least one root, or equivalently every degree n polynomial has exactly n roots (counting multiplicity). One recasting of this as a topological statement about the map Roots:PolynomialsTheir;Roots, is as follows. Let PolnC××Cn denote the space of degree n polynomials over C identified with their coefficients, and for each fPoln let Roots(f) be its multiset of roots. To describe the space of these multi-sets, its useful recall the notion of \emph{symmetric power}: the nth symmetric power of a space X is the collection of all unordered n-tuples, SPn(X)=Xn/\Sym(n). The map Roots:C×CnSPn(C) is continuous, and factors through projectivization (f and cf have the same roots) to a map sending a monic polynomial to its roots. The result is a continuous bijection; and thus homeomorphism, between the space of polynomials viewed-as-their-coefficients, and the space of polynomials-viewed-as-their-roots.

(Fundamental Theorem of Algebra) The nth symmetric power of C is homeomorphic to the nth power of C by the roots map Roots:CnSPn(C).

This familiar setting is not quite where our story will take place however. Instead of projectivizing the space C××Cn of degree n polynomials onto their monic representatives, we will instead \emph{include back in} the missing polynomials of lower degree, and study the space PolnCn+1. This at first seems a difficult task, as the roots map no longer lands in a single space but rather in the union of the symmetric powers SPn(C)SPn1(C)SP1(C)SP0(C). However, it is natural to topologize the union of the first n symmetric powers of a space X by using the one-point compactification X: if k<n we represent the unordered k-tuple x1,,xk by the unordered n-tuple x1,,xk,,,, essentially filling in the correct number of empty slots with the `filler point' .

For polynomials, this amounts to (for example) saying that, viewed in relation to cubic polynomials, a linear polynomial has \emph{two roots at infinity}. This is actually quite natural, as when we write down families of cubics which in the limit become linear, two of their roots escape all bounded sets in C. This leads to the following observation: when we topologize the union SPn(C)SPn1(C) via the aforementioned identification with SPn(C), the natural extension of the roots map Roots:PolnSPn(C) remains continuous. This map again factors through projectivization of the domain to a continuous bijection, and thus homeomorphism from CPn onto SPn(CP1).

(Fundamental Theorem of Algebra) The nth symmetric power of the 2-sphere is CPn, with homeomorphism realized by Roots:CPnSPn(CP1).

As each space of lower degree polynomials is naturally built into the construction, we may take the direct limit of this over inclusions for increasing n to give a third and final topological incarnation of the fundamental theorem:

(Fundamental Theorem of Algebra) The infinite symmetric power of the 2-sphere is CP, with homeomorphism realized by Roots:P(C[z])SP(CP1).

What does this tell us? Firstly, that there is a uniform, natural way to consider polynomials of lower degree as degenerations of higher degree polynomials, allowing us to union to domains, codomains and root maps for each lower degree into a single morphism CPnSPn(CP1). Using this, from the viewpoint of coefficients the geometry of polynomials is naturally contained in the geometry of complex projective space, but from the viewpoint of roots the geometry of polynomials is naturally contained in the geometry of symmetric powers of the sphere. The roots map provides a way of passing between these geometries: if XPoln is a collection of poylnomials, we may consider both a \emph{projective model} P(Coefs(X)) and a \emph{symmetric powers model} Roots(X).

Quadratics

Restricting to (at most) quadratic polynomials, gives the map Roots:CP2SP2(CP1).
We will write the projectivized coefficients of a polynomial as [a:b:c] and the roots as z,w. The discriminant of the polynomial f(z)=az2+bz+c will be denoted δ(f)=b24ac.

In this low degree, an explicit description of the second symmetric power of CP1 is useful: SP2(CP1)=(CP1×CP1)/Z2 where Z2 acts by switching the first and second coordinates. We may even draw a useful picture of this, after identifying CP1 with the unit sphere in R3: let pr:S2I be the map sending a point on the sphere to its height; its z coordinate in I=[1,1]. Then the map pr×pr:S2×S2I×I gives us a kind of degenerate fibration: the preimage of (x,y)I2 is generically a torus, consisting of the latitude circle at height x times the latitude at height y.

$\S^2\times\S^2$ as a degenerate fiber bundle over the square.  Generic point preimages are flat tori, the preimage of points along the boundary are circles and corners are points.  Together the boundary square has preimage a necklace of 4-spheres onto which $T^2\times I^2$ is glued.

S2×S2 as a degenerate fiber bundle over the square. Generic point preimages are flat tori, the preimage of points along the boundary are circles and corners are points. Together the boundary square has preimage a necklace of 4-spheres onto which T2×I2 is glued.

The diagonal embedding Δ:S2\injectS2×S2 is the preimage of the diagonal in I×I, and the Z2 action permuting coordinates acts freely off of here. Along this diagonal the action is trivial, and so geometrically Δ(S2) is a locus cone singularities in the quotient SP2(S2)CP2. Subsets of SP2(S2) can be profitably understood by `folding' a corresponding subset of S2×S2 along the diagonal S2.

Forming the symmetric product SP2(CP1) as a quotient of CP1×CP1 by the map exchanging coordinates. (Note that in many of these cartoon drawings it will appear that Δ(CP1) is a boundary of SP2(CP1). This is of course not true; in reality it is a codimension-2 sphere shaped singular locus.)

It’s instructive to view some important subsets of polynomials in both the roots and coefficient spaces. In the coefficients model, the at-most-linear polynomials show up as the CP1 at infinity with respect to the affine patch centered at [1:0:0]z2. In the roots model, this same set identifies with the CP1 worth of points ,z in SP2(CP1). This CP1 intersects another important CP1 in the space of quadrics - the set of polynomials with a double root - in a single point (the constant polynomial, which by our convention has a double root at ). In the roots model, the quadratics with a double root are easy to describe, they are the image of the diagonal embedding Δ(CP1); the singular locus of the orbifold structure on SP2(CP1).

The sphere of at most linear polynomials and the sphere of polynomials having a double root meet only in the constant polynomial.

The sphere of at most linear polynomials and the sphere of polynomials having a double root meet only in the constant polynomial.

In the coefficient model this set is more difficult to spell out: it is the projective variety cut out by the discriminant V(δ)=V(b24ac). To see this is a 2-sphere, we may change variables so that b24ac is diagonal, x2+y2+z2 (all non-degenerate quadratic forms are equivalent over C), and note that in the affine patch z=1 we have x2+y2=1 in C2, whose solution set is a cylinder; together with two points (the north and south poles) at infinity with respect to this patch.

he projective variety $V(b^2-4ac)$ in $\CP^2$ is homeomorphic to a sphere; as can be seen by changing coordinates to $V(x^2+y^2+z^2)$ to and then computing $V(x^2+y^2=1)$ in an affine patch.

he projective variety V(b24ac) in CP2 is homeomorphic to a sphere; as can be seen by changing coordinates to V(x2+y2+z2) to and then computing V(x2+y2=1) in an affine patch.

Next we aim to impose some notions of geometry on the space of quadratics. The coefficient model, CP2, has a natural notion of geometry, with automorphism group PSL(3;C). Note that this is different than what we would get considering only monic degree 2-polynomials (which would be the affine group for C2), and it allows the mixing of linear with quadratic polynomials by sending roots to . To understand the geometry of SP2(CP1), we first look to its orbifold universal cover CP1×CP1. This has automorphism group given by Aut(CP1) on each factor extended by swapping the factors, or Z2PSL(2;C)×PSL(2;C). The automorphisms of SP2(CP1) are those elements above which are well defined (and nontrivial) on the quotient: that is, the diagonal elements Δ(PSL(2;C)). This has a nice geometric interpretation: automorphisms of SP2(CP1) preserve the singular locus which is itself a copy of CP1, and any automorphism of this CP1 is admissable. Equivalently, all automorphisms of SP2(CP1) are induced from automorphisms of the underlying extended complex plane, and then applied to multi-sets of cardinality 2.

Automorphisms of SP2(CP1) must preserve the singular locus, and thus all arise from automorphisms of the underlying space CP1

Finally, it will be useful to return to the coefficients model, and compute the restricted group of symmetries which `play nicely' with the roots map. As symmetries of SP2(CP1) must preserve the singular locus, the diagonal CP1 representing double roots, their realization in the coefficients model must preserve the discriminant locus V(δ). As δ is a quadratic form on C3, the automorphisms of CP2 preserving V(δ) form the complex orthogonal group PO(δ,C). This is just an incarnation of the irreducible representation SL(2,C)SO(3;C), taking the SL(2,C) action on the extended complex plane, applying it to unordered 2-tuples, then interpreting those as roots of a polynomial and viewing the action on coefficients. The fact that all orthogonal groups are isomorphic over C tells us that SO(δ,C) contains subgroups isomorphic to SO(3;R) and SO(2,1;R), whose actions we can understand geometrically. With SO(δ,C)SL(2;C) acting as conformal transformations of the sphere V(δ), as expected an SO(3) subgroup acts as rigid rotations of this sphere, and SO(2,1) subgroups act as M"obius transformations preserving some circle on V(δ). A particular one of these, namely the real points SO(δ;R) will be important below.

In root space, the automorphisms of SP2(CP1) are the diagonal embedding of PSL(2;C) as we saw above.
In coefficient space, this group appears instead as a complex orthogonal group, SO(δ,C)PSL(3;C).

It is interesting to note that from this perspective, it is still natural to allow automorphisms mixing quadratic with linear polynomials, but no longer mixing polynomials with distinct roots with those having a double root.

Real Quadratics

If we restrict to polynomials with real coefficients, Pol2(C) becomes Pol2(R) and correspondingly the space of projectivized coefficients changes from CP2 to RP2. It is more work to describe the image under the roots map. As Roots:CP2SP2(CP1) is a homeomorphism, that restriction is an embedding of RP2 into SP2(CP1) is clear.
To understand this embedding, we will realize this RP2 as a certain 2-complex in CP1×CP1, `folded over' the diagonal Δ(CP1).

Real quadratics come in two flavors: those with real roots, and those with complex conjugate roots.
In the CP1×CP1 cover, polynomials with complex conjugate roots correspond to the collection (z,z) as z ranges in C, forming a sphere. This sphere intersects the sphere Δ(CP1) along the great circle (x,x)xR, and so in the quotient gets folded along this great circle into a disk. The polynomials with real roots correspond to the collection (x,y)x,yR, which forms a torus in CP1×CP1. This torus also contains the real diagonal (x,x)xR as a (1,1) curve and is folded along it onto a M"obius band in the quotient. Thus, in CP1×CP1 the relevant 2-complex is the union of a sphere and a torus, glued along a circle as in the cartoon below.
In the quotient, the sphere becomes a disk and the torus a M"obius band, glued along their common circle of intersection: this is the RP2 embedded by the roots map.

The preimage of the RP2 of real polynomials in SP2(CP1), viewed in the orbifold cover CP1×CP1 and its quotient, a creased RP2.

Note that this RP2 contains part of the singular locus of SP2(CP1). It is constructed from S2S1T2 folded in half along their common S1, which we saw above to consist of real double roots. To remember this, I will draw a crease along this curve when a picture denotes the image of RP2 under the roots map.

The real quadratic polynomials form an RP2 inside of both CP2SP2(CP1). In the root-space, this RP2 intersects the singular locus in the circle of polynomials with a double real root.

To calculate the automorphisms of this subset in each the coefficient and roots viewpoints, we find the subgroup which fixes the real points setwise. In the coefficient view this is easy: the smoothly embedded RP2CP2 has automorphism group PSL(3;R)<PSL(3;C). On the roots side, any automorphism of this creased RP2 must send the crease to itself (to see this, note that all automorphisms of SP2(CP1) preserve the singular locus, and so the subgroup preserving real polynomials must preserve the intersection of this singular set with the real points, which is precisely the creased circle). This circle represents the real quadratics with a double root Δ(R)Δ(CP1) and the automorphisms preserving it setwise are none other than the diagonal embedding of PSL(2;R). Viewed back on the other side, the automorphisms of RP2CP2 which preserve the division into polynomials with a double root and those with distinct roots are the intersection of Aut(RP2) with Aut(V(δ)); that is SO(δ;R)=PGL(3;R)SO(δ;C). Over R, the discriminant δ=b24ac has signature (2,1), so this is a conjugate of SO(2,1) in PGL(3;R) preserving the divison of RP2 into a disk, M"obius band by the discriminant.

The automorphism group of real quadrics in coefficient space is all of PGL(3;R). In root space, any automorphism must preserve the singular locus, so the symmetries are reduced to a copy of PSL(2;R). Back in coefficient space, this subgroup is realized as SO(δ;R)<PSL(3;R).

This action of SO(2,1) is well known - it acts as the isometries of both the Klein (projective) model of the hyperbolic plane when restricted to the disk, and the projective model of de Sitter 1+1 space when restricted to the M"obius band exterior (or anti de Sitter 1+1 space, as the two are isomorphic in this dimension). The SL(2;R) action on the real polynomials in SP2(CP1) is maybe not as well known, but comes from gluing together two natural actions on the universal cover CP1×CP1. Up here, recall that the RP2 of real polynomials is double covered by a more interesting object; the union of a sphere and torus along a circle. The sphere consists of pairs (z,z) for zCP1 and the torus of points (x,y) for each coordinate in the extended reals. The subgroup of Aut(CP1×CP1) fixing this sphere a copy of PSL(2,C) embedded in Aut(CP1×CP1) as A(A,A) and the subgroup fixing the torus is the real points, PSL(2;R)×PSL(2;R).

Automorphisms of the branched cover of RPRoots2 in CP1×CP1.

Most of the automorphsims of either component individually do not descend to the quotient SP2(CP1); to do so an automorphism must preserve the singular locus, which intersects the picture here in the same S1 common to the sphere and torus. On the sphere, this selects out only conformal automorphisms preserving the equator, which divides CP1 into two disks, each of which when equipped with this PSL(2;R) action is a conformal model of the hyperbolic plane (either the Poincare disk or upper half plane, depending if the stereographic projection to C has projection point on, or off the equatorial circle). Thinking of the original CP1 as the boundary of hyperbolic 3-space when equipped with the PSL(2,C) action, we can see fixing a circle in the boundary as restricting the possible isometries to those that preserve that hyperbolic plane (which we can view conformally by then projecting onto the upper, lower hemispheres of the ideal boundary if we would like). On the torus, the corresponding picture is similar, but for anti-de Sitter space. Much like the sphere is the ideal boundary of H3, the torus is the ideal boundary of AdS3. Fixing certain curves (in our case the (1,1) curve with respect to the decomposition of isometries as PSL(2;R)×PSL(2;R) above) correspond to preserving a lower-dimensional anti de Sitter space AdS2, which is isomorphic to de Sitter 2-space as a coincidence of low dimensions and has model an open M"obius band. The PSL(2;R) action on the creased RP2 in SP2(CP1) is what you get from gluing these two actions together along the common ideal boundary of their spaces, where they agree.

The RP2 of real polynomials is a union of two natural geometries along their ideal boundary.

The Quadratic Formula

We’ve done all the hard work now; having described the domain and codomain of the roots map assigning coefficients in CP2 to their roots in SP2(CP1) as well as the symmetries relevant to each side.

The map taking coefficients to roots, when restricted to real quadratics with complex roots, is an isometry from the projective model H2RP2 to the conformal model H2CP1.

The complex roots of a real quadratic come in conjguate pairs, and so are represented in root space by the disk z,z in SP2(CP1). In coefficient space, the real polynomials with complex roots are the negative cone of the discriminant, represented by the disk \Pδ<0 in RP2CP2. The first of these spaces is naturally a conformal model of the hyperbolic plane (looking in the double cover CP1×CP1, it is a hemisphere of CP1 together with the M"obius transformations preserving it), and the second is naturally a projective model (the action of SO(δ,R)SO(2,1) is by projective transformations preserving the metric given by the cross ratio). Restricting the root map to this disk gives the map Roots:DCoefs2DRoots2, which is equivariant with respect to the action of hyperbolic isometries on each side (that is, if gIsom(H2) then Roots(g.f)=g.Roots(f) where in the first case g acts as an element of SO(δ;R) and in the second as an element of Δ(PSL(2;R)).)

The roots map is an intertwiner for the action of isometries on both sides, and is itself an isometry from the coefficients model to the roots model.

It only remains to show that Roots is an isometry. But this is no work at all, thanks to the symmetry we have established along the way. Since Roots is Isom(H2) equivariant and this action is transitive, Roots is completely determined by its value at any point. Consider f=z2+1 with Roots(f)=i,i.
Now let Φ be any isometry between these two models of hyperbolic space equivariant with respect to the given group actions. Without loss of generality we may assume that Φ(z2+1)=i,i (if not, pre- and post-compose by isometries of the domain / codomain to make it so). But now Roots,Φ are Isom(H2)-equivariant maps from DCoefs2 to DRoots2 agreeing on the point z2+1, and so they are equal.

To finish it off, we will write this map down in coordinates, as an explicit map from a disk in RP2 to a disk in CP1. This also constitutes a second proof of the statement above, in which you are free to ignore everything up to here in this writeup and just directly compare the formula to the standard conversion from the Klein disk to the Upper Half Plane, for instance as found on Wikiepdia. Starting with a quadratic f=az2+bz+c, we represent it in the space of projectivized coefficients as [a:b:c], and get at its roots via the familiar quadratic formula. Missing or unrecognized delimiter for \left

To get useful coordinates on the codomain, note that as the roots are always a complex conjugate pair, one must be in the upper half plane and the other in the lower. Thus we may unambigiously select the root in the upper half plane and represent our pair of roots by a single number x+iy for y>0. Noting that δ<0, we may write this as follows, using the convention that means `the unique positive square root of' when applied to real numbers. Roots:[a:b:c]b2a+i4acb22a This provides us with useful coordinates on the codomain, and so our next move is to do something similar for the domain. The first obvious choice is the reduction to monic quadratics using the affine patch a=1, which lets us think of the domain as the set of points in R2=(b,c) with b2<4c:

Roots:(b,c)b2+i4cb22

The quadratic formula as usually written is a map from a projective model of} H2 \textbf{as a paraboloid onto the upper half plane.

This is nothing more than the quadratic formula as learned in grade school, but we know from the above that we may interpret this as an isometry between two copies of the hyperbolic plane! The copy in the codomain is familiar; by selecting coordinates given by the root with positive imaginary part we have naturally landed in the upper half plane. But what is the model in the domain? This `paraboloid' model is in fact the Klein model in disguise - we have just chosen the wrong affine patch; one that runs parallel to the lightcone instead of transverse to it.

projective model of the hyperbolic plane is a paraboloid, hyperboloid disk depending on choice of affine patch.

To reconstruct the more familiar picture, we need to change the affine patch. The following transformation does the job:

(a\b\c)=(w+u2\vfracwu2)

In these coordinates, the discriminant is δ=u2+v2w2 and the quadratic formula is the map taking the polynomial (w+u)z2+2vz+(wu)=0 to its roots:

Roots:[u:v:w]vu+w+iw2u2v2u+w

The disk DCoefs2 here is fully contained in the affine patch w=1 and so we may take the domain to be the unit disk centered at 0 in R2=(u,v) giving the expression below, which is the usual transformation from the Klein disk (u,v)u2+v2<1 to the upper half plane zIm(z)>0 up to possibly a reflection of the domain/codomain, depending on your source.

Roots:(u,v)v1+u+i1u2v21+u

After a linear change of coordinates, the quadratic formula provides the usual identification of the Klein model with the upper half plane model of hyperbolic space.

Appendix

Here’s the easy lemma that was used in the proof: an equivariant map between two spaces each equipped with a transitive G action is determined by its value at a point.

Let G be a group and X,Y spaces each equipped with a transitive G action. Let f,ϕ be two intertwiners of this action, in the sense that f(g.x)=g.f(x) for all g,x and similarly for ϕ. Then, if there is some x with f(x)=ϕ(x), in fact f=ϕ.

Denote f(x)=ϕ(x)=y. Let zX, and choose gG such that z=g.x.
Then evaluating f,ϕ on z gives f(z)=f(g.x)=g.f(x)=g.y and similarly ϕ(z)=ϕ(g.x)=g.ϕ(x)=g.y. Thus f(z)=ϕ(z) so f=ϕ.