Modeling infinitesimals with 2x2 matrices
At some point I'll try to collect my thoughts on Sol Feferman's Thursday lecture on his alternative to Grothendieck universes, which he objected to as entailing an infinity of inaccessible cardinals. (What was Grothendieck's view of inaccessible cardinals vis a vis his universes?) During the lecture it struck me that his approach was quite like Robinson's approach to infinitesimals, in that it constructed lots of models of what was needed, took the common theory, then constructed a single model from the many, using techniques of Vaught and others to avoid losing too much of the common spirit of the many guided by the common theory (not sure if that captures the idea completely faithfully, but it's something like that). Thus distracted, I found myself wondering yet again why the d^2 = 0 property was so difficult for an infinitesimal d. Having been mulling over the quaternions lately, it seemed to me there was something of an analogy there, some property so built into our very psyche that we can't let go of it. Hamilton finally dropped commutativity, along with any reservations he might have harbored about vandalizing stone bridges in his own town. For the quaternions, d^2 = 0 implies d = 0, so this doesn't help. However the quaternions have a sibling algebra, just as noncommutative, and of exactly the same vector space dimension (in fact the only Clifford such, i.e. the only other real 4D vector space for which ij+ji=0 for all orthogonal vectors i,j having no real component), that is even better known than the quaternions (imagine that). Namely the Clifford algebra of 2x2 real matrices, as a 4D real vector space, made an algebra with matrix multiplication. Why not model d as the matrix 0 1 0 0? This is a perfectly good quantity, adding and scaling just like any real, e.g. 2d = 0 2 0 0. And obviously d^2 = 0. Standard reals x would have the form x 0 0 x 1+d would therefore be 1 1 0 1 (1+d)^2 then becomes 1 2 0 1 as common sense would indicate. The determinant of d being 0, one can't divide by it. But who in their right mind would want to divide by a quantity infinitesimally close to zero? Obviously that's going to produce an infinitely large quantity; if you want to do that, why not just go ahead and divide by zero itself? As Douglas Adams pointed out, you may think the store down the road is a fair way away, but other galaxies are even further away. To a nematode they're all far away. On the other hand 1 2 0 1 has a perfectly good reciprocal, namely 1 -2 0 1 again as suggested by common sense. So the proposal is to base calculus on a field-like object that is a field in the large, but zero divide errors set in when one gets infinitesimally close to zero. Basically what happens with IEEE floating point arithmetic, but modeled with 2x2 real matrices rather than 64-bit numbers. Oh, but what about the noncommutativity of 2x2 matrices, might that mess something up? Actually no, this two-dimensional algebra consisting of matrices of the form a b 0 a is commutative. So only the zero divisors really close to 0 constitute any departure at all from the field axioms. The diagonal element a is the standard real part and the off-diagonal element b in the upper right gives the infinitesimal displacement. So we have a real commutative associative algebra of refined numbers, having a real part and an infinitesimal part, whose only zero divisors are the infinitesimals. We don't *have* to think of them as matrices because we can just write its elements as x+yd by analogy with x+iy, where d is the above matrix representing the prototypical infinitesimal. The square of i is -1, and the square of d is 0. Moreover x and y in x+yd can be complex. We then have numbers x+iy+ud+ivd, which can parsed as either refined complex numbers, namely complex numbers with refined coefficients x+ud+i(y+vd), or complex refined numbers, namely refined numbers with complex coefficients x+iy+(u+iv)d. This is still a real associative algebra, which through force of habit people will no doubt want to call a complex commutative associative algebra, but it could just as legitimately be called a refined associative algebra. Ok, what about commutative? Well, the complex numbers are commutative and the refined numbers are commutative, so how could refining complex numbers make any difference? Well, the reason I wrote x+yd rather than x+dy is that, even though the *natural* thing to do is to make i commute with d, if instead we make id+di=0, the defining condition for Clifford algebras, then we can fit the whole thing into 2x2 *real* matrices! Here I'm using the following 2x2 real matrices for i and d respectively: (0 -1) (0 1) (1 0) (0 0) But now notice that the matrices for 1,i,d,id form a basis for all the 2x2 matrices. In fact *any* 2x2 matrix [[a,b],[c,d]] can be decomposed as (d -c) + (a-d b+c) (c d) ( 0 0 ) (I'd appreciate feedback from anyone for whom the above doesn't typeset readably.) So to read an arbitary 2x2 real matrix as a refined complex number, take the bottom row reversed as the complex part and the departure of the top row from the usual matrix representation of complex numbers as the infinitesimal part, taking care to get both signs right. How did I notice this? Simple. I knew (i) that id+di=0 would make it a Clifford algebra, (ii) there are only two 4D Clifford algebras, and (iii) d^2 = 0 -> d = 0 in the quaternions. This narrows things down to the 2x2 real matrices, there are no other associative algebras with these properties. Getting the above decomposition was then just a matter of solving some trivial linear equations. This is so simple, and the infinitesimals have been worried at for so long, that this *has* to be known already. But then it would really bug me to have been the last to learn about it -- why wasn't I told, as they say? Vaughan Pratt
Correction to my suggestion id+di = 0. Don't do it. id = di is fine as it stands for refined complex numbers, which should be represented in C(2) = 2x2 complex matrices (embeddable in R(4) - 4x4 real matrices) as the obvious extension of the refined reals x+yd. I shouldn't have been so smug about 4D Clifford algebras, this algebra of refined complex numbers doesn't satisfy d^4 = 1, needed if d is to be a Clifford generator. And in fact although di = 1 0 0 0 we have id = 0 0 0 1 (I should have checked that more carefully.) I thought about trying to make the infinitesimals points on the "light cone" of R(2) (the singular matrices) but couldn't get that to work. So 2x2 complex matrices with id = di is the best I could think of. This works for modeling the refined complex numbers (barring any other errors), but with nothing left to motivate id+di = 0. The representation x+iy+dv+idw is fine, with idw = diw = wid etc., all is commutative. (I was hoping too hard for the excitement of noncommutativity, this is boringly noninteractive as it stands.) Vaughan
I'm told that Bell's "microlinear calculus" in his 1998 book on infinitesimals is equivalent to the matrix approach I suggested, so that was not new after all, other than perhaps its formulation in terms of 2x2 matrices. On the other hand it is apparently mixed in with Bell's strongly intuitionistic outlook, whereas it would seem intuitively that something so simple as a model of d^2 = 0 should transcend whether one is working intuitionistically or classically. A more classical version of Bell's account might be of interest (perhaps to relatively few people on the categories mailing list though, which seems to have a strongly intuitionistic slant). Meanwhile I received the April issue of Mathematics Magazine just now, and it has an article on pp. 118-129 on "Geometry of Generalized Complex Numbers" by Anthony and Joseph Harkin. The microlinear calculus, under the names "Study product" and "parabolic complex numbers," apparently dates back to Study's 1903 book Geometrie der Dynamen. The Harkins associate i^2 = -1,0,1 with respectively Ordinary (i.e. complex) product, Study product, and Clifford product (though Clifford algebras include ordinary product as well, the quaternions being a Clifford algebra). The article makes no mention of infinitesimals, and it would be interesting to try to find the appropriate infinitesimal interpretations of the geometric properties of the parabolic complex plane. One approach I very much like to infinitesimals that I haven't seen in the nonstandard analysis/infinitesimal literature (but would certainly appreciate pointers) is one that does all the work with what one might call finitesimals. A finitesimal h is just a positive real that you plan one day to reduce to zero, and thus organize everything around it to that end. Polynomials in R[x] of degree d form a (d+1)-dimensional vector space. The usual basis for this space is the d+1 monomials x^i for i in 0..d. However if one fixes h > 0 and takes the basis to be 1, x, x(x-h), x(x-h)(x-2h),... then Boole's difference calculus works essentially identically to the infinitesimal calculus for polynomials represented in the monomial basis. Since h is a free variable throughout the development, one can do all the work first and then drive h to 0 uniformly everywhere at the end. Expressions such as x^i (Knuth writes an underbar under the i and calls it "x to the falling i") mention h only implicitly and hence don't change (as symbolic expressions) as h changes, though their numerical values at any given x change. The Stirling numbers of the first and second kind, organized as matrices, constitute linear transformations from the bases for h=1 to h=0 and back again, respectively. I've looked from time to time at how one might extend this to exponentials and logarithms, but have never been satisfied with the results. It would be nice to know how to deal exactly with exp(it) for nonzero h. If this were possible it might give an even nicer constructive treatment of infinitesimals than the others, and one that didn't care at all whether one was classically or intuitionistically inclined. Vaughan Pratt
participants (2)
-
Vaughan Pratt -
Vaughan Pratt