Polarities (Part 3)
Azimuth 2024-11-05
I’m talking about ‘causal loop diagrams’, which are graph with edges labeled by ‘polarities’. Often the polarities are simply and signs, like here:
But polarities can be elements of any monoid, and last time I argued that things work even better if they’re elements of a rig, so you can not only multiply them but also add them.
In fact, I argued that it’s even better if polarities are elements of a ‘hyperfield’. But what’s a hyperfield, and why are these good?
First, what’s a hyperring? Briefly and roughly, it’s a ring where addition can be multivalued. For example the set
is a hyperring. We multiply things in the obvious way: e.g.
because negative times negative is positive. Whenever possible, we also add things in the obvious way. For example,
because the sum of two negative things is negative. Note that we write addition using the funny symbol since we don’t want people to see equations like
and go insane. But there’s an ambiguous case: adding and When you add a positive thing and a negative thing, it could be positive, negative or zero! Here is where it comes in handy to let addition be multivalued. We write this as
The idea of an algebraic gadget where addition can be multivalued goes back to Frédéric Marty, who introduced ‘hypergroups’ in 1934. In the 1950s, Marc Krasner developed the theory of ‘hyperrings’ and ‘hyperfields’. They took a while to catch on… but by now people have discovered you can do good math with them! As Oleg Viro wrote in 2010:
Krasner, Marshall, Connes and Consani and the author came to hyperfields for different reasons, motivated by different mathematical problems, but we came to the same conclusion: hyperrings and hyperfields are great, very useful and very underdeveloped in the mathematical literature… Probably, the main obstacle for hyperfields to become a mainstream notion is that a multivalued operation does not fit to the tradition of set-theoretic terminology, which forces to avoid multivalued maps at any cost. I believe the taboo on multivalued maps has no real ground, and eventually will be removed.
I discovered them, and also this quote, through the blog articles of Matt Baker, who has been using them in combinatorics. And I think they can be useful in applied math, for computing with qualitative information!
What’s especially nice is that is not just a hyperring: it’s a ‘hyperfield’, meaning that you can also divide by anything except As you’d expect, it works like this:
Furthermore, any field is a hyperfield, so is a hyperfield, and there is a homomorphism of hyperfields
defined so that
This is really cool! Recall that a rational function in several variables is any ratio of polynomials, like
Since the map preserves addition, subtraction, multiplication and division, we can take any equations involving rational functions of real variables, and turn them into equations between rational functions of variables valued in , in a completely systematic way by applying !
That is, we can turn ‘quantitative’ equations involving real numbers, and turn them into ‘qualitative’ equations where all we care about is whether the numbers are positive, negative or zero! And this process is quite well-behaved.
But let me back up a bit and say what hyperrings and hyperfields are.
Hyperrings and hyperfields
To define a ring we usually first define an abelian group, since addition makes any ring into an abelian group. Similarly, to define a hyperring we should first define an ‘abelian hypergroup’. Unfortunately people call this a ‘canonical hypergroup’—I’m not sure why.
I’ll lead up to the definition rather slowly. First, in our canonical hypergroup we want to allow addition to be many-valued… but not undefined. Thus, for any set let be the collection of nonempty subsets of . Then addition in a canonical hypergroup will be a function
But what about expressions like If is actually a subset of what does it mean to add to that? Luckily this is no problem: we can always extend addition
to an operation on nonempty subsets, which we denote by the same symbol:
Here’s how we do it! Suppose and are nonempty subsets of Then we define
That is, consists of all possible values that we can get from adding an element of to an element of Similarly, we define
and similarly for
Now we’re ready for the definition of canonical hypergroup:
Definition. A canonical hypergroup is a set with a map
obeying:
1) the commutative law: for all
2) the associative law: for all
3) the unit law: for all
4) the existence and uniqueness of additive inverses: for every there exists a unique element such that and thus We call this element
5) compatibility of addition and subtraction: if and only if
Notice that while addition is multivalued, taking negatives is not! Similarly, in our addition of ‘hyperring’, only addition will be multivalued. You can imagine being more general, but people don’t do that—and this seems fine for the applications I have in mind.
Definition. A hyperring is a set with:
a) an addition map
b) a multiplication map
c) elements
such that:
1) makes into a canonical hypergroup with unit element
2) makes into a monoid with unit element
3) The distributive laws hold: and for all
4) The zero laws hold: for all
Finally:
Definition. A hyperfield is a hyperring such that every nonzero element has a unique element with . We write this element as
Quotient hyperrings and hyperfields
Here is a nice way to get lots of hyperrings. Let be a ring and let be its group of units, i.e. the set of elements with multiplicative inverses, made into a group using multiplication. Let be any subgroup such that for all Let be the set of equivalence classes of elements of where if and only if for some . Thanks to the condition we can multiply equivalence classes by
On the other hand, we define the sum of equivalence classes and to be the set of all equivalence classes You can check that this makes into a hyperring called a quotient hyperring of
In particular, if is a hyperfield so is
There’s always a map from any hyperring to any of its quotient hyperrings But I forgot to define maps! A map from a hyperring to a hyperring is a function
with
and
for all With this definition, it’s easy to see that the function
sending any to its equivalence class is a map of hyperrings.
The Krasner hyperfield
For example, let be the field If we take to be the group of all nonzero elements of then the quotient hyperring has just two elements. One is the equivalence class of all nonzero reals, while the other is the equivalence classes containing just If we call these two equivalence classes and we get
because the sum of two nonzero reals can be zero or nonzero.
So, we get a 2-element hyperfield called the Krasner hyperfield
If we use elements of the Krasner hyperfield as polarities, what can these polarities mean? One guess is that means has an effect while means does not have an effect.
But this seems useless for causal loop diagrams, since we can use plain old graphs to convey the same information—right? If we want to indicate that has an effect on we draw an edge from to Otherwise we just don’t draw an edge from to
But wait. Causal loop diagrams using the Krasner hyperfield convey more information than plain old graphs! There’s a difference between an edge labeled by and no edge at all.
So, there must be some better interpretation of the polarities and in the Krasner hyperfield. What is it? There may be more than one. What are they—can you help me here?
The sign hyperfield
Here’s an example I understand better: this one seems very important. Start with the field again, but now take to be the group of all positive elements of Now the quotient hyperring has three elements. One is the equivalence class of all positive reals, another is the equivalence class of the negative reals, and the third is the equivalence classes containing just If we call these equivalence classes and we get the sign hyperfield
In this hyperfield, only case where addition is many-valued is
because the sum of a positive real and a negative real can be positive, negative or zero. I’ve already mentioned the map
and now we know, from our general theory, why this is a map of hyperrings!
Next I should say how we use hyperfields (or more general hyperrings) in causal loop diagrams. But not today!