Thursday, 10 March 2016

Semigroups and friends

A gentle introduction to abstract algebra

What does it take for a semigroup to become a monoid? Why are all groups isomorphic to a group of permutations, and what does that even mean? When do I need to use a field rather than a plain old ring? But more importantly—what do all these questions have in common?

You’ll be forgiven if you didn’t guess abstract algebra, because another thing they share is cryptic terminology. Magmas, semigroups, monoids, groups, Abelian groups, rings, and fields (or, as I like to summarize them, semigroups and friends) are the names mathematicians have given to the things we’re about to discuss. Maybe they used up all the good names centuries ago; maybe they want to appear smarter than everyone else; maybe naming things is really hard.¹ However, the problem with abstract algebra is that it’s, well, abstract. These things have no obvious connection to familiar concepts. They are unique entities in an eternal Platonic realm.

When I say “things,” I mean algebraic structures. There are infinitely many algebraic structures, but I’m only going to focus on seven common types. Let me repeat my list: magma, semigroup, monoid, group, Abelian group, ring, and field. Although these intimidating names would have you believe otherwise, the concepts underlying them are quite simple. Understanding them requires no mathematical background other than basic set theory. With that in mind, let’s start with some definitions!

An algebraic structure is a set (called the underlying set) together with one or more operations on the set that satisfy certain axioms.² The arity of an operation is the number of operands it takes. Most operations are either unary, like negation, or binary, like addition and multiplication. In general, an operation on a set $S$ with arity $n$ is a function $f : S^{n} \to S .$ Since operations are conceptually different from ordinary functions, we represent them with symbols rather than letters. The notation also changes a bit: for unary operations we drop the parentheses, and for binary operations we use infix notation. Instead of $* (x)$ and $∙ (x, y),$ we write $* x$ and $x ∙ y .$

As is the case for most interesting mathematical objects, interesting algebraic structures usually have patterns. The axioms of an algebraic structure are descriptions of these patterns. Unlike the underlying set and the operations, the axioms are not part of the algebraic structure—they are facts about it. For any given algebraic structure and axiom, the structure either satisfies the axiom or it doesn’t. We could go on forever, observing more and more obscure facts about the structure and calling them axioms. Typically, though, our purpose is to classify algebraic structures by their adherence to a small set of axioms.

Let’s construct our first algebraic structure. Let $S = \{a, b\}$ and let $*$ be a unary operation on it where $* a = b$ and $* b = a .$ Then $(S, *)$ is an algebraic structure! Before you dismiss it, take a moment to examine its properties. We know $* a = b,$ and $* (* a) = a,$ and so on … is there anything else to learn about it? Well, because this structure is so simple, we can come up with general answers to just about any question that could be asked of it. Observe that all expressions have the form $* (\dots (* x)),$ and their values are entirely determined by $x$ and the parity³ of the number of applications of $* .$ We could make similar statements about the solutions to all possible equations in one or two variables. What we really need, though, is proof: can you find and prove the general solution to all equations in $(S, *) ?$
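
The parity claim is easy to check by brute force. Here is a minimal Python sketch; the names `STAR`, `apply_star`, and `by_parity` are my own, chosen only for illustration. It compares the result of applying $*$ repeatedly against a prediction made from the parity of the count alone. This is a sanity check for small counts, not the proof asked for above.

```python
# The two-element set S = {a, b} and the unary operation * that swaps them.
S = {"a", "b"}
STAR = {"a": "b", "b": "a"}

def apply_star(x, times):
    """Apply * to x the given number of times, one application at a time."""
    for _ in range(times):
        x = STAR[x]
    return x

def by_parity(x, times):
    """Predict the same result using only x and the parity of `times`."""
    return x if times % 2 == 0 else STAR[x]

# Compare both functions for every element and the first few counts.
for x in sorted(S):
    for n in range(10):
        assert apply_star(x, n) == by_parity(x, n)

print(apply_star("a", 3))  # prints b
```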

Now we’re ready to get acquainted with the semigroup and its friends. Of the seven, the first five have a single binary operation and the last two have two binary operations. We’ll start with the simplest one. A magma is an algebraic structure $(S, ∙)$ where $∙$ is a binary operation that is closed over $S .$ Notice I say a magma, not the magma; there are infinitely many magmas, each corresponding to different choices for $S$ and $∙ .$ Now, when I say $∙$ is “closed over $S,$ ” I mean it has closure, which brings us to our first axiom:

Closure: If $a$ and $b$ are in $S,$ then $a ∙ b$ is in $S$ as well.

This axiom is arguably redundant: an operation on $S$ has codomain $S$ by definition, so it must be closed. However, it’s customary to include it anyway, for reasons that I won’t go into now.

Most common algebraic properties have names. The closure property is so common that a statement like, “The operation $∙$ has closure over $S,$ ” is rarely accompanied by further explanation. In those cases where explanation is necessary, mathematicians sometimes prefer symbols over words for their conciseness and precision. Rather than saying, “If $a$ and $b$ are in $S,$ then $a ∙ b$ is in $S$ as well,” we can write, $\forall a, b \in S : a ∙ b \in S .$ Pronouncing $\forall$ as “for all” and $\in$ as “in,” this reads, “For all $a$ and $b$ in $S,$ $a ∙ b$ is in $S .$ ”
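
For a finite set, the quantified statement translates almost word for word into code. The sketch below is only an illustration: `is_closed` and `Z4` are hypothetical names, and the operation is passed in as an ordinary Python function, which (unlike an operation on $S$ in the strict sense) is free to return values outside the set.

```python
from itertools import product

def is_closed(elements, op):
    """True if op(a, b) is in `elements` for all a, b in `elements`."""
    return all(op(a, b) in elements for a, b in product(elements, repeat=2))

Z4 = {0, 1, 2, 3}

print(is_closed(Z4, lambda a, b: (a + b) % 4))  # True
print(is_closed(Z4, lambda a, b: a + b))        # False, since 3 + 3 = 6 is not in Z4
```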

We still have six types of algebraic structure left to go, but instead of defining them one at a time, I’m going to throw five axioms at you:

Closure: If $a$ and $b$ are in $S,$ then $a ∙ b$ is in $S$ as well.
Associativity: If $a,$ $b,$ and $c$ are in $S,$ then $a ∙ (b ∙ c) = (a ∙ b) ∙ c .$
Identity: There is an $e$ in $S$ such that $a ∙ e = e ∙ a = a$ for any $a$ in $S .$
Inverse: For any $a$ in $S,$ there is a corresponding $b$ in $S$ such that $a ∙ b = b ∙ a = e,$ where $e$ is the identity element.
Commutativity: If $a$ and $b$ are in $S,$ then $a ∙ b = b ∙ a .$

If you’re comfortable with predicate logic, you may prefer this format:

	Name	Axiom
1	Closure	$\forall a, b \in S : a ∙ b \in S$
2	Associativity	$\forall a, b, c \in S : a ∙ (b ∙ c) = (a ∙ b) ∙ c$
3	Identity	$\exists e \in S : \forall a \in S : a ∙ e = e ∙ a = a$
4	Inverse	$\forall a \in S : \exists b \in S : a ∙ b = b ∙ a = e$
5	Commutativity	$\forall a, b \in S : a ∙ b = b ∙ a$

Magmas, semigroups, monoids, groups, and Abelian groups build on top of each other. In fact, they’re nothing more than shorthand for specifying how many of these five axioms to include:

Magma: An algebraic structure with a closed binary operation (axiom 1).
Semigroup: An associative magma (axioms 1 and 2).
Monoid: A semigroup that has an identity element (axioms 1 to 3).
Group: A monoid that has inverse elements (axioms 1 to 4).
Abelian group: A commutative group (axioms 1 to 5).
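
Because each definition just adds one more axiom, a finite structure can be classified by checking the axioms in order and stopping at the first one that fails. The following Python sketch does this by brute force. The function name `classify` and the example structures are my own choices, and the approach only works when the underlying set is small and finite.

```python
from itertools import permutations, product

def classify(elements, op):
    """Name the most specific of the five structures that (elements, op) forms."""
    elems = list(elements)
    pairs = list(product(elems, repeat=2))
    triples = list(product(elems, repeat=3))

    # Axiom 1: closure.
    if not all(op(a, b) in elements for a, b in pairs):
        return "not a magma"
    # Axiom 2: associativity.
    if not all(op(a, op(b, c)) == op(op(a, b), c) for a, b, c in triples):
        return "magma"
    # Axiom 3: identity (a two-sided identity is unique if it exists).
    identities = [e for e in elems if all(op(a, e) == a == op(e, a) for a in elems)]
    if not identities:
        return "semigroup"
    e = identities[0]
    # Axiom 4: every element has a two-sided inverse.
    if not all(any(op(a, b) == e == op(b, a) for b in elems) for a in elems):
        return "monoid"
    # Axiom 5: commutativity.
    if not all(op(a, b) == op(b, a) for a, b in pairs):
        return "group"
    return "Abelian group"

Z4 = {0, 1, 2, 3}
S3 = set(permutations(range(3)))  # the six orderings of (0, 1, 2)

print(classify(Z4, lambda a, b: (a - b) % 4))              # magma
print(classify({0, 1}, lambda a, b: a))                    # semigroup
print(classify({1, 2, 3}, max))                            # monoid
print(classify(S3, lambda p, q: tuple(p[i] for i in q)))   # group
print(classify(Z4, lambda a, b: (a + b) % 4))              # Abelian group
```

The fourth example composes permutations of three items, which is a standard example of a group that is not commutative.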

The next two on my list have an extra binary operation, so they need slightly longer definitions. A ring is an algebraic structure with two binary operations $(R, \oplus, ⊙)$ where $(R, \oplus)$ forms an Abelian group, $(R, ⊙)$ forms a monoid, and $⊙$ is distributive with respect to $\oplus$ on the left and the right:

Left distributivity: $\forall a, b, c \in R : a ⊙ (b \oplus c) = (a ⊙ b) \oplus (a ⊙ c) .$
Right distributivity: $\forall a, b, c \in R : (b \oplus c) ⊙ a = (b ⊙ a) \oplus (c ⊙ a) .$
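
Both distributivity laws can be tested exhaustively on a small example. The sketch below uses the integers modulo 6 with addition and multiplication modulo 6 standing in for $\oplus$ and $⊙$; the names `is_distributive`, `add6`, and `mul6` are assumptions of mine, not anything standard.

```python
from itertools import product

def is_distributive(elements, add, mul):
    """Check that mul distributes over add on the left and on the right."""
    return all(
        mul(a, add(b, c)) == add(mul(a, b), mul(a, c))
        and mul(add(b, c), a) == add(mul(b, a), mul(c, a))
        for a, b, c in product(elements, repeat=3)
    )

Z6 = range(6)
add6 = lambda a, b: (a + b) % 6
mul6 = lambda a, b: (a * b) % 6

print(is_distributive(Z6, add6, mul6))  # True
print(is_distributive(Z6, mul6, add6))  # False
```

The second call swaps the roles of the two operations, which shows that the axiom is not symmetric: multiplication distributes over addition here, but addition does not distribute over multiplication.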

A field is a special type of ring. Let $(R, \oplus, ⊙)$ be a ring and let $D = R \setminus \{e\},$ where $e$ is the identity element for $\oplus;$ that is, $D$ contains every element of the underlying set except $e .$ Then $(R, \oplus, ⊙)$ is a field if $(D, ⊙)$ forms an Abelian group.
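
To see the field condition in action, take the ring of integers modulo $n$, whose additive identity is 0. The Python sketch below is a brute-force check under that assumption, with a hypothetical function name. It takes the ring axioms for granted and tests only whether the nonzero elements form an Abelian group under multiplication modulo $n .$

```python
from itertools import product

def nonzero_elements_form_abelian_group(n):
    """Check whether (D, *) is an Abelian group, where D = {1, ..., n - 1}
    and * is multiplication modulo n."""
    D = list(range(1, n))
    mul = lambda a, b: (a * b) % n

    # Closure: fails when two nonzero elements multiply to 0.
    if not all(mul(a, b) in D for a, b in product(D, repeat=2)):
        return False
    # Associativity and commutativity.
    if not all(mul(a, mul(b, c)) == mul(mul(a, b), c) for a, b, c in product(D, repeat=3)):
        return False
    if not all(mul(a, b) == mul(b, a) for a, b in product(D, repeat=2)):
        return False
    # Identity, then an inverse for every element.
    identities = [e for e in D if all(mul(a, e) == a for a in D)]
    if not identities:
        return False
    e = identities[0]
    return all(any(mul(a, b) == e for b in D) for a in D)

print(nonzero_elements_form_abelian_group(5))  # True
print(nonzero_elements_form_abelian_group(6))  # False, since 2 * 3 = 0 modulo 6
print([n for n in range(2, 12) if nonzero_elements_form_abelian_group(n)])  # [2, 3, 5, 7, 11]
```

The last line reflects a known fact: the integers modulo $n$ form a field exactly when $n$ is prime.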

That’s it! You’ve now been introduced to all seven of them. They may seem peculiar and overly abstract, but you’ve actually been using these structures ever since you learned arithmetic. In particular, $(ℤ, +)$ is an Abelian group, $(ℤ, +, \times)$ is a ring, and $(ℚ, +, \times)$ is a field. But they aren’t the only ones: the power of abstract algebra is that it allows us to abstract ourselves away from the familiar instances. Rather than studying these specific structures, whose underlying sets contain numbers like 1 and 2, mathematicians study the general, abstract structure of any such algebra, because the structure is what matters.

Abstract algebra is the study of algebraic structures: sets imbued with structure by operations. Magmas, semigroups, monoids, groups, Abelian groups, rings, and fields are just a few varieties of algebraic structure. And abstract as they are, they do exist in the real world! When you solve a Rubik’s cube, you are dealing with group theory. When you split the bill at a restaurant, you are dealing with operations on a field. Each of these is a rich area of mathematics in itself, and I look forward to exploring them further. If you want to learn more about group theory, I recommend Introduction to Group Theory. You would be surprised at how vast and intricate a world is generated by those four simple axioms.

¹ “There are only two hard things in Computer Science: cache invalidation and naming things” (Phil Karlton). I expect the situation with respect to naming things is similar in mathematics.
² The definition is sometimes extended to allow for zero operations or for more than one underlying set, but we’re going to keep things simple.
³ Parity means the fact of being even or odd.