Let be a countable alphabet of letters, and let denote the set of all finite length words using letters from . The empty word, with no letters, is denoted . Let denote the concatenation operation on elements of . That is, suppose that and that for and . Then and , and
The space is a discrete, positive semigroup, known as the free semigroup generated by .
The set of words is sometimes denoted , and the elements of are the irreducible elements of . The adjective free is used because there are no algebra rules
imposed on . Here are two examples that we will use in exercises:
The binary semigroup is the free semigroup on the alphabet , and so the simplest non-trivial free semigroup. The nonempty words are simply bit strings of finite length.
The genetic semigroup is the free semigroup on the alphabet . In genetics, the letters refer to the nucleotides that are the base building blocks of DNA: adenine, cytosine, guanine, and thymine, respectively. Nonempty words may represent genes.
For and , let denote the number of times that letter occurs in , and let denote the length of .
Note that for all but finitely many .
Suppose again that is the free semigroup corresponding to the countable alphabet .
- Let denote the partial order graph associated with . Then if and only if for some , so that is a prefix of and then is the corresponding suffix.
- Let denote the cover graph of . Then if and only if for some . Hence is a regular tree rooted at of degree .
- For , let denote the composition power of of order . Then if and only if for some with .
Since is a rooted tree, all of the results in Section 1 apply. In particular, if , then is a regular rooted tree of degree . Specializing further, when and , so that we have the binary semigroup described in [2], then the corresponding cover graph is the binary tree. Note also that is the distance from to in the cover graph for , so our notation is consistent. More generally, if then is the distance from to in . Like all rooted trees, is a lower semi-lattice and for , is the largest common prefix of and . But is not an upper semi-lattice unless consists of a single letter.
Let . We define the multinomial coefficients on by
The free semigroup has the property that is finite and totally ordered for each . Moreover, it is the only discrete positive semigroup with this property.
Suppose that is a discrete positive semigroup with associated partial order and with the property that is finite and totally ordered for each . Then is isomorphic to a free semigroup on an alphabet.
Details:
Let denote the set of irreducible elements of . If then for some and hence for some . If then we can repeat the argument to write for some and . Note that and hence . Moreover, and are distinct. Since is finite, the process must terminate. Thus, we can write in the form for some and some . Finally, we show that the factorization is unique. Suppose that where and . Then and . Since the elements of are totally ordered, we must have . Using left cancellation we have . Continuing in this fashion it follows that and , .
Suppose that is a discrete positive semigroup with the property that every has a unique finite factoring over the set of irreducible elements . Then is isomporphic to the free semigroup on .
Details:
Every has a finite factoring over , so that . If this factoring is unique, then clearly is an isomorphism from the free semigroup onto .
The left walk functions and the left generating functions for the graphs , and are given in Section 1, as is the Möbius kernel for . Since we have a discrete positive semigroup, we can give the more basic Möbius function:
The Möbius function of is given by , for , and for all other .
The semigroup dimension is the number of letters.
.
Details:
Note first that a homomorphism from into can uniquely be specified by defining for all and then defining
Thus, if is a such a homomorphism and for all , then for all . Now suppose that and . We will show that there exists a nonzero homomorphism from into with for all . Let denote the set of letters contained in the words in . Suppose first that if a proper subset of (and note that this must be the case if is infinite). Define a homomorphism by for and for . Then for , but is not the zero homomorphism. Suppose next that . Thus is finite, so let and , with . Denote the words in by
The set of linear, homogeneous equations
has unkowns, namely for , but only equations. Hence there exists a non-trivial solution. The homomorphism so constructed satisfies for .
We can use the strict positive semigroup to provide a simple example of a discrete semigroup in which multiples of counting measure are not the only left invariant measures.
Suppose that has at least two elements and consider the strict positive semigroup associated with the free semigroup . That is, while is still concatenation. Define the function by if the terminal letter of is , where for each and for some . Let be the positive measure on with density (relative to ). That is,
where is the number of words in with terminal letter for each . Then is left invariant: If and then the words in have the same terminal letters as the corresponding words in , so . But is clearly not a multiple of counting measure.