1. Reliability
  2. 5. Rooted Trees and Related Spaces
  3. 1
  4. 2
  5. 3
  6. 4
  7. 5
  8. 6

2. Free Semigroups

Let I be a countable alphabet of letters, and let S denote the set of all finite length words using letters from I. The empty word, with no letters, is denoted e. Let denote the concatenation operation on elements of S. That is, suppose that m,nN and that xi,yjI for i{1,2,,m} and j{1,2,,n}. Then x=x1xmS and y=y1ynS, and xy=x1xmy1yn The space (S,) is a discrete, positive semigroup, known as the free semigroup generated by I.

The set of words S is sometimes denoted I, and the elements of I are the irreducible elements of (S,). The adjective free is used because there are no algebra rules imposed on (S,). Here are two examples that we will use in exercises:

The binary semigroup is the free semigroup on the alphabet {0,1}, and so the simplest non-trivial free semigroup. The nonempty words are simply bit strings of finite length.

The genetic semigroup is the free semigroup on the alphabet {A,C,G,T}. In genetics, the letters refer to the nucleotides that are the base building blocks of DNA: adenine, cytosine, guanine, and thymine, respectively. Nonempty words may represent genes.

For xS and iI, let di(x) denote the number of times that letter i occurs in x, and let d(x)=iIdi(x) denote the length of x.

Note that di(x)=0 for all but finitely many iI.

Suppose again that (S,) is the free semigroup corresponding to the countable alphabet I.

  1. Let (S,) denote the partial order graph associated with (S,). Then xy if and only if y=xt for some tS, so that x is a prefix of y and then x1y is the corresponding suffix.
  2. Let (S,) denote the cover graph of (S,). Then xy if and only if y=xi for some iI. Hence (S,) is a regular tree rooted at e of degree #(I).
  3. For nN, let n denote the composition power of of order n. Then xny if and only if y=xt for some tS with d(t)=n.

Since (S,) is a rooted tree, all of the results in Section 1 apply. In particular, if #(I)=kN+, then (S,) is a regular rooted tree of degree k. Specializing further, when k=2 and I={0,1}, so that we have the binary semigroup described in [2], then the corresponding cover graph (S,) is the binary tree. Note also that d(x) is the distance from e to x in the cover graph (S,) for xS, so our notation is consistent. More generally, if xy then d(x,y)=d(y)d(x) is the distance from x to y in (S,). Like all rooted trees, (S,) is a lower semi-lattice and for x,yS, xy is the largest common prefix of x and y. But (S,) is not an upper semi-lattice unless I consists of a single letter.

Let NI={(ni:iI):niN for iI and ni=0 for all but finitely many iI}. We define the multinomial coefficients on NI by C(n)=#{xS:di(x)=ni for iI}=(iIni)!iIni!,n=(ni:iI)NI

The free semigroup has the property that[e,x] is finite and totally ordered for each xS. Moreover, it is the only discrete positive semigroup with this property.

Suppose that (S,) is a discrete positive semigroup with associated partial order and with the property that [e,x] is finite and totally ordered for each xS. Then (S,) is isomorphic to a free semigroup on an alphabet.

Details:

Let I denote the set of irreducible elements of (S,). If xS{e} then i1x for some i1I and hence x=i1y for some yS. If ye then we can repeat the argument to write y=i2z for some i2I and zS. Note that x=i1i2z and hence i1i2x. Moreover, i1 and i1i2 are distinct. Since [e,x] is finite, the process must terminate. Thus, we can write x in the form x=i1i2in for some nN+ and some i1,i2,,inI. Finally, we show that the factorization is unique. Suppose that x=i1i2in=j1j2jm where i1,,inI and j1,,jmI. Then i1x and j1x. Since the elements of [e,x] are totally ordered, we must have i1=j1. Using left cancellation we have i2in=j2jm. Continuing in this fashion it follows that m=n and i1=j1, i2=j2,,in=jn.

Suppose that (S,) is a discrete positive semigroup with the property that every xS has a unique finite factoring over the set of irreducible elements I. Then (S,) is isomporphic to the free semigroup on I.

Details:

Every xS has a finite factoring over I, so that x=i1i2in. If this factoring is unique, then clearly (i1,i2,in)i1i2in is an isomorphism from the free semigroup (I,) onto (S,).

The left walk functions and the left generating functions for the graphs (S,), (S,) and (S,) are given in Section 1, as is the Möbius kernel for (S,). Since we have a discrete positive semigroup, we can give the more basic Möbius function:

The Möbius function μ of (S,) is given by μ(e)=1, μ(i)=1 for iI, and μ(x)=0 for all other xS.

The semigroup dimension is the number of letters.

dim(S,)=#(I).

Details:

Note first that a homomorphism ϕ from (S,) into (R,+) can uniquely be specified by defining ϕ(i) for all iI and then defining ϕ(i1i2in)=ϕ(i1)+ϕ(i2)+ϕ(in) Thus, if ϕ is a such a homomorphism and ϕ(i)=0 for all iI, then ϕ(x)=0 for all xS. Now suppose that BS and #(B)<#(I). We will show that there exists a nonzero homomorphism from (S,) into (R,+) with ϕ(x)=0 for all xB. Let IB denote the set of letters contained in the words in B. Suppose first that IB if a proper subset of I (and note that this must be the case if I is infinite). Define a homomorphism ϕ by ϕ(i)=0 for iIB and ϕ(i)=1 for iIIB. Then ϕ(x)=0 for xB, but ϕ is not the zero homomorphism. Suppose next that IB=I. Thus I is finite, so let k=#(B) and n=#(I), with k<n. Denote the words in B by ij1ij2ijmj,j=1,2,,k The set of linear, homogeneous equations ϕ(ij1)+ϕ(ij2)++ϕ(ijmj)=0,j=1,2,,k has n unkowns, namely ϕ(i) for iI, but only k equations. Hence there exists a non-trivial solution. The homomorphism so constructed satisfies ϕ(x)=0 for xB.

We can use the strict positive semigroup to provide a simple example of a discrete semigroup in which multiples of counting measure # are not the only left invariant measures.

Suppose that I has at least two elements and consider the strict positive semigroup (S+,) associated with the free semigroup (S,). That is, S+=S{e} while is still concatenation. Define the function g:S+(0,) by g(x)=ci if the terminal letter of x is iI, where ci(0,) for each iI and cicj for some i,jI. Let λ be the positive measure on S+ with density g (relative to #). That is, λ(A)=xAg(x)=iIcini(A),AS+ where ni(A) is the number of words in A with terminal letter i for each iI. Then λ is left invariant: If xS and AS then the words in xA have the same terminal letters as the corresponding words in A, so λ(xA)=λ(A). But λ is clearly not a multiple of counting measure.