Coverage for local/lib/python2.7/site-packages/sage/combinat/tutorial.py: 0%

27493510569775696512677516320986352688173429315980054758203125984302147328114964173055050741660736621590157844774296248940493063070200461792764493033510116079342457190155718943509725312466108452006369558934464248716828789832182345009262853831404597021307130674510624419227311238999702284408609370935531629697851569569892196108480158600569421098519

Partitions of integers are combinatorial objects naturally equipped with

many operations. They are therefore returned as objects that are

richer than simple lists::

sage: P7 = Partitions(7)

sage: p = P7.unrank(5); p

[4, 2, 1]

sage: type(p)

<class 'sage.combinat.partition.Partitions_n_with_category.element_class'>

For example, they can be represented graphically by a Ferrers diagram::

sage: print(p.ferrers_diagram())

****

**

*

We leave it to the user to explore by introspection the available

operations.

Note that we can also construct a partition directly by::

sage: Partition([4,2,1])

[4, 2, 1]

or::

sage: P7([4,2,1])

[4, 2, 1]

If one wants to restrict the possible values of the parts

`i_1,\dots,i_\ell` of the partition as, for example, when giving

change, one can use ``WeightedIntegerVectors``. For example, the

following calculation::

sage: WeightedIntegerVectors(8, [2,3,5]).list()

[[0, 1, 1], [1, 2, 0], [4, 0, 0]]

shows that to make 8 dollars using 2, 3, and 5 dollar bills, one can

use a 3 and a 5 dollar bill, or a 2 and two 3 dollar bills, or four 2

dollar bills.
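The same enumeration can be reproduced outside ``Sage``. The following is a minimal brute-force sketch in plain Python, not the algorithm used by ``WeightedIntegerVectors``; the helper name ``weighted_vectors`` is hypothetical and introduced only for illustration.

```python
from itertools import product

def weighted_vectors(total, weights):
    """Yield every list v of non-negative integers with
    sum(v[i] * weights[i]) == total (brute force over a bounding box)."""
    # Coordinate i can never exceed total // weights[i].
    ranges = [range(total // w + 1) for w in weights]
    for v in product(*ranges):
        if sum(c * w for c, w in zip(v, weights)) == total:
            yield list(v)

# Ways to pay 8 dollars with 2, 3 and 5 dollar bills.
change = list(weighted_vectors(8, [2, 3, 5]))
print(change)
```

Each vector gives the number of bills of each denomination, so ``[1, 2, 0]`` stands for one 2 dollar bill and two 3 dollar bills.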

Compositions of integers are manipulated the same way::

sage: C5 = Compositions(5); C5

Compositions of 5

sage: C5.cardinality()

16

sage: C5.list()

[[1, 1, 1, 1, 1], [1, 1, 1, 2], [1, 1, 2, 1], [1, 1, 3],

[1, 2, 1, 1], [1, 2, 2], [1, 3, 1], [1, 4], [2, 1, 1, 1],

[2, 1, 2], [2, 2, 1], [2, 3], [3, 1, 1], [3, 2], [4, 1], [5]]

The number `16` above seems significant and suggests the existence of a

formula. We look at the number of compositions of `n` ranging

from `0` to `9`::

sage: [ Compositions(n).cardinality() for n in range(10) ]

[1, 1, 2, 4, 8, 16, 32, 64, 128, 256]

Similarly, if we consider the number of compositions of `5` by

length, we find a line of Pascal’s triangle::

sage: x = var('x')

sage: sum( x^len(c) for c in C5 )

x^5 + 4*x^4 + 6*x^3 + 4*x^2 + x

The above example uses a functionality which we have not seen yet:

``C5`` being iterable, it can be used like a list in a ``for`` loop or

a comprehension (:ref:`section-bricks-iterators`).
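The two tallies above can also be recomputed without ``Sage``. This is a small sketch in plain Python, assuming nothing beyond the definition of a composition (an ordered list of positive integers with a given sum); the generator ``compositions`` is a hypothetical helper, not Sage's ``Compositions``, and its output order differs from the listing above.

```python
def compositions(n):
    """Yield the compositions of n as lists of positive integers."""
    if n == 0:
        yield []
        return
    for first in range(1, n + 1):
        # Choose the first part, then recurse on what remains.
        for rest in compositions(n - first):
            yield [first] + rest

# Number of compositions of n for n = 0, ..., 9.
counts = [sum(1 for _ in compositions(n)) for n in range(10)]

# Number of compositions of 5 of each length k = 1, ..., 5.
by_length = [sum(1 for c in compositions(5) if len(c) == k)
             for k in range(1, 6)]

print(counts)
print(by_length)
```

Because ``compositions`` is a generator, it is itself an iterable and can be used directly in the comprehensions, just like ``C5``.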

Prove the formulas suggested by the above examples for the number of

compositions of `n` and the number of compositions of

`n` of length `k`; investigate by introspection

whether ``Sage`` uses these formulas for calculating cardinalities.

.. _section-bricks-divers:

Some other finite enumerated sets

~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~

Essentially, the principle is the same for all the finite sets with

which one wants to do combinatorics in ``Sage``; begin by constructing

an object which models this set, and then supply appropriate methods,

following a uniform interface [1]_. We now give a few more typical

examples.

Intervals of integers::

sage: C = IntegerRange(3, 21, 2); C

{3, 5, ..., 19}

sage: C.cardinality()

9

sage: C.list()

[3, 5, 7, 9, 11, 13, 15, 17, 19]

Permutations::

sage: C = Permutations(4); C

Standard permutations of 4

sage: C.cardinality()

24

sage: C.list()

[[1, 2, 3, 4], [1, 2, 4, 3], [1, 3, 2, 4], [1, 3, 4, 2],

[1, 4, 2, 3], [1, 4, 3, 2], [2, 1, 3, 4], [2, 1, 4, 3],

[2, 3, 1, 4], [2, 3, 4, 1], [2, 4, 1, 3], [2, 4, 3, 1],

[3, 1, 2, 4], [3, 1, 4, 2], [3, 2, 1, 4], [3, 2, 4, 1],

[3, 4, 1, 2], [3, 4, 2, 1], [4, 1, 2, 3], [4, 1, 3, 2],

[4, 2, 1, 3], [4, 2, 3, 1], [4, 3, 1, 2], [4, 3, 2, 1]]

Set partitions::

sage: C = SetPartitions([1,2,3])

sage: C

Set partitions of {1, 2, 3}

sage: C.cardinality()

5

sage: C.list()

[{{1, 2, 3}}, {{1}, {2, 3}}, {{1, 3}, {2}}, {{1, 2}, {3}},

{{1}, {2}, {3}}]

Partial orders on a set of `8` elements, up to isomorphism::

sage: C = Posets(8); C

Posets containing 8 elements

sage: C.cardinality()

16999

sage: C.unrank(20).plot()

Graphics object consisting of 20 graphics primitives

.. image:: ../../media/a_poset.png

One can iterate through all graphs up to isomorphism. For example,

there are 34 simple graphs with 5 vertices::

sage: len(list(graphs(5)))

34

Here are those with at most `4` edges::

sage: up_to_four_edges = list(graphs(5, lambda G: G.size() <= 4))

sage: pretty_print(*up_to_four_edges)

.. image:: ../../media/graphs-5.png

However, the *set* ``C`` of these graphs is not yet available in

``Sage``; as a result, the following commands are not yet

implemented::

sage: C = Graphs(5) # todo: not implemented

sage: C.cardinality() # todo: not implemented

34

sage: Graphs(19).cardinality() # todo: not implemented

24637809253125004524383007491432768

sage: Graphs(19).random_element() # todo: not implemented

Graph on 19 vertices

What we have seen so far also applies, in principle, to finite algebraic

structures like the dihedral groups::

sage: G = DihedralGroup(4); G

Dihedral group of order 8 as a permutation group

sage: G.cardinality()

8

sage: G.list()

[(), (1,4)(2,3), (1,2,3,4), (1,3)(2,4), (1,3), (2,4), (1,4,3,2), (1,2)(3,4)]

or the algebra of `2\times 2` matrices over the finite field

`\ZZ/2\ZZ`::

sage: C = MatrixSpace(GF(2), 2)

sage: C.list()

[

[0 0] [1 0] [0 1] [0 0] [0 0] [1 1] [1 0] [1 0] [0 1] [0 1]

[0 0], [0 0], [0 0], [1 0], [0 1], [0 0], [1 0], [0 1], [1 0], [0 1],

[0 0] [1 1] [1 1] [1 0] [0 1] [1 1]

[1 1], [1 0], [0 1], [1 1], [1 1], [1 1]

]

sage: C.cardinality()

16

.. topic:: Exercise

List all the monomials of degree `5` in three variables (see

``IntegerVectors``). Manipulate the ordered set partitions

``OrderedSetPartitions`` and standard tableaux

(``StandardTableaux``).

.. _exercise-alternating-sign-matrices:

.. topic:: Exercise

List the alternating sign matrices of size `3`, `4`,

and `5` (``AlternatingSignMatrices``), and try to guess the

definition. The discovery and proof of the formula for the

enumeration of these matrices (see the method ``cardinality``),

motivated by calculations of determinants in physics, is quite a

story. In particular, the first proof, given by Zeilberger in 1992,

was automatically produced by a computer program. It was 84 pages long,

and required nearly a hundred people to verify it.

.. topic:: Exercise

Calculate by hand the number of vectors in `(\ZZ/2\ZZ)^5`, and

the number of matrices in `GL_3(\ZZ/2\ZZ)` (that is to say,

the number of invertible `3\times 3` matrices with

coefficients in `\ZZ/2\ZZ`). Verify your answer with ``Sage``.

Generalize to `GL_n(\ZZ/q\ZZ)`.

.. _section-bricks-iterators:

Set comprehension and iterators

~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~

We will now show some of the possibilities offered by ``Python`` for

constructing (and iterating through) sets, with a notation that is

flexible and close to usual mathematical usage, and in particular the

benefits this yields in combinatorics.

We begin by constructing the finite set

`\{i^2 \mid i \in \{1,3,7\}\}`::

sage: [ i^2 for i in [1, 3, 7] ]

[1, 9, 49]

and then the same set, but with `i` running from `1` to

`9`::

sage: [ i^2 for i in range(1,10) ]

[1, 4, 9, 16, 25, 36, 49, 64, 81]

A construction of this form in ``Python`` is called a *list comprehension*, by analogy with set-builder notation.

A clause can be added to keep only those elements with `i` prime::

sage: [ i^2 for i in range(1,10) if is_prime(i) ]

[4, 9, 25, 49]

Combining more than one comprehension, it is possible to construct

the set `\{(i,j) \mid 1\leq j < i \leq 5\}`::

sage: [ (i,j) for i in range(1,6) for j in range(1,i) ]

[(2, 1), (3, 1), (3, 2), (4, 1), (4, 2), (4, 3),

(5, 1), (5, 2), (5, 3), (5, 4)]

or to produce Pascal’s triangle::

sage: [[binomial(n,i) for i in range(n+1)] for n in range(10)]

[[1],

[1, 1],

[1, 2, 1],

[1, 3, 3, 1],

[1, 4, 6, 4, 1],

[1, 5, 10, 10, 5, 1],

[1, 6, 15, 20, 15, 6, 1],

[1, 7, 21, 35, 35, 21, 7, 1],

[1, 8, 28, 56, 70, 56, 28, 8, 1],

[1, 9, 36, 84, 126, 126, 84, 36, 9, 1]]
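The examples above rely on Sage's preparser, which reads ``^`` as exponentiation. In plain Python ``^`` is the bitwise exclusive or, so powers are written ``**``. The sketch below restates the comprehensions in plain Python; ``is_prime`` is a hypothetical trial-division helper standing in for Sage's function of the same name.

```python
from math import comb

def is_prime(n):
    """Trial division; adequate for the tiny integers used here."""
    return n >= 2 and all(n % d for d in range(2, int(n**0.5) + 1))

# Comprehension with a filtering clause.
squares_of_primes = [i**2 for i in range(1, 10) if is_prime(i)]

# Two nested "for" clauses: all pairs with 1 <= j < i <= 5.
pairs = [(i, j) for i in range(1, 6) for j in range(1, i)]

# A comprehension inside a comprehension: first rows of Pascal's triangle.
pascal = [[comb(n, i) for i in range(n + 1)] for n in range(5)]

# Curly braces build an actual Python set instead of a list.
as_set = {i**2 for i in [1, 3, 7]}
```

With square brackets the result is a list, which keeps order and repetitions; with curly braces it is a set.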

The execution of a comprehension is accomplished in two steps; first

an *iterator* is constructed, and then a list is filled with the

elements successively produced by the iterator. Technically, an

*iterator* is an object from which the built-in function ``next`` obtains a

new value each time it is called (in Python 3, through the method

``__next__``), until the iterator is exhausted. For example, the

following iterator ``it``::

sage: it = (binomial(3, i) for i in range(4))

returns successively the binomial coefficients `\binom 3 i` with

`i=0,1,2,3`::

sage: next(it)

1

sage: next(it)

3

sage: next(it)

3

sage: next(it)

1

When the iterator is finally exhausted, an exception is raised::

sage: next(it)

Traceback (most recent call last):

...

StopIteration
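To make the protocol concrete, here is a hand-written iterator in plain Python that behaves like the generator expression above. It is only an illustrative sketch: the class name ``BinomialRow`` is hypothetical, and in practice a generator expression is shorter.

```python
from math import comb

class BinomialRow:
    """Iterator over the binomial coefficients C(n, 0), ..., C(n, n)."""

    def __init__(self, n):
        self.n = n
        self.i = 0          # index of the next coefficient to produce

    def __iter__(self):
        # An iterator is its own iterable.
        return self

    def __next__(self):
        if self.i > self.n:
            raise StopIteration   # signals exhaustion to the caller
        value = comb(self.n, self.i)
        self.i += 1
        return value

it = BinomialRow(3)
first = next(it)     # one explicit step
rest = list(it)      # list() keeps calling next() until StopIteration
```

Once exhausted, the object keeps raising ``StopIteration``; a fresh ``BinomialRow(3)`` is needed to iterate again.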

More generally, an *iterable* is a ``Python`` object ``L`` (a list,

a set, ...) over whose elements it is possible to iterate. Technically,

the iterator is constructed by ``iter(L)``. In practice, the commands

``iter`` and ``next`` are used very rarely, since ``for`` loops and list

comprehensions provide a much more pleasant syntax::

sage: for s in Subsets(3):

....:     print(s)

{}

{1}

{2}

{3}

{1, 2}

{1, 3}

{2, 3}

{1, 2, 3}

sage: [ s.cardinality() for s in Subsets(3) ]

[0, 1, 1, 1, 2, 2, 2, 3]

What is the point of an iterator? Consider the following example::

sage: sum( [ binomial(8, i) for i in range(9) ] )

256

When it is executed, a list of `9` elements is constructed, and

then it is passed as an argument to ``sum`` to add them up. If, on the

other hand, the iterator is passed directly to ``sum`` (note the absence

of square brackets)::

sage: sum( binomial(8, i) for i in range(9) )

256

the function ``sum`` receives the iterator directly, and can

short-circuit the construction of the intermediate list. If there are a

large number of elements, this avoids allocating a large quantity of

memory to fill a list which will be immediately destroyed [2]_.
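The difference in memory can be observed directly. The sketch below, in plain Python, compares the size of a list with that of the equivalent generator expression; ``sys.getsizeof`` only measures the container itself (for a list, its array of references), so the figures are indicative rather than exact.

```python
import sys

n = 100_000

as_list = [i * i for i in range(n)]   # all n values exist at once
as_gen = (i * i for i in range(n))    # values are produced on demand

list_bytes = sys.getsizeof(as_list)   # grows linearly with n
gen_bytes = sys.getsizeof(as_gen)     # small, independent of n

total_from_list = sum(as_list)
total_from_gen = sum(as_gen)          # consumes the generator
```

Both sums are equal, but only the first one required storing all the squares simultaneously.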

Most functions that take a list of elements as input will also accept

an iterator (or an iterable) instead. To begin with, one can obtain the

list (or the tuple) of elements of an iterator as follows::

sage: list(binomial(8, i) for i in range(9))

[1, 8, 28, 56, 70, 56, 28, 8, 1]

sage: tuple(binomial(8, i) for i in range(9))

(1, 8, 28, 56, 70, 56, 28, 8, 1)

We now consider the functions ``all`` and ``any`` which denote

respectively the `n`-ary *and* and *or*::

sage: all([True, True, True, True])

True

sage: all([True, False, True, True])

False

sage: any([False, False, False, False])

False

sage: any([False, False, True, False])

True

The following example verifies that all primes from `3` to

`99` are odd::

sage: all( is_odd(p) for p in range(3,100) if is_prime(p) )

True

A *Mersenne prime* is a prime of the form `2^p -1`. We verify

that, for `p<1000`, if `2^p-1` is prime, then

`p` is also prime::

sage: def mersenne(p): return 2^p -1

sage: [ is_prime(p)

....: for p in range(1000) if is_prime(mersenne(p)) ]

[True, True, True, True, True, True, True, True, True, True,

True, True, True, True]

Is the converse true?

.. topic:: Exercise

Try the two following commands and explain the considerable

difference in the length of the calculations::

sage: all( is_prime(mersenne(p))

....: for p in range(1000) if is_prime(p) )

False

sage: all( [ is_prime(mersenne(p))

....: for p in range(1000) if is_prime(p)] )

False

We now try to find the smallest counter-example. In order to do this, we

use the ``Sage`` function ``exists``::

sage: exists( (p for p in range(1000) if is_prime(p)),

....: lambda p: not is_prime(mersenne(p)) )

(True, 11)

Alternatively, we could construct an iterator on the counter-examples::

sage: counter_examples = \

....: (p for p in range(1000)

....: if is_prime(p) and not is_prime(mersenne(p)))

sage: next(counter_examples)

11
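The search for a first counter-example is easy to sketch in plain Python. The function ``exists_sketch`` below is a hypothetical stand-in, not Sage's implementation of ``exists``; it assumes the return convention seen above, ``(True, x)`` on success, and returns ``(False, None)`` otherwise. The helper ``is_prime`` is a naive trial division.

```python
from itertools import count

def is_prime(n):
    """Naive trial division, sufficient for this small search."""
    return n >= 2 and all(n % d for d in range(2, int(n**0.5) + 1))

def mersenne(p):
    return 2**p - 1

def exists_sketch(iterable, predicate):
    """Return (True, x) for the first x satisfying predicate,
    or (False, None) if the iterable is exhausted without a match."""
    for x in iterable:
        if predicate(x):
            return (True, x)      # stop as soon as a witness is found
    return (False, None)

# An infinite lazy stream of primes: only the needed ones are computed.
primes = (p for p in count(2) if is_prime(p))
result = exists_sketch(primes, lambda p: not is_prime(mersenne(p)))
```

Because the primes are generated lazily, the search stops at the first witness even though the stream is infinite.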

.. topic:: Exercise

What do the following commands do? ::

sage: cubes = [t**3 for t in range(-999,1000)]

sage: exists([(x,y) for x in cubes for y in cubes], # long time (3s, 2012)

....: lambda x_y: x_y[0] + x_y[1] == 218)

(True, (-125, 343))

sage: exists(((x,y) for x in cubes for y in cubes), # long time (2s, 2012)

....: lambda x_y: x_y[0] + x_y[1] == 218)

(True, (-125, 343))

Which of the last two is more economical in terms of time? In terms

of memory? By how much?

.. topic:: Exercise

Try each of the following commands, and explain its result. If

possible, hide the result first and try to guess it before

launching the command.

.. todo:: hide the results by default

.. warning:: it will be necessary to interrupt the execution of some of the commands

sage: x = var('x')

sage: sum( x^len(s) for s in Subsets(8) )

x^8 + 8*x^7 + 28*x^6 + 56*x^5 + 70*x^4 + 56*x^3 + 28*x^2 + 8*x + 1

sage: sum( x^p.length() for p in Permutations(3) )

x^3 + 2*x^2 + 2*x + 1

sage: factor(sum( x^p.length() for p in Permutations(3) ))

(x^2 + x + 1)*(x + 1)

sage: P = Permutations(5)

sage: all( p in P for p in P )

True

sage: for p in GL(2, 2): print(p); print("")

[1 0]

[0 1]

[0 1]

[1 0]

[0 1]

[1 1]

[1 1]

[0 1]

[1 1]

[1 0]

[1 0]

[1 1]

sage: for p in Partitions(3): print(p) # not tested

[3]

[2, 1]

[1, 1, 1]

...

sage: for p in Partitions(): print(p) # not tested

[]

[1]

[2]

[1, 1]

[3]

...

sage: for p in Primes(): print(p) # not tested

...

sage: exists( Primes(), lambda p: not is_prime(mersenne(p)) )

(True, 11)

sage: counter_examples = (p for p in Primes()

....: if not is_prime(mersenne(p)))

sage: for p in counter_examples: print(p) # not tested

...

Operations on iterators

^^^^^^^^^^^^^^^^^^^^^^^

``Python`` provides numerous tools for manipulating iterators; most of them

are in the ``itertools`` library, which can be imported by::

sage: import itertools

The behaviour of this library has changed a lot between Python 2 and

Python 3. What follows is mostly written for Python 2.

We will demonstrate some applications, taking as a starting point the

permutations of `3`::

sage: list(Permutations(3))

[[1, 2, 3], [1, 3, 2], [2, 1, 3],

[2, 3, 1], [3, 1, 2], [3, 2, 1]]

We can list the elements of a set by numbering them::

sage: list(enumerate(Permutations(3)))

[(0, [1, 2, 3]), (1, [1, 3, 2]), (2, [2, 1, 3]),

(3, [2, 3, 1]), (4, [3, 1, 2]), (5, [3, 2, 1])]

or select only the elements in positions 2, 3, and 4 (analogue of

``l[1:4]``)::

sage: import itertools

sage: list(itertools.islice(Permutations(3), 1, 4))

[[1, 3, 2], [2, 1, 3], [2, 3, 1]]

The ``itertools`` functions ``imap`` and ``ifilter`` of Python 2 no longer

exist in Python 3, where the built-in ``map`` and ``filter`` already return

iterators. The same lazy versions can be obtained in Python 2 using::

sage: from builtins import map, filter

but it is usually clearer to avoid them and use comprehensions instead.

To apply a function to all the elements, one can do::

sage: list(z.cycle_type() for z in Permutations(3))

[[1, 1, 1], [2, 1], [2, 1], [3], [3], [2, 1]]

and similarly to select the elements satisfying a certain condition::

sage: list(z for z in Permutations(3) if z.has_pattern([1,2]))

[[1, 2, 3], [1, 3, 2], [2, 1, 3], [2, 3, 1], [3, 1, 2]]
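For comparison, the same operations can be tried in plain Python 3 on ``itertools.permutations``, which yields tuples rather than Sage permutations. This is only a sketch; ``has_ascent_pair`` is a hypothetical helper playing the role of ``has_pattern([1,2])``.

```python
from itertools import islice, permutations

def has_ascent_pair(p):
    """True if some entry is smaller than a later entry (pattern 12)."""
    return any(p[i] < p[j]
               for i in range(len(p)) for j in range(i + 1, len(p)))

# Number the elements as they are produced.
numbered = list(enumerate(permutations([1, 2, 3])))

# Lazy analogue of l[1:4]: skip one element, keep the next three.
window = list(islice(permutations([1, 2, 3]), 1, 4))

# In Python 3 the built-in filter and map are lazy themselves.
with_pattern = list(filter(has_ascent_pair, permutations([1, 2, 3])))
lengths = list(map(len, permutations([1, 2, 3])))
```

Only the decreasing permutation ``(3, 2, 1)`` is rejected by the filter.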

Implementation of new iterators

^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^

It is easy to construct new iterators, using the keyword ``yield``

instead of ``return`` in a function::

sage: def f(n):

....:     for i in range(n):

....:         yield i

After the ``yield``, execution is not halted, but only suspended, ready

to be continued from the same point. The result of the function is

therefore an iterator over the successive values returned by ``yield``::

sage: g = f(4)

sage: next(g)

0

sage: next(g)

1

sage: next(g)

2

sage: next(g)

3

sage: next(g)

Traceback (most recent call last):

...

StopIteration

The function could be used as follows::

sage: [ x for x in f(5) ]

[0, 1, 2, 3, 4]
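The suspension at each ``yield`` can be made visible by recording what the function body does. The following plain Python sketch adds a hypothetical ``log`` list to the function ``f`` for that purpose.

```python
log = []

def f(n):
    for i in range(n):
        log.append(f"before yield {i}")
        yield i
        log.append(f"after yield {i}")

g = f(2)
log_at_creation = list(log)    # calling f runs none of its body yet

first = next(g)                # runs up to the first yield, then suspends
log_after_first = list(log)

second = next(g)               # resumes just after the first yield
log_after_second = list(log)
```

Nothing is executed when ``f(2)`` is called; each ``next`` runs the body only as far as the following ``yield``.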

This model of computation, called *continuation*, is very useful in

combinatorics, especially when combined with recursion. Here is how to

generate all words of a given length on a given alphabet::

sage: def words(alphabet, l):

....:     if l == 0:

....:         yield []

....:     else:

....:         for word in words(alphabet, l-1):

....:             for letter in alphabet:

....:                 yield word + [letter]

sage: [ w for w in words(['a','b'], 3) ]

[['a', 'a', 'a'], ['a', 'a', 'b'], ['a', 'b', 'a'],

['a', 'b', 'b'], ['b', 'a', 'a'], ['b', 'a', 'b'],

['b', 'b', 'a'], ['b', 'b', 'b']]

These words can then be counted by::

sage: sum(1 for w in words(['a','b','c','d'], 10))

1048576

Counting the words one by one is clearly not an efficient method in this

case, since the formula `n^\ell` (for `n` letters and length `\ell`) is

also available; note, though, that this is not the worst possible

approach: it does, at least, avoid constructing the entire list in memory.

We now consider Dyck words, which are well-parenthesized words in the

letters “`(`” and “`)`”. The function below generates

all the Dyck words of a given length (where the length is the number of

pairs of parentheses), using the recursive definition which says that a

Dyck word is either empty or of the form `(w_1)w_2` where

`w_1` and `w_2` are Dyck words::

sage: def dyck_words(l):

....:     if l == 0:

....:         yield ''

....:     else:

....:         for k in range(l):

....:             for w1 in dyck_words(k):

....:                 for w2 in dyck_words(l-k-1):

....:                     yield '(' + w1 + ')' + w2

Here are all the Dyck words of length `4`::

sage: list(dyck_words(4))

['()()()()', '()()(())', '()(())()', '()(()())', '()((()))',

'(())()()', '(())(())', '(()())()', '((()))()', '(()()())',

'(()(()))', '((())())', '((()()))', '(((())))']

Counting them, we recover a well-known sequence::

sage: [ sum(1 for w in dyck_words(l)) for l in range(10) ]

[1, 1, 2, 5, 14, 42, 132, 429, 1430, 4862]
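These counts are the Catalan numbers, for which the closed formula `\binom{2n}{n}/(n+1)` is classical. The plain Python sketch below repeats the generator and compares its counts with that formula; it is a numerical check for small `n`, not a proof.

```python
from math import comb

def dyck_words(l):
    """Yield the Dyck words with l pairs of parentheses."""
    if l == 0:
        yield ''
    else:
        for k in range(l):
            for w1 in dyck_words(k):
                for w2 in dyck_words(l - k - 1):
                    yield '(' + w1 + ')' + w2

def catalan(n):
    """Closed formula for the n-th Catalan number."""
    return comb(2 * n, n) // (n + 1)

counted = [sum(1 for _ in dyck_words(l)) for l in range(8)]
formula = [catalan(l) for l in range(8)]
```

The two lists agree, while the formula needs no enumeration at all.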

.. _exo-iterators-catalan:

.. topic:: Exercise: complete binary tree iterator

Construct an iterator on the set `C_n` of complete binary

trees with `n` leaves

(see :ref:`section-examples-catalan`).

Hint: ``Sage`` 4.8.2 does not yet have a native data structure to

represent complete binary trees. One simple way to represent them is

to define a formal variable ``Leaf`` for the leaves and a formal

2-ary function ``Node``::

sage: var('Leaf')

Leaf

sage: function('Node', nargs=2)

Node

The second tree in :ref:`figure-examples-catalan-trees`

can be represented by the expression::

sage: tr = Node(Node(Leaf, Node(Leaf, Leaf)), Leaf)

.. _section-constructions:

Constructions

-------------

We will now see how to construct new sets starting from these building

blocks. In fact, we have already begun to do this with the construction

of `\mathcal P(\mathcal P(\mathcal P(\{1,2,3,4\})))` in the

previous section, and with the example of sets of cards in

:ref:`section-examples`.

Consider a large Cartesian product::

sage: C = cartesian_product([Compositions(8), Permutations(20)]); C

The Cartesian product of (Compositions of 8, Standard permutations of 20)

sage: C.cardinality()

311411457046609920000

Clearly, it is impractical to construct the list of all the elements of this

Cartesian product! And, in the following example, `H` is equipped with the

usual combinatorial operations and also its structure as a product group::

sage: G = DihedralGroup(4)

sage: H = cartesian_product([G,G])

sage: H in Groups()

True

sage: t = H.an_element()

sage: t

((1,2,3,4), (1,2,3,4))

sage: t*t

((1,3)(2,4), (1,3)(2,4))

We now construct the union of two existing disjoint sets::

sage: C = DisjointUnionEnumeratedSets(

....: [ Compositions(4), Permutations(3)] )

sage: C

Disjoint union of Family (Compositions of 4,

Standard permutations of 3)

sage: C.cardinality()

14

sage: C.list()

[[1, 1, 1, 1], [1, 1, 2], [1, 2, 1], [1, 3], [2, 1, 1], [2, 2],

[3, 1], [4], [1, 2, 3], [1, 3, 2], [2, 1, 3], [2, 3, 1],

[3, 1, 2], [3, 2, 1]]

It is also possible to take the union of more than two disjoint sets, or

even an infinite number of them. We will now construct the set of all

permutations, viewed as the union of the sets `P_n` of

permutations of size `n`. We begin by constructing the infinite

family `F=(P_n)_{n\in \NN}`::

sage: F = Family(NonNegativeIntegers(), Permutations); F

Lazy family (<class 'sage.combinat.permutation.Permutations'>(i))_{i in Non negative integers}

sage: F.keys()

Non negative integers

sage: F[1000]

Standard permutations of 1000

Now we can construct the disjoint union `\bigcup_{n\in \NN}P_n`::

sage: U = DisjointUnionEnumeratedSets(F); U

Disjoint union of

Lazy family (<class 'sage.combinat.permutation.Permutations'>(i))_{i in Non negative integers}

It is an infinite set::

sage: U.cardinality()

+Infinity

which doesn’t prohibit iteration through its elements, though it will be

necessary to interrupt it at some point::

sage: for p in U: # not tested

....:     print(p)

[]

[1]

[1, 2]

[2, 1]

[1, 2, 3]

[1, 3, 2]

[2, 1, 3]

[2, 3, 1]

[3, 1, 2]

...
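The idea of iterating lazily over an infinite disjoint union can be sketched in plain Python with ``itertools``. This is an analogy under simple assumptions, not how ``DisjointUnionEnumeratedSets`` is implemented; the function name ``all_permutations`` is hypothetical.

```python
from itertools import chain, count, islice, permutations

def all_permutations():
    """Lazily chain the permutations of {1, ..., n} for n = 0, 1, 2, ..."""
    return chain.from_iterable(
        permutations(range(1, n + 1)) for n in count())

# Take only the first nine elements of the infinite stream.
first_nine = [list(p) for p in islice(all_permutations(), 9)]
print(first_nine)
```

Here ``islice`` plays the role of the manual interruption: it stops the iteration after a fixed number of elements.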

Note: the above set could also have been constructed directly with::

sage: U = Permutations(); U

Standard permutations

Summary

~~~~~~~

``Sage`` provides a library of common enumerated sets, which can be

combined by standard constructions, giving a toolbox that is flexible

(but which could still be expanded). It is also possible to add new

building blocks to ``Sage`` with a few lines (see the code in

``FiniteEnumeratedSets().example()``). This is made possible by the

uniformity of the interfaces and the fact that ``Sage`` is based on an

object-oriented language. Also, very large or even infinite sets can

be manipulated thanks to lazy evaluation strategies (iterators, etc.).

There is no magic to any of this: under the hood, ``Sage`` applies the

usual rules (for example, that the cardinality of `E\times E` is

`|E|^2`); the added value comes from the capacity to manipulate

complicated constructions. The situation is comparable to ``Sage``’s

implementation of differential calculus: ``Sage`` applies the usual

rules for differentiation of functions and their compositions, where

the added value comes from the possibility of manipulating complicated

formulas. In this sense, ``Sage`` implements a *calculus* of finite

enumerated sets.

.. _section-generic:

Generic algorithms

------------------

.. _section-generic-integerlistlex:

Lexicographic generation of lists of integers

~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~

Among the classic enumerated sets, especially in algebraic

combinatorics, a certain number are composed of lists of integers of

fixed sum, such as partitions, compositions, or integer vectors. These

examples can also have supplementary constraints added to them. Here are

some examples. We start with the integer vectors with sum `10`

and length `3`, with parts between `2` and `5`, and bounded below by `2`,

`4` and `2` respectively::

sage: IntegerVectors(10, 3, min_part=2, max_part=5,

....: inner=[2, 4, 2]).list()

[[4, 4, 2], [3, 5, 2], [3, 4, 3], [2, 5, 3], [2, 4, 4]]

The compositions of `5` with each part at most `3`, and

with length `2` or `3`::

sage: Compositions(5, max_part=3,

....: min_length=2, max_length=3).list()

[[3, 2], [3, 1, 1], [2, 3], [2, 2, 1], [2, 1, 2], [1, 3, 1],

[1, 2, 2], [1, 1, 3]]

The strictly decreasing partitions of `5`::

sage: Partitions(5, max_slope=-1).list()

[[5], [4, 1], [3, 2]]

These sets share the same underlying algorithmic structure, implemented

in the more general (and slightly more cumbersome) class

``IntegerListsLex``. This class models sets of vectors

`(\ell_0,\dots,\ell_k)` of non-negative integers, with

constraints on the sum and the length, and bounds on the parts and on

the consecutive differences between the parts. Here are some more

examples::

sage: IntegerListsLex(10, length=3,

....: min_part=2, max_part=5,

....: floor=[2, 4, 2]).list()

[[4, 4, 2], [3, 5, 2], [3, 4, 3], [2, 5, 3], [2, 4, 4]]

sage: IntegerListsLex(5, min_part=1, max_part=3,

....: min_length=2, max_length=3).list()

[[3, 2], [3, 1, 1], [2, 3], [2, 2, 1], [2, 1, 2],

[1, 3, 1], [1, 2, 2], [1, 1, 3]]

sage: IntegerListsLex(5, min_part=1, max_slope=-1).list()

[[5], [4, 1], [3, 2]]

sage: list(Compositions(5, max_length=2))

[[5], [4, 1], [3, 2], [2, 3], [1, 4]]

sage: list(IntegerListsLex(5, max_length=2, min_part=1))

[[5], [4, 1], [3, 2], [2, 3], [1, 4]]

The point of the model of ``IntegerListsLex`` is in the compromise

between generality and efficiency. The main algorithm permits

iteration through the elements of such a set `S` in reverse

lexicographic order with a good complexity in most practical use

cases. Roughly speaking, the time needed to iterate through all the

elements of `S` is proportional to the number of elements, where the

proportionality factor is controlled by the length `l` of the longest

element of `S`. In addition, the memory usage is also controlled by

`l`, which is to say negligible in practice.

This algorithm is based on a very general principle for traversing a

decision tree, called *branch and bound*: at the top level, we run

through all the possible choices for `\ell_0`; for each of these

choices, we run through all the possible choices for `\ell_1`,

and so on. Mathematically speaking, we have put the structure of a

prefix tree on the elements of `S`: a node of the tree at depth

`k` corresponds to a prefix `\ell_0,\dots,\ell_k` of one

(or more) elements of `S` (see :ref:`figure-prefix-tree-partitions`).

.. _figure-prefix-tree-partitions:

.. figure:: ../../media/prefix-tree-partitions-5.png

:scale: 150%

Figure: The prefix tree of the partitions of 5.

The usual problem with this type of approach is to avoid bad decisions

which lead to leaving the prefix tree and exploring dead branches;

this is particularly problematic because the growth of the number of

elements is usually exponential in the depth. It turns out that the

constraints listed above are simple enough to be able to reasonably

predict when a sequence `\ell_0,\dots,\ell_k` is a prefix of some

element of `S`. Hence, most dead branches can be pruned.
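As an illustration of pruning in a prefix tree, here is a plain Python sketch that lists the strictly decreasing partitions of `n` in reverse lexicographic order. It is a toy version written for this one constraint, not the algorithm of ``IntegerListsLex``; the name ``strict_partitions`` and its ``bound`` parameter are hypothetical.

```python
def strict_partitions(n, bound=None):
    """Yield the partitions of n into distinct parts, each part < bound
    (no bound if None), in reverse lexicographic order."""
    if n == 0:
        yield []
        return
    largest = n if bound is None else min(n, bound - 1)
    for first in range(largest, 0, -1):
        # Feasibility test: the remaining parts are distinct and smaller
        # than `first`, so they sum to at most 1 + 2 + ... + (first - 1).
        # If the remainder is larger, this prefix and every prefix with a
        # smaller first part is a dead branch, so stop here.
        if n - first > first * (first - 1) // 2:
            break
        for rest in strict_partitions(n - first, first):
            yield [first] + rest

print(list(strict_partitions(5)))
```

The test before the recursive call is what prevents the exploration of prefixes such as ``[2]`` or ``[1]``, which cannot be completed to a strictly decreasing partition of `5`.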

.. _section-generic-polytopes:

Integer points in polytopes

~~~~~~~~~~~~~~~~~~~~~~~~~~~

Although the algorithm for iteration in ``IntegerListsLex`` is

efficient, its counting algorithm is naive: it just iterates over all

the elements.

There is an alternative approach to treating this problem: modelling the

desired lists of integers as the set of integer points of a polytope,

that is to say, the set of solutions with integer coordinates of a

system of linear inequalities. This is a very general context in which

there exist advanced counting algorithms (e.g. Barvinok), which are

implemented in libraries like ``LattE``. Iteration does not pose a hard problem

in principle. However, there are two limitations that justify the

existence of ``IntegerListsLex``. The first is theoretical: lattice

points in a polytope only allow modelling of problems of a fixed

dimension (length). The second is practical: at the moment only the

library ``PALP`` has a ``Sage`` interface, and though it offers multiple

capabilities for the study of polytopes, in the present application it

only produces a list of lattice points, without providing either an

iterator or non-naive counting::

sage: A = random_matrix(ZZ, 6, 3, x=7)

sage: L = LatticePolytope(A.rows())

sage: L.points() # random

M(4, 1, 0),

M(0, 3, 5),

M(2, 2, 3),

M(6, 1, 3),

M(1, 3, 6),

M(6, 2, 3),

M(3, 2, 4),

M(3, 2, 3),

M(4, 2, 4),

M(4, 2, 3),

M(5, 2, 3)

in 3-d lattice M

sage: L.npoints() # random

11

This polytope can be visualized in 3D with ``L.plot3d()`` (see

:ref:`figure-polytope`).

.. _figure-polytope:

.. figure:: ../../media/polytope.png

:scale: 75%

Figure: The polytope `L` and its integer points, in cross-eyed stereographic perspective.

.. _section-generic-species:

Species, decomposable combinatorial classes

~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~

In :ref:`section-examples-catalan`, we showed how to use the recursive

definition of binary trees to count them efficiently using generating

functions. The techniques we used there are very general, and apply

whenever the sets involved can be defined recursively (depending on

who you ask, such a set is called a *decomposable combinatorial class*

or, roughly speaking, a *combinatorial species*). This includes all

the types of trees, but also permutations, compositions, functional

graphs, etc.

Here, we illustrate just a few examples using the ``Sage`` library on

combinatorial species::

sage: from sage.combinat.species.library import *

sage: o = var('o')

We begin by redefining the complete binary trees; to do so, we stipulate

the recurrence relation directly on the sets::

sage: BT = CombinatorialSpecies()

sage: Leaf = SingletonSpecies()

sage: BT.define( Leaf + (BT*BT) )

Now we can construct the set of trees with five leaves, list them, count

them...::

sage: BT5 = BT.isotypes([o]*5)

sage: BT5.cardinality()

14

sage: BT5.list()

[o*(o*(o*(o*o))), o*(o*((o*o)*o)), o*((o*o)*(o*o)),

o*((o*(o*o))*o), o*(((o*o)*o)*o), (o*o)*(o*(o*o)),

(o*o)*((o*o)*o), (o*(o*o))*(o*o), ((o*o)*o)*(o*o),

(o*(o*(o*o)))*o, (o*((o*o)*o))*o, ((o*o)*(o*o))*o,

((o*(o*o))*o)*o, (((o*o)*o)*o)*o]

The trees are constructed using a generic recursive structure; the

display is therefore not wonderful. To do better, it would be necessary

to provide ``Sage`` with a more specialized data structure with the

desired display capabilities.

We recover the generating function for the Catalan numbers::

sage: g = BT.isotype_generating_series(); g

x + x^2 + 2*x^3 + 5*x^4 + 14*x^5 + O(x^6)

which is returned in the form of a lazy power series::

sage: g[100]

227508830794229349661819540395688853956041682601541047340
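The coefficients of this series follow directly from the defining relation ``BT = Leaf + BT*BT``: a tree with `n \geq 2` leaves splits into a left subtree with `k` leaves and a right subtree with `n-k` leaves. The plain Python sketch below computes them from this recurrence and compares with the Catalan formula; the function name ``binary_tree_counts`` is hypothetical.

```python
from math import comb

def binary_tree_counts(n_max):
    """Return b with b[n] = number of complete binary trees with n leaves,
    using b[1] = 1 and b[n] = sum of b[k] * b[n - k] for 0 < k < n."""
    b = [0] * (n_max + 1)
    if n_max >= 1:
        b[1] = 1                      # the tree reduced to a single leaf
    for n in range(2, n_max + 1):
        b[n] = sum(b[k] * b[n - k] for k in range(1, n))
    return b

coeffs = binary_tree_counts(100)
print(coeffs[1:6])
```

The coefficient of `x^n` is the Catalan number of index `n-1`, so ``coeffs[100]`` equals ``comb(198, 99) // 100``.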

We finish with the Fibonacci words, which are binary words without two

consecutive “`1`”s. They admit a natural recursive definition::

sage: Eps = EmptySetSpecies()

sage: Z0 = SingletonSpecies()

sage: Z1 = Eps*SingletonSpecies()

sage: FW = CombinatorialSpecies()

sage: FW.define(Eps + Z0*FW + Z1*Eps + Z1*Z0*FW)

The Fibonacci sequence is easily recognized here, hence the name::

sage: L = FW.isotype_generating_series().coefficients(15); L

[1, 2, 3, 5, 8, 13, 21, 34, 55, 89, 144, 233, 377, 610, 987]

sage: oeis(L) # optional -- internet

0: A000045: Fibonacci numbers: F(n) = F(n-1) + F(n-2) with F(0) = 0 and F(1) = 1.

1: A212804: Expansion of (1-x)/(1-x-x^2).

2: A132636: Fib(n) mod n^3.

This is an immediate consequence of the recurrence relation. One can

also generate immediately all the Fibonacci words of a given length,

with the same limitations resulting from the generic display::

sage: FW3 = FW.isotypes([o]*3)

sage: FW3.list()

[o*(o*(o*{})), o*(o*(({}*o)*{})), o*((({}*o)*o)*{}),

(({}*o)*o)*(o*{}), (({}*o)*o)*(({}*o)*{})]
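For a more readable display, the Fibonacci words can also be generated as strings. The following plain Python sketch does not use the species library; it works directly from the definition (binary words without two consecutive ``1``), and the generator name ``fibonacci_words`` is hypothetical.

```python
def fibonacci_words(n):
    """Yield the binary words of length n with no two consecutive '1's."""
    if n == 0:
        yield ""
        return
    for w in fibonacci_words(n - 1):
        yield w + "0"                 # a '0' can always be appended
        if not w.endswith("1"):
            yield w + "1"             # a '1' only after a '0' or at the start

words3 = list(fibonacci_words(3))
counts = [sum(1 for _ in fibonacci_words(n)) for n in range(15)]
print(words3)
```

The five words of length `3` correspond to the five objects listed above, and the counts reproduce the sequence ``L``.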

.. _section-generic-isomorphism:

Graphs up to isomorphism

~~~~~~~~~~~~~~~~~~~~~~~~

We saw in :ref:`section-bricks-divers` that ``Sage`` could generate

graphs and partial orders up to isomorphism. We will now describe the

underlying algorithm, which is the same in both cases, and covers a

substantially wider class of problems.

We begin by recalling some notions. A graph `G=(V,E)` is a set

`V` of vertices and a set `E` of edges connecting these

vertices; an edge is described by a pair `\{u,v\}` of distinct

vertices of `V`. Such a graph is called labelled; its vertices

are typically numbered by considering `V=\{1,2,3,4,5\}`.

In many problems, the labels on the vertices play no role. Typically a

chemist wants to study all the possible molecules with a given

composition, for example the alkanes with `n=8` atoms of carbon

and `2n+2=18` atoms of hydrogen. The chemist therefore wants to find all

the graphs consisting of `8` vertices with `4` neighbours, and

`18` vertices with a single neighbour. The different carbon atoms,

however, are all considered to be identical, and the same for

the hydrogen atoms. The problem of our chemist is not imaginary; this

type of application is actually at the origin of an important part of

the research in graph theory on isomorphism problems.

Working by hand on a small graph it is possible, as in the example of

:ref:`section-bricks-divers`, to make a drawing, erase the labels, and

“forget” the geometrical information about the location of the

vertices in the plane. However, to represent a graph in a computer

program, it is necessary to introduce labels on the vertices so as to

be able to describe how the edges connect them together. To compensate

for the extra information which we have introduced, we then say that

two labelled graphs `g_1` and `g_2` are *isomorphic* if there is a

bijection from the vertices of `g_1` to those of `g_2` which maps
the edges of `g_1` bijectively onto those of `g_2`; an *unlabelled

graph* is then an equivalence class of labelled graphs.
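The definition of isomorphism can be checked directly, if naively, by
trying every bijection between the vertex sets. The following is a
minimal sketch in plain ``Python``, not the method used by ``Sage``; the
function name ``are_isomorphic`` and the convention of numbering the
vertices `0,\dots,n-1` are assumptions made for the illustration.

```python
from itertools import permutations


def are_isomorphic(n, edges1, edges2):
    """Brute-force test: try every bijection of {0, ..., n-1}.

    Hypothetical helper for illustration; the cost grows like n!.
    """
    e1 = {frozenset(e) for e in edges1}
    e2 = {frozenset(e) for e in edges2}
    if len(e1) != len(e2):
        return False
    for perm in permutations(range(n)):
        # Relabel every edge of the first graph and compare edge sets.
        if {frozenset(perm[v] for v in e) for e in e1} == e2:
            return True
    return False


path_a = [(0, 1), (1, 2), (2, 3)]   # the path 0-1-2-3
path_b = [(2, 0), (0, 3), (3, 1)]   # the path 2-0-3-1
star = [(0, 1), (0, 2), (0, 3)]     # a star centred at 0

print(are_isomorphic(4, path_a, path_b))  # True
print(are_isomorphic(4, path_a, star))    # False
```

The two paths differ only by their labels, whereas the star has a vertex
with three neighbours and therefore cannot be relabelled into a path.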

In general, testing if two labelled graphs are isomorphic is expensive.
Moreover, the number of graphs, even unlabelled, grows very
rapidly. Nonetheless, it is possible to list unlabelled graphs very efficiently
relative to their number. For example, the program ``Nauty`` can list the
`12005168` simple graphs with `10` vertices in
`20` seconds.

As in :ref:`section-generic-integerlistlex`, the general principle

of the algorithm is to organize the objects to be enumerated into a tree

that one traverses.

For this, in each equivalence class of labelled graphs (that is to say,

for each unlabelled graph) one fixes a convenient canonical

representative. The following are the fundamental operations:

* Testing whether a labelled graph is canonical

* Calculating the canonical representative of a labelled graph

These unavoidable operations remain expensive; one therefore tries to

minimize the number of calls to them.
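To make the two fundamental operations concrete, here is a deliberately
naive sketch in plain ``Python``. It assumes one particular choice of
canonical representative, namely the relabelling whose sorted list of
edges is lexicographically smallest; this choice, and the names
``canonical_form`` and ``is_canonical``, are assumptions made for the
illustration and are not necessarily those used by ``Sage`` or ``Nauty``.

```python
from itertools import permutations


def canonical_form(n, edges):
    """Smallest sorted edge tuple over all n! relabellings (brute force)."""
    candidates = []
    for perm in permutations(range(n)):
        relabelled = sorted(tuple(sorted((perm[u], perm[v]))) for u, v in edges)
        candidates.append(tuple(relabelled))
    return min(candidates)


def is_canonical(n, edges):
    """A labelled graph is canonical if it equals its canonical form."""
    normalized = tuple(sorted(tuple(sorted(e)) for e in edges))
    return normalized == canonical_form(n, edges)


path = [(2, 0), (0, 3), (3, 1)]          # the path 2-0-3-1
print(canonical_form(4, path))           # ((0, 1), (0, 2), (1, 3))
print(is_canonical(4, [(0, 1), (1, 2), (2, 3)]))  # False
```

Both operations visit all `n!` relabellings here, which illustrates why
one tries to call them as rarely as possible.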

The canonical representatives are chosen in such a way that, for each

canonical labelled graph `G`, there is a canonical choice of an edge

whose removal produces a canonical graph again, which is called the

father of `G`. This property implies that it is possible to organize

the set of canonical representatives as a tree: at the root, the graph

with no edges; below it, its unique child, the graph with one edge;

then the graphs with two edges, and so on. The set of children of a

graph `G` can be constructed by *augmentation*, adding an edge in all

the possible ways to `G`, and then selecting, from among those graphs,

the ones that are still canonical [3]_. Recursively, one obtains all

the canonical graphs.

.. figure:: ../../media/prefix-tree-graphs-4.png

Figure: The generation tree of simple graphs with `4` vertices.
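The traversal by augmentation can be sketched in a few lines of plain
``Python``. This is a simplified variant under stated assumptions:
instead of testing whether each child is canonical with respect to its
father, it computes a brute-force canonical form of every child and
discards duplicates with a set. The names ``canonical_form`` and
``unlabelled_graphs`` are hypothetical.

```python
from itertools import combinations, permutations


def canonical_form(n, edges):
    """Smallest sorted edge tuple over all relabellings (brute force)."""
    return min(
        tuple(sorted(tuple(sorted((p[u], p[v]))) for u, v in edges))
        for p in permutations(range(n))
    )


def unlabelled_graphs(n):
    """List one representative per unlabelled graph on n vertices.

    Level k holds the graphs with k edges; each level is obtained from
    the previous one by adding a single edge in all possible ways.
    """
    all_pairs = list(combinations(range(n), 2))
    level = {()}            # the root: the graph with no edges
    found = [()]
    while level:
        children = set()
        for graph in level:
            for edge in all_pairs:
                if edge not in graph:
                    # Duplicates are removed via the canonical form.
                    children.add(canonical_form(n, graph + (edge,)))
        found.extend(sorted(children))
        level = children
    return found


print([len(unlabelled_graphs(n)) for n in range(5)])  # [1, 1, 2, 4, 11]
```

For `4` vertices the sketch finds `11` graphs, spread over levels of
sizes `1, 1, 2, 3, 2, 1, 1` according to the number of edges.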

In what sense is this algorithm generic? Consider for example planar

graphs (graphs which can be drawn in the plane without edges crossing):

by removing an edge from a planar graph, one obtains another planar

graph; so planar graphs form a subtree of the previous tree. To generate

them, exactly the same algorithm can be used,

selecting only the children which are planar::

sage: [len(list(graphs(n, property = lambda G: G.is_planar())))

....: for n in range(7)]

[1, 1, 2, 4, 11, 33, 142]

In a similar fashion, one can generate any family of graphs closed

under deletion of an edge, and in particular any family characterized

by a forbidden subgraph. This includes for example forests (graphs

without cycles), bipartite graphs (graphs without odd cycles),

etc. This can be applied to generate:

- partial orders, via the bijection with Hasse diagrams which are

oriented graphs without cycles and without edges implied by the

transitivity of the order relation;

- lattices (not implemented in ``Sage``), via the bijection with the

meet semi-lattice obtained by deleting the maximal vertex; in this

case an augmentation by vertices rather than by edges is used.
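As an illustration of a family closed under deletion of an edge, the
following plain ``Python`` sketch enumerates unlabelled forests by
pruning the augmentation step. It is a naive variant under stated
assumptions: duplicates are removed with a brute-force canonical form
rather than a canonicity test, and the names ``canonical_form``,
``is_forest`` and ``closed_family`` are hypothetical.

```python
from itertools import combinations, permutations


def canonical_form(n, edges):
    """Smallest sorted edge tuple over all relabellings (brute force)."""
    return min(
        tuple(sorted(tuple(sorted((p[u], p[v]))) for u, v in edges))
        for p in permutations(range(n))
    )


def is_forest(n, edges):
    """Union-find check: an edge inside one component closes a cycle."""
    parent = list(range(n))

    def find(x):
        while parent[x] != x:
            x = parent[x]
        return x

    for u, v in edges:
        root_u, root_v = find(u), find(v)
        if root_u == root_v:
            return False
        parent[root_u] = root_v
    return True


def closed_family(n, keep):
    """Unlabelled graphs on n vertices satisfying ``keep``.

    Assumes ``keep`` is closed under edge deletion and accepts the
    empty graph, so that every member is reachable from the root.
    """
    all_pairs = list(combinations(range(n), 2))
    level = {()}
    found = [()]
    while level:
        children = set()
        for graph in level:
            for edge in all_pairs:
                # Prune: only children still in the family are kept.
                if edge not in graph and keep(n, graph + (edge,)):
                    children.add(canonical_form(n, graph + (edge,)))
        found.extend(sorted(children))
        level = children
    return found


print([len(closed_family(n, is_forest)) for n in range(6)])
```

The printed list gives the number of unlabelled forests on `0` to `5`
vertices, namely `1, 1, 2, 3, 6, 10`. Any other predicate closed under
edge deletion could be passed in place of ``is_forest``.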

REFERENCES:

.. [CMS2012] Alexandre Casamayou, Nathann Cohen, Guillaume Connan, Thierry Dumont, Laurent Fousse, François Maltey, Matthias Meulien, Marc Mezzarobba, Clément Pernet, Nicolas M. Thiéry, Paul Zimmermann

*Calcul Mathématique avec Sage*

http://sagebook.gforge.inria.fr/

.. [1]

Or at least that should be the case; there are still many corners to

clean up.

.. [2]

Technical detail: ``xrange`` returns an iterator on
`\{0,\dots,8\}` while ``range`` returns the corresponding
list. Starting in ``Python`` 3.0, ``range`` will behave like ``xrange``, and
``xrange`` will no longer be needed.

.. [3]

In practice, an efficient implementation would exploit the symmetries

of `G`, i.e., its automorphism group, to reduce the number of

children to explore, and to reduce the cost of each test of

canonicity.

"""
