WebSVN – nexmon – Blame – Rev 1 – /buildtools/isl-0.10/doc/implementation.tex

lexmax { [j1,j2] -> [i1,i2,i3,i4,i5,i6,i7,i8,i9,i10] : 1 <= i1,j1 <= 8 and 1 <= i2,i3,i4,i5,i6,i7,i8,i9,i10 <= 2 and 1 <= j2 <= 128 and i1-1 = j1-1 and i2-1+2*i3-2+4*i4-4+8*i5-8+16*i6-16+32*i7-32+64*i8-64+128*i9-128+256*i10-256=3*j2-3+66 };

702

\end{lstlisting}

703

This problem was the main inspiration

704

for some of the optimizations in \autoref{s:GBR}.

705

The second group of test cases are projections used during counting.

706

The first nine of these come from \shortciteN{Seghir2006minimizing}.

707

The remaining two come from \shortciteN{Verdoolaege2005experiences} and

708

were used to drive the first, Gomory cuts based, implementation

709

in {\tt isl}.

710

The third and final group of test cases are borrowed from

711

\shortciteN{Bygde2010licentiate} and inspired the offline symmetry detection

712

of \autoref{s:offline}. Without symmetry detection, the running times

713

are 11s and 5.9s.

714

All running times of {\tt barvinok} and {\tt isl} include a conversion

715

to disjunctive normal form. Without this conversion, the final two

716

cases can be solved in 0.07s and 0.21s.

717

The {\tt PipLib} implementation has some fixed limits and will

718

sometimes report the problem to be too complex (TC), while on some other

719

problems it will run out of memory (OOM).

720

The {\tt barvinok} implementation does not support problems

721

with a non-trivial lineality space (line) nor maximization problems (max).

722

The Gomory cuts based {\tt isl} implementation was terminated after 1000

723

minutes on the first problem. The gbr version introduces some

724

overhead on some of the easier problems, but is overall the clear winner.

\begin{table}

\begin{center}

\begin{tabular}{lrrrrr}

729

& {\tt PipLib} & {\tt barvinok} & {\tt isl} cut & {\tt isl} gbr & {\tt PPL} \\

\hline

\hline

% bart.pip

Phideo & TC & 793m & $>$999m & 2.7s & 372m \\

734

\hline

735

e1 & 0.33s & 3.5s & 0.08s & 0.11s & 0.18s \\

736

e3 & 0.14s & 0.13s & 0.10s & 0.10s & 0.17s \\

737

e4 & 0.24s & 9.1s & 0.09s & 0.11s & 0.70s \\

738

e5 & 0.12s & 6.0s & 0.06s & 0.14s & 0.17s \\

739

e6 & 0.10s & 6.8s & 0.17s & 0.08s & 0.21s \\

740

e7 & 0.03s & 0.27s & 0.04s & 0.04s & 0.03s \\

741

e8 & 0.03s & 0.18s & 0.03s & 0.04s & 0.01s \\

742

e9 & OOM & 70m & 2.6s & 0.94s & 22s \\

743

vd & 0.04s & 0.10s & 0.03s & 0.03s & 0.03s \\

744

bouleti & 0.25s & line & 0.06s & 0.06s & 0.15s \\

745

difficult & OOM & 1.3s & 1.7s & 0.33s & 1.4s \\

746

\hline

747

cnt/sum & TC & max & 2.2s & 2.2s & OOM \\

748

jcomplex & TC & max & 3.7s & 3.9s & OOM \\

749

\end{tabular}

750

\caption{Comparison of Execution Times}

751

\label{t:comparison}

\end{center}

\end{table}

\subsection{Online Symmetry Detection}\label{s:online}

756

757

Manual experiments on small instances of the problems of

758

\shortciteN{Bygde2010licentiate} and an analysis of the results

759

by the approximate MPA method developed by \shortciteN{Bygde2010licentiate}

760

have revealed that these problems contain many more symmetries

761

than can be detected using the offline method of \autoref{s:offline}.

762

In this section, we present an online detection mechanism that has

763

not been implemented yet, but that has shown promising results

764

in manual applications.

765

766

Let us first consider what happens when we do not perform offline

767

symmetry detection. At some point, one of the

768

$b_i(\vec p) + \sp {\vec a} {\vec x} \ge 0$ constraints,

769

say the $j$th constraint, appears as a column

770

variable, say $c_1$, while the other constraints are represented

771

as rows of the form $b_i(\vec p) - b_j(\vec p) + c$.

772

The context is then split according to the relative order of

773

$b_j(\vec p)$ and one of the remaining $b_i(\vec p)$.

774

The offline method avoids this split by replacing all $b_i(\vec p)$

775

by a single newly introduced parameter that represents the minimum

776

of these $b_i(\vec p)$.

777

In the online method the split is similarly avoided by the introduction

778

of a new parameter. In particular, a new parameter is introduced

779

that represents

780

$\left| b_j(\vec p) - b_i(\vec p) \right|_+ =

781

\max(b_j(\vec p) - b_i(\vec p), 0)$.

782

783

In general, let $r = b(\vec p) + \sp {\vec a} {\vec c}$ be a row

784

of the tableau such that the sign of $b(\vec p)$ is indeterminate

785

and such that exactly one of the elements of $\vec a$ is a $1$,

786

while all remaining elements are non-positive.

787

That is, $r = b(\vec p) + c_j - f$ with $f = -\sum_{i\ne j} a_i c_i \ge 0$.

788

We introduce a new parameter $t$ with

789

context constraints $t \ge -b(\vec p)$ and $t \ge 0$ and replace

790

the column variable $c_j$ by $c' + t$. The row $r$ is now equal

791

to $b(\vec p) + t + c' - f$. The constant term of this row is always

792

non-negative because any negative value of $b(\vec p)$ is compensated

793

by $t \ge -b(\vec p)$ while and non-negative value remains non-negative

794

because $t \ge 0$.

795

796

We need to show that this transformation does not eliminate any valid

797

solutions and that it does not introduce any spurious solutions.

798

Given a valid solution for the original problem, we need to find

799

a non-negative value of $c'$ satisfying the constraints.

800

If $b(\vec p) \ge 0$, we can take $t = 0$ so that

801

$c' = c_j - t = c_j \ge 0$.

802

If $b(\vec p) < 0$, we can take $t = -b(\vec p)$.

803

Since $r = b(\vec p) + c_j - f \ge 0$ and $f \ge 0$, we have

804

$c' = c_j + b(\vec p) \ge 0$.

805

Note that these choices amount to plugging in

806

$t = \left|-b(\vec p)\right|_+ = \max(-b(\vec p), 0)$.

807

Conversely, given a solution to the new problem, we need to find

808

a non-negative value of $c_j$, but this is easy since $c_j = c' + t$

809

and both of these are non-negative.

810

811

Plugging in $t = \max(-b(\vec p), 0)$ can be performed as in

812

\autoref{s:post}, but, as in the case of offline symmetry detection,

813

it may be better to provide a direct representation for such

814

expressions in the internal representation of sets and relations

815

or at least in a quast-like output format.

816

817

\section{Coalescing}\label{s:coalescing}

818

819

See \shortciteN{Verdoolaege2009isl}, for now.

820

More details will be added later.

821

822

\section{Transitive Closure}

823

824

\subsection{Introduction}

825

826

\begin{definition}[Power of a Relation]

827

Let $R \in \Z^n \to 2^{\Z^{d+d}}$ be a relation and

828

$k \in \Z_{\ge 1}$

829

a positive number, then power $k$ of relation $R$ is defined as

830

\begin{equation}

831

\label{eq:transitive:power}

832

R^k \coloneqq

833

\begin{cases}

834

R & \text{if $k = 1$}

835

\\

836

R \circ R^{k-1} & \text{if $k \ge 2$}

.

\end{cases}

\end{equation}

\end{definition}

\begin{definition}[Transitive Closure of a Relation]

843

Let $R \in \Z^n \to 2^{\Z^{d+d}}$ be a relation,

844

then the transitive closure $R^+$ of $R$ is the union

845

of all positive powers of $R$,

846

$$

847

R^+ \coloneqq \bigcup_{k \ge 1} R^k

.

$$

\end{definition}

Alternatively, the transitive closure may be defined

852

inductively as

853

\begin{equation}

854

\label{eq:transitive:inductive}

855

R^+ \coloneqq R \cup \left(R \circ R^+\right)

.

\end{equation}

Since the transitive closure of a polyhedral relation

860

may no longer be a polyhedral relation \shortcite{Kelly1996closure},

861

we can, in the general case, only compute an approximation

862

of the transitive closure.

863

Whereas \shortciteN{Kelly1996closure} compute underapproximations,

864

we, like \shortciteN{Beletska2009}, compute overapproximations.

865

That is, given a relation $R$, we will compute a relation $T$

866

such that $R^+ \subseteq T$. Of course, we want this approximation

867

to be as close as possible to the actual transitive closure

868

$R^+$ and we want to detect the cases where the approximation is

869

exact, i.e., where $T = R^+$.

870

871

For computing an approximation of the transitive closure of $R$,

872

we follow the same general strategy as \shortciteN{Beletska2009}

873

and first compute an approximation of $R^k$ for $k \ge 1$ and then project

874

out the parameter $k$ from the resulting relation.

875

876

\begin{example}

877

As a trivial example, consider the relation

878

$R = \{\, x \to x + 1 \,\}$. The $k$th power of this map

879

for arbitrary $k$ is

880

$$

881

R^k = k \mapsto \{\, x \to x + k \mid k \ge 1 \,\}

882

.

883

$$

884

The transitive closure is then

885

$$

886

\begin{aligned}

887

R^+ & = \{\, x \to y \mid \exists k \in \Z_{\ge 1} : y = x + k \,\}

888

\\

889

& = \{\, x \to y \mid y \ge x + 1 \,\}

.

\end{aligned}

$$

\end{example}

\subsection{Computing an Approximation of $R^k$}

896

\label{s:power}

897

898

There are some special cases where the computation of $R^k$ is very easy.

899

One such case is that where $R$ does not compose with itself,

900

i.e., $R \circ R = \emptyset$ or $\domain R \cap \range R = \emptyset$.

901

In this case, $R^k$ is only non-empty for $k=1$ where it is equal

902

to $R$ itself.

903

904

In general, it is impossible to construct a closed form

905

of $R^k$ as a polyhedral relation.

906

We will therefore need to make some approximations.

907

As a first approximations, we will consider each of the basic

908

relations in $R$ as simply adding one or more offsets to a domain element

909

to arrive at an image element and ignore the fact that some of these

910

offsets may only be applied to some of the domain elements.

911

That is, we will only consider the difference set $\Delta\,R$ of the relation.

912

In particular, we will first construct a collection $P$ of paths

913

that move through

914

a total of $k$ offsets and then intersect domain and range of this

915

collection with those of $R$.

916

That is,

917

\begin{equation}

918

\label{eq:transitive:approx}

919

K = P \cap \left(\domain R \to \range R\right)

,

\end{equation}

with

\begin{equation}

\label{eq:transitive:path}

925

P = \vec s \mapsto \{\, \vec x \to \vec y \mid

926

\exists k_i \in \Z_{\ge 0}, \vec\delta_i \in k_i \, \Delta_i(\vec s) :

927

\vec y = \vec x + \sum_i \vec\delta_i

\wedge

\sum_i k_i = k > 0

\,\}

\end{equation}

and with $\Delta_i$ the basic sets that compose

933

the difference set $\Delta\,R$.

934

Note that the number of basic sets $\Delta_i$ need not be

935

the same as the number of basic relations in $R$.

936

Also note that since addition is commutative, it does not

937

matter in which order we add the offsets and so we are allowed

938

to group them as we did in \eqref{eq:transitive:path}.

939

940

If all the $\Delta_i$s are singleton sets

941

$\Delta_i = \{\, \vec \delta_i \,\}$ with $\vec \delta_i \in \Z^d$,

942

then \eqref{eq:transitive:path} simplifies to

943

\begin{equation}

944

\label{eq:transitive:singleton}

945

P = \{\, \vec x \to \vec y \mid

946

\exists k_i \in \Z_{\ge 0} :

947

\vec y = \vec x + \sum_i k_i \, \vec \delta_i

\wedge

\sum_i k_i = k > 0

\,\}

\end{equation}

and then the approximation computed in \eqref{eq:transitive:approx}

953

is essentially the same as that of \shortciteN{Beletska2009}.

954

If some of the $\Delta_i$s are not singleton sets or if

955

some of $\vec \delta_i$s are parametric, then we need

956

to resort to further approximations.

957

958

To ease both the exposition and the implementation, we will for

959

the remainder of this section work with extended offsets

960

$\Delta_i' = \Delta_i \times \{\, 1 \,\}$.

961

That is, each offset is extended with an extra coordinate that is

962

set equal to one. The paths constructed by summing such extended

963

offsets have the length encoded as the difference of their

964

final coordinates. The path $P'$ can then be decomposed into

965

paths $P_i'$, one for each $\Delta_i$,

966

\begin{equation}

967

\label{eq:transitive:decompose}

968

P' = \left(

969

(P_m' \cup \identity) \circ \cdots \circ

970

(P_2' \cup \identity) \circ

971

(P_1' \cup \identity)

972

\right) \cap

973

\{\,

974

\vec x' \to \vec y' \mid y_{d+1} - x_{d+1} = k > 0

\,\}

,

\end{equation}

with

$$

P_i' = \vec s \mapsto \{\, \vec x' \to \vec y' \mid

981

\exists k \in \Z_{\ge 1}, \vec \delta \in k \, \Delta_i'(\vec s) :

982

\vec y' = \vec x' + \vec \delta

\,\}

.

$$

Note that each $P_i'$ contains paths of length at least one.

987

We therefore need to take the union with the identity relation

988

when composing the $P_i'$s to allow for paths that do not contain

989

any offsets from one or more $\Delta_i'$.

990

The path that consists of only identity relations is removed

991

by imposing the constraint $y_{d+1} - x_{d+1} > 0$.

992

Taking the union with the identity relation means that

993

that the relations we compose in \eqref{eq:transitive:decompose}

994

each consist of two basic relations. If there are $m$

995

disjuncts in the input relation, then a direct application

996

of the composition operation may therefore result in a relation

997

with $2^m$ disjuncts, which is prohibitively expensive.

998

It is therefore crucial to apply coalescing (\autoref{s:coalescing})

999

after each composition.

1000

1001

Let us now consider how to compute an overapproximation of $P_i'$.

1002

Those that correspond to singleton $\Delta_i$s are grouped together

1003

and handled as in \eqref{eq:transitive:singleton}.

1004

Note that this is just an optimization. The procedure described

1005

below would produce results that are at least as accurate.

1006

For simplicity, we first assume that no constraint in $\Delta_i'$

1007

involves any existentially quantified variables.

1008

We will return to existentially quantified variables at the end

1009

of this section.

1010

Without existentially quantified variables, we can classify

1011

the constraints of $\Delta_i'$ as follows

1012

\begin{enumerate}

1013

\item non-parametric constraints

1014

\begin{equation}

1015

\label{eq:transitive:non-parametric}

1016

A_1 \vec x + \vec c_1 \geq \vec 0

1017

\end{equation}

1018

\item purely parametric constraints

1019

\begin{equation}

1020

\label{eq:transitive:parametric}

1021

B_2 \vec s + \vec c_2 \geq \vec 0

1022

\end{equation}

1023

\item negative mixed constraints

1024

\begin{equation}

1025

\label{eq:transitive:mixed}

1026

A_3 \vec x + B_3 \vec s + \vec c_3 \geq \vec 0

1027

\end{equation}

1028

such that for each row $j$ and for all $\vec s$,

1029

$$

1030

\Delta_i'(\vec s) \cap

1031

\{\, \vec \delta' \mid B_{3,j} \vec s + c_{3,j} > 0 \,\}

1032

= \emptyset

1033

$$

1034

\item positive mixed constraints

1035

$$

1036

A_4 \vec x + B_4 \vec s + \vec c_4 \geq \vec 0

1037

$$

1038

such that for each row $j$, there is at least one $\vec s$ such that

1039

$$

1040

\Delta_i'(\vec s) \cap

1041

\{\, \vec \delta' \mid B_{4,j} \vec s + c_{4,j} > 0 \,\}

\ne \emptyset

$$

\end{enumerate}

We will use the following approximation $Q_i$ for $P_i'$:

1046

\begin{equation}

1047

\label{eq:transitive:Q}

1048

\begin{aligned}

1049

Q_i = \vec s \mapsto

1050

\{\,

1051

\vec x' \to \vec y'

1052

\mid {} & \exists k \in \Z_{\ge 1}, \vec f \in \Z^d :

1053

\vec y' = \vec x' + (\vec f, k)

\wedge {}

\\

&

A_1 \vec f + k \vec c_1 \geq \vec 0

1058

\wedge

1059

B_2 \vec s + \vec c_2 \geq \vec 0

1060

\wedge

1061

A_3 \vec f + B_3 \vec s + \vec c_3 \geq \vec 0

\,\}

.

\end{aligned}

\end{equation}

To prove that $Q_i$ is indeed an overapproximation of $P_i'$,

1067

we need to show that for every $\vec s \in \Z^n$, for every

1068

$k \in \Z_{\ge 1}$ and for every $\vec f \in k \, \Delta_i(\vec s)$

1069

we have that

1070

$(\vec f, k)$ satisfies the constraints in \eqref{eq:transitive:Q}.

1071

If $\Delta_i(\vec s)$ is non-empty, then $\vec s$ must satisfy

1072

the constraints in \eqref{eq:transitive:parametric}.

1073

Each element $(\vec f, k) \in k \, \Delta_i'(\vec s)$ is a sum

1074

of $k$ elements $(\vec f_j, 1)$ in $\Delta_i'(\vec s)$.

1075

Each of these elements satisfies the constraints in

1076

\eqref{eq:transitive:non-parametric}, i.e.,

$$

\left[

\begin{matrix}

A_1 & \vec c_1

\end{matrix}

\right]

\left[

\begin{matrix}

\vec f_j \\ 1

\end{matrix}

\right]

\ge \vec 0

.

$$

The sum of these elements therefore satisfies the same set of inequalities,

1092

i.e., $A_1 \vec f + k \vec c_1 \geq \vec 0$.

1093

Finally, the constraints in \eqref{eq:transitive:mixed} are such

1094

that for any $\vec s$ in the parameter domain of $\Delta$,

1095

we have $-\vec r(\vec s) \coloneqq B_3 \vec s + \vec c_3 \le \vec 0$,

1096

i.e., $A_3 \vec f_j \ge \vec r(\vec s) \ge \vec 0$

1097

and therefore also $A_3 \vec f \ge \vec r(\vec s)$.

1098

Note that if there are no mixed constraints and if the

1099

rational relaxation of $\Delta_i(\vec s)$, i.e.,

1100

$\{\, \vec x \in \Q^d \mid A_1 \vec x + \vec c_1 \ge \vec 0\,\}$,

1101

has integer vertices, then the approximation is exact, i.e.,

1102

$Q_i = P_i'$. In this case, the vertices of $\Delta'_i(\vec s)$

1103

generate the rational cone

1104

$\{\, \vec x' \in \Q^{d+1} \mid \left[

\begin{matrix}

A_1 & \vec c_1

\end{matrix}

\right] \vec x' \,\}$ and therefore $\Delta'_i(\vec s)$ is

1109

a Hilbert basis of this cone \shortcite[Theorem~16.4]{Schrijver1986}.

1110

1111

Note however that, as pointed out by \shortciteN{DeSmet2010personal},

1112

if there \emph{are} any mixed constraints, then the above procedure may

1113

not compute the most accurate affine approximation of

1114

$k \, \Delta_i(\vec s)$ with $k \ge 1$.

1115

In particular, we only consider the negative mixed constraints that

1116

happen to appear in the description of $\Delta_i(\vec s)$, while we

1117

should instead consider \emph{all} valid such constraints.

1118

It is also sufficient to consider those constraints because any

1119

constraint that is valid for $k \, \Delta_i(\vec s)$ is also

1120

valid for $1 \, \Delta_i(\vec s) = \Delta_i(\vec s)$.

1121

Take therefore any constraint

1122

$\spv a x + \spv b s + c \ge 0$ valid for $\Delta_i(\vec s)$.

1123

This constraint is also valid for $k \, \Delta_i(\vec s)$ iff

1124

$k \, \spv a x + \spv b s + c \ge 0$.

1125

If $\spv b s + c$ can attain any positive value, then $\spv a x$

1126

may be negative for some elements of $\Delta_i(\vec s)$.

1127

We then have $k \, \spv a x < \spv a x$ for $k > 1$ and so the constraint

1128

is not valid for $k \, \Delta_i(\vec s)$.

1129

We therefore need to impose $\spv b s + c \le 0$ for all values

1130

of $\vec s$ such that $\Delta_i(\vec s)$ is non-empty, i.e.,

1131

$\vec b$ and $c$ need to be such that $- \spv b s - c \ge 0$ is a valid

1132

constraint of $\Delta_i(\vec s)$. That is, $(\vec b, c)$ are the opposites

1133

of the coefficients of a valid constraint of $\Delta_i(\vec s)$.

1134

The approximation of $k \, \Delta_i(\vec s)$ can therefore be obtained

1135

using three applications of Farkas' lemma. The first obtains the coefficients

1136

of constraints valid for $\Delta_i(\vec s)$. The second obtains

1137

the coefficients of constraints valid for the projection of $\Delta_i(\vec s)$

1138

onto the parameters. The opposite of the second set is then computed

1139

and intersected with the first set. The result is the set of coefficients

1140

of constraints valid for $k \, \Delta_i(\vec s)$. A final application

1141

of Farkas' lemma is needed to obtain the approximation of

1142

$k \, \Delta_i(\vec s)$ itself.

1143

1144

\begin{example}

1145

Consider the relation

1146

$$

1147

n \to \{\, (x, y) \to (1 + x, 1 - n + y) \mid n \ge 2 \,\}

1148

.

1149

$$

1150

The difference set of this relation is

1151

$$

1152

\Delta = n \to \{\, (1, 1 - n) \mid n \ge 2 \,\}

1153

.

1154

$$

1155

Using our approach, we would only consider the mixed constraint

1156

$y - 1 + n \ge 0$, leading to the following approximation of the

1157

transitive closure:

1158

$$

1159

n \to \{\, (x, y) \to (o_0, o_1) \mid n \ge 2 \wedge o_1 \le 1 - n + y \wedge o_0 \ge 1 + x \,\}

1160

.

1161

$$

1162

If, instead, we apply Farkas's lemma to $\Delta$, i.e.,

1163

\begin{verbatim}

1164

D := [n] -> { [1, 1 - n] : n >= 2 };

1165

CD := coefficients D;

CD;

\end{verbatim}

we obtain

\begin{verbatim}

{ rat: coefficients[[c_cst, c_n] -> [i2, i3]] : i3 <= c_n and

1171

i3 <= c_cst + 2c_n + i2 }

1172

\end{verbatim}

1173

The pure-parametric constraints valid for $\Delta$,

1174

\begin{verbatim}

1175

P := { [a,b] -> [] }(D);

1176

CP := coefficients P;

CP;

\end{verbatim}

are

\begin{verbatim}

{ rat: coefficients[[c_cst, c_n] -> []] : c_n >= 0 and 2c_n >= -c_cst }

1182

\end{verbatim}

1183

Negating these coefficients and intersecting with \verb+CD+,

1184

\begin{verbatim}

1185

NCP := { rat: coefficients[[a,b] -> []]

1186

-> coefficients[[-a,-b] -> []] }(CP);

1187

CK := wrap((unwrap CD) * (dom (unwrap NCP)));

CK;

\end{verbatim}

we obtain

\begin{verbatim}

{ rat: [[c_cst, c_n] -> [i2, i3]] : i3 <= c_n and

1193

i3 <= c_cst + 2c_n + i2 and c_n <= 0 and 2c_n <= -c_cst }

1194

\end{verbatim}

1195

The approximation for $k\,\Delta$,

\begin{verbatim}

K := solutions CK;

K;

\end{verbatim}

is then

\begin{verbatim}

[n] -> { rat: [i0, i1] : i1 <= -i0 and i0 >= 1 and i1 <= 2 - n - i0 }

1203

\end{verbatim}

1204

Finally, the computed approximation for $R^+$,

1205

\begin{verbatim}

1206

T := unwrap({ [dx,dy] -> [[x,y] -> [x+dx,y+dy]] }(K));

1207

R := [n] -> { [x,y] -> [x+1,y+1-n] : n >= 2 };

1208

T := T * ((dom R) -> (ran R));

T;

\end{verbatim}

is

\begin{verbatim}

[n] -> { [x, y] -> [o0, o1] : o1 <= x + y - o0 and

1214

o0 >= 1 + x and o1 <= 2 - n + x + y - o0 and n >= 2 }

\end{verbatim}

\end{example}

Existentially quantified variables can be handled by

1219

classifying them into variables that are uniquely

1220

determined by the parameters, variables that are independent

1221

of the parameters and others. The first set can be treated

1222

as parameters and the second as variables. Constraints involving

1223

the other existentially quantified variables are removed.

1224

1225

\begin{example}

1226

Consider the relation

1227

$$

1228

R =

1229

n \to \{\, x \to y \mid \exists \, \alpha_0, \alpha_1: 7\alpha_0 = -2 + n \wedge 5\alpha_1 = -1 - x + y \wedge y \ge 6 + x \,\}

1230

.

1231

$$

1232

The difference set of this relation is

1233

$$

1234

\Delta = \Delta \, R =

1235

n \to \{\, x \mid \exists \, \alpha_0, \alpha_1: 7\alpha_0 = -2 + n \wedge 5\alpha_1 = -1 + x \wedge x \ge 6 \,\}

1236

.

1237

$$

1238

The existentially quantified variables can be defined in terms

1239

of the parameters and variables as

1240

$$

1241

\alpha_0 = \floor{\frac{-2 + n}7}

\qquad

\text{and}

\qquad

\alpha_1 = \floor{\frac{-1 + x}5}

1246

.

1247

$$

1248

$\alpha_0$ can therefore be treated as a parameter,

1249

while $\alpha_1$ can be treated as a variable.

1250

This in turn means that $7\alpha_0 = -2 + n$ can be treated as

1251

a purely parametric constraint, while the other two constraints are

1252

non-parametric.

1253

The corresponding $Q$~\eqref{eq:transitive:Q} is therefore

1254

$$

1255

\begin{aligned}

1256

n \to \{\, (x,z) \to (y,w) \mid

1257

\exists\, \alpha_0, \alpha_1, k, f : {} &

1258

k \ge 1 \wedge

1259

y = x + f \wedge

1260

w = z + k \wedge {} \\

1261

&

1262

7\alpha_0 = -2 + n \wedge

1263

5\alpha_1 = -k + x \wedge

x \ge 6 k

\,\}

.

\end{aligned}

$$

Projecting out the final coordinates encoding the length of the paths,

1270

results in the exact transitive closure

1271

$$

1272

R^+ =

1273

n \to \{\, x \to y \mid \exists \, \alpha_0, \alpha_1: 7\alpha_1 = -2 + n \wedge 6\alpha_0 \ge -x + y \wedge 5\alpha_0 \le -1 - x + y \,\}

.

$$

\end{example}

The fact that we ignore some impure constraints clearly leads

1279

to a loss of accuracy. In some cases, some of this loss can be recovered

1280

by not considering the parameters in a special way.

1281

That is, instead of considering the set

$$

\Delta = \diff R =

\vec s \mapsto

\{\, \vec \delta \in \Z^{d} \mid \exists \vec x \to \vec y \in R :

1286

\vec \delta = \vec y - \vec x

1287

\,\}

1288

$$

1289

we consider the set

1290

$$

1291

\Delta' = \diff R' =

1292

\{\, \vec \delta \in \Z^{n+d} \mid \exists

1293

(\vec s, \vec x) \to (\vec s, \vec y) \in R' :

1294

\vec \delta = (\vec s - \vec s, \vec y - \vec x)

\,\}

.

$$

The first $n$ coordinates of every element in $\Delta'$ are zero.

1299

Projecting out these zero coordinates from $\Delta'$ is equivalent

1300

to projecting out the parameters in $\Delta$.

1301

The result is obviously a superset of $\Delta$, but all its constraints

1302

are of type \eqref{eq:transitive:non-parametric} and they can therefore

1303

all be used in the construction of $Q_i$.

1304

1305

\begin{example}

1306

Consider the relation

1307

$$

1308

% [n] -> { [x, y] -> [1 + x, 1 - n + y] | n >= 2 }

1309

R = n \to \{\, (x, y) \to (1 + x, 1 - n + y) \mid n \ge 2 \,\}

.

$$

We have

$$

\diff R = n \to \{\, (1, 1 - n) \mid n \ge 2 \,\}

1315

$$

1316

and so, by treating the parameters in a special way, we obtain

1317

the following approximation for $R^+$:

1318

$$

1319

n \to \{\, (x, y) \to (x', y') \mid n \ge 2 \wedge y' \le 1 - n + y \wedge x' \ge 1 + x \,\}

1320

.

1321

$$

1322

If we consider instead

1323

$$

1324

R' = \{\, (n, x, y) \to (n, 1 + x, 1 - n + y) \mid n \ge 2 \,\}

$$

then

$$

\diff R' = \{\, (0, 1, y) \mid y \le -1 \,\}

1329

$$

1330

and we obtain the approximation

1331

$$

1332

n \to \{\, (x, y) \to (x', y') \mid n \ge 2 \wedge x' \ge 1 + x \wedge y' \le x + y - x' \,\}

1333

.

1334

$$

1335

If we consider both $\diff R$ and $\diff R'$, then we obtain

1336

$$

1337

n \to \{\, (x, y) \to (x', y') \mid n \ge 2 \wedge y' \le 1 - n + y \wedge x' \ge 1 + x \wedge y' \le x + y - x' \,\}

1338

.

1339

$$

1340

Note, however, that this is not the most accurate affine approximation that

1341

can be obtained. That would be

1342

$$

1343

n \to \{\, (x, y) \to (x', y') \mid y' \le 2 - n + x + y - x' \wedge n \ge 2 \wedge x' \ge 1 + x \,\}

.

$$

\end{example}

\subsection{Checking Exactness}

1349

1350

The approximation $T$ for the transitive closure $R^+$ can be obtained

1351

by projecting out the parameter $k$ from the approximation $K$

1352

\eqref{eq:transitive:approx} of the power $R^k$.

1353

Since $K$ is an overapproximation of $R^k$, $T$ will also be an

1354

overapproximation of $R^+$.

1355

To check whether the results are exact, we need to consider two

1356

cases depending on whether $R$ is {\em cyclic}, where $R$ is defined

1357

to be cyclic if $R^+$ maps any element to itself, i.e.,

1358

$R^+ \cap \identity \ne \emptyset$.

1359

If $R$ is acyclic, then the inductive definition of

1360

\eqref{eq:transitive:inductive} is equivalent to its completion,

1361

i.e.,

1362

$$

1363

R^+ = R \cup \left(R \circ R^+\right)

1364

$$

1365

is a defining property.

1366

Since $T$ is known to be an overapproximation, we only need to check

1367

whether

1368

$$

1369

T \subseteq R \cup \left(R \circ T\right)

1370

.

1371

$$

1372

This is essentially Theorem~5 of \shortciteN{Kelly1996closure}.

1373

The only difference is that they only consider lexicographically

1374

forward relations, a special case of acyclic relations.

1375

1376

If, on the other hand, $R$ is cyclic, then we have to resort

1377

to checking whether the approximation $K$ of the power is exact.

1378

Note that $T$ may be exact even if $K$ is not exact, so the check

1379

is sound, but incomplete.

1380

To check exactness of the power, we simply need to check

1381

\eqref{eq:transitive:power}. Since again $K$ is known

1382

to be an overapproximation, we only need to check whether

1383

$$

1384

\begin{aligned}

1385

K'|_{y_{d+1} - x_{d+1} = 1} & \subseteq R'

1386

\\

1387

K'|_{y_{d+1} - x_{d+1} \ge 2} & \subseteq R' \circ K'|_{y_{d+1} - x_{d+1} \ge 1}

,

\end{aligned}

$$

where $R' = \{\, \vec x' \to \vec y' \mid \vec x \to \vec y \in R

1392

\wedge y_{d+1} - x_{d+1} = 1\,\}$, i.e., $R$ extended with path

1393

lengths equal to 1.

1394

1395

All that remains is to explain how to check the cyclicity of $R$.

1396

Note that the exactness on the power is always sound, even

1397

in the acyclic case, so we only need to be careful that we find

1398

all cyclic cases. Now, if $R$ is cyclic, i.e.,

1399

$R^+ \cap \identity \ne \emptyset$, then, since $T$ is

1400

an overapproximation of $R^+$, also

1401

$T \cap \identity \ne \emptyset$. This in turn means

1402

that $\Delta \, K'$ contains a point whose first $d$ coordinates

1403

are zero and whose final coordinate is positive.

1404

In the implementation we currently perform this test on $P'$ instead of $K'$.

1405

Note that if $R^+$ is acyclic and $T$ is not, then the approximation

1406

is clearly not exact and the approximation of the power $K$

1407

will not be exact either.

1408

1409

\subsection{Decomposing $R$ into strongly connected components}

1410

1411

If the input relation $R$ is a union of several basic relations

1412

that can be partially ordered

1413

then the accuracy of the approximation may be improved by computing

1414

an approximation of each strongly connected components separately.

1415

For example, if $R = R_1 \cup R_2$ and $R_1 \circ R_2 = \emptyset$,

1416

then we know that any path that passes through $R_2$ cannot later

1417

pass through $R_1$, i.e.,

1418

\begin{equation}

1419

\label{eq:transitive:components}

1420

R^+ = R_1^+ \cup R_2^+ \cup \left(R_2^+ \circ R_1^+\right)

1421

.

1422

\end{equation}

1423

We can therefore compute (approximations of) transitive closures

1424

of $R_1$ and $R_2$ separately.

1425

Note, however, that the condition $R_1 \circ R_2 = \emptyset$

1426

is actually too strong.

1427

If $R_1 \circ R_2$ is a subset of $R_2 \circ R_1$

1428

then we can reorder the segments

1429

in any path that moves through both $R_1$ and $R_2$ to

1430

first move through $R_1$ and then through $R_2$.

1431

1432

This idea can be generalized to relations that are unions

1433

of more than two basic relations by constructing the

1434

strongly connected components in the graph with as vertices

1435

the basic relations and an edge between two basic relations

1436

$R_i$ and $R_j$ if $R_i$ needs to follow $R_j$ in some paths.

1437

That is, there is an edge from $R_i$ to $R_j$ iff

1438

\begin{equation}

1439

\label{eq:transitive:edge}

R_i \circ R_j

\not\subseteq

R_j \circ R_i

.

\end{equation}

The components can be obtained from the graph by applying

1446

Tarjan's algorithm \shortcite{Tarjan1972}.

1447

1448

In practice, we compute the (extended) powers $K_i'$ of each component

1449

separately and then compose them as in \eqref{eq:transitive:decompose}.

1450

Note, however, that in this case the order in which we apply them is

1451

important and should correspond to a topological ordering of the

1452

strongly connected components. Simply applying Tarjan's

1453

algorithm will produce topologically sorted strongly connected components.

1454

The graph on which Tarjan's algorithm is applied is constructed on-the-fly.

1455

That is, whenever the algorithm checks if there is an edge between

1456

two vertices, we evaluate \eqref{eq:transitive:edge}.

1457

The exactness check is performed on each component separately.

1458

If the approximation turns out to be inexact for any of the components,

1459

then the entire result is marked inexact and the exactness check

1460

is skipped on the components that still need to be handled.

1461

1462

It should be noted that \eqref{eq:transitive:components}

1463

is only valid for exact transitive closures.

1464

If overapproximations are computed in the right hand side, then the result will

1465

still be an overapproximation of the left hand side, but this result

1466

may not be transitively closed. If we only separate components based

1467

on the condition $R_i \circ R_j = \emptyset$, then there is no problem,

1468

as this condition will still hold on the computed approximations

1469

of the transitive closures. If, however, we have exploited

1470

\eqref{eq:transitive:edge} during the decomposition and if the

1471

result turns out not to be exact, then we check whether

1472

the result is transitively closed. If not, we recompute

1473

the transitive closure, skipping the decomposition.

1474

Note that testing for transitive closedness on the result may

1475

be fairly expensive, so we may want to make this check

configurable.

\begin{figure}

\begin{center}

\begin{tikzpicture}[x=0.5cm,y=0.5cm,>=stealth,shorten >=1pt]

1481

\foreach \x in {1,...,10}{

1482

\foreach \y in {1,...,10}{

1483

\draw[->] (\x,\y) -- (\x,\y+1);

1484

}

1485

}

1486

\foreach \x in {1,...,20}{

1487

\foreach \y in {5,...,15}{

1488

\draw[->] (\x,\y) -- (\x+1,\y);

}

}

\end{tikzpicture}

\end{center}

\caption{The relation from \autoref{ex:closure4}}

\label{f:closure4}

\end{figure}

\begin{example}

\label{ex:closure4}

1498

Consider the relation in example {\tt closure4} that comes with

1499

the Omega calculator~\shortcite{Omega_calc}, $R = R_1 \cup R_2$,

with

$$

\begin{aligned}

R_1 & = \{\, (x,y) \to (x,y+1) \mid 1 \le x,y \le 10 \,\}

1504

\\

1505

R_2 & = \{\, (x,y) \to (x+1,y) \mid 1 \le x \le 20 \wedge 5 \le y \le 15 \,\}

.

\end{aligned}

$$

This relation is shown graphically in \autoref{f:closure4}.

We have

$$

\begin{aligned}

R_1 \circ R_2 &=

\{\, (x,y) \to (x+1,y+1) \mid 1 \le x \le 9 \wedge 5 \le y \le 10 \,\}

1515

\\

1516

R_2 \circ R_1 &=

1517

\{\, (x,y) \to (x+1,y+1) \mid 1 \le x \le 10 \wedge 4 \le y \le 10 \,\}

.

\end{aligned}

$$

Clearly, $R_1 \circ R_2 \subseteq R_2 \circ R_1$ and so

$$

\left(

R_1 \cup R_2

\right)^+

=

\left(R_2^+ \circ R_1^+\right)

\cup R_1^+

\cup R_2^+

.

$$

\end{example}

\begin{figure}

\newcounter{n}

\newcounter{t1}

\newcounter{t2}

\newcounter{t3}

\newcounter{t4}

\begin{center}

\begin{tikzpicture}[>=stealth,shorten >=1pt]

1542

\setcounter{n}{7}

1543

\foreach \i in {1,...,\value{n}}{

1544

\foreach \j in {1,...,\value{n}}{

1545

\setcounter{t1}{2 * \j - 4 - \i + 1}

1546

\setcounter{t2}{\value{n} - 3 - \i + 1}

1547

\setcounter{t3}{2 * \i - 1 - \j + 1}

1548

\setcounter{t4}{\value{n} - \j + 1}

1549

\ifnum\value{t1}>0\ifnum\value{t2}>0

1550

\ifnum\value{t3}>0\ifnum\value{t4}>0

1551

\draw[thick,->] (\i,\j) to[out=20] (\i+3,\j);

1552

\fi\fi\fi\fi

1553

\setcounter{t1}{2 * \j - 1 - \i + 1}

1554

\setcounter{t2}{\value{n} - \i + 1}

1555

\setcounter{t3}{2 * \i - 4 - \j + 1}

1556

\setcounter{t4}{\value{n} - 3 - \j + 1}

1557

\ifnum\value{t1}>0\ifnum\value{t2}>0

1558

\ifnum\value{t3}>0\ifnum\value{t4}>0

1559

\draw[thick,->] (\i,\j) to[in=-20,out=20] (\i,\j+3);

1560

\fi\fi\fi\fi

1561

\setcounter{t1}{2 * \j - 1 - \i + 1}

1562

\setcounter{t2}{\value{n} - 1 - \i + 1}

1563

\setcounter{t3}{2 * \i - 1 - \j + 1}

1564

\setcounter{t4}{\value{n} - 1 - \j + 1}

1565

\ifnum\value{t1}>0\ifnum\value{t2}>0

1566

\ifnum\value{t3}>0\ifnum\value{t4}>0

1567

\draw[thick,->] (\i,\j) to (\i+1,\j+1);

\fi\fi\fi\fi

}

}

\end{tikzpicture}

\end{center}

\caption{The relation from \autoref{ex:decomposition}}

1574

\label{f:decomposition}

1575

\end{figure}

1576

\begin{example}

1577

\label{ex:decomposition}

1578

Consider the relation on the right of \shortciteN[Figure~2]{Beletska2009},

1579

reproduced in \autoref{f:decomposition}.

1580

The relation can be described as $R = R_1 \cup R_2 \cup R_3$,

with

$$

\begin{aligned}

R_1 &= n \mapsto \{\, (i,j) \to (i+3,j) \mid

1585

i \le 2 j - 4 \wedge

1586

i \le n - 3 \wedge

1587

j \le 2 i - 1 \wedge

1588

j \le n \,\}

1589

\\

1590

R_2 &= n \mapsto \{\, (i,j) \to (i,j+3) \mid

1591

i \le 2 j - 1 \wedge

1592

i \le n \wedge

1593

j \le 2 i - 4 \wedge

1594

j \le n - 3 \,\}

1595

\\

1596

R_3 &= n \mapsto \{\, (i,j) \to (i+1,j+1) \mid

1597

i \le 2 j - 1 \wedge

1598

i \le n - 1 \wedge

1599

j \le 2 i - 1 \wedge

j \le n - 1\,\}

.

\end{aligned}

$$

The figure shows this relation for $n = 7$.

1605

Both

1606

$R_3 \circ R_1 \subseteq R_1 \circ R_3$

1607

and

1608

$R_3 \circ R_2 \subseteq R_2 \circ R_3$,

1609

which the reader can verify using the {\tt iscc} calculator:

1610

\begin{verbatim}

1611

R1 := [n] -> { [i,j] -> [i+3,j] : i <= 2 j - 4 and i <= n - 3 and

1612

j <= 2 i - 1 and j <= n };

1613

R2 := [n] -> { [i,j] -> [i,j+3] : i <= 2 j - 1 and i <= n and

1614

j <= 2 i - 4 and j <= n - 3 };

1615

R3 := [n] -> { [i,j] -> [i+1,j+1] : i <= 2 j - 1 and i <= n - 1 and

1616

j <= 2 i - 1 and j <= n - 1 };

1617

(R1 . R3) - (R3 . R1);

1618

(R2 . R3) - (R3 . R2);

1619

\end{verbatim}

1620

$R_3$ can therefore be moved forward in any path.

1621

For the other two basic relations, we have both

1622

$R_2 \circ R_1 \not\subseteq R_1 \circ R_2$

1623

and

1624

$R_1 \circ R_2 \not\subseteq R_2 \circ R_1$

1625

and so $R_1$ and $R_2$ form a strongly connected component.

1626

By computing the power of $R_3$ and $R_1 \cup R_2$ separately

1627

and composing the results, the power of $R$ can be computed exactly

1628

using \eqref{eq:transitive:singleton}.

1629

As explained by \shortciteN{Beletska2009}, applying the same formula

1630

to $R$ directly, without a decomposition, would result in

1631

an overapproximation of the power.

1632

\end{example}

1633

1634

\subsection{Partitioning the domains and ranges of $R$}

1635

1636

The algorithm of \autoref{s:power} assumes that the input relation $R$

1637

can be treated as a union of translations.

1638

This is a reasonable assumption if $R$ maps elements of a given

1639

abstract domain to the same domain.

1640

However, if $R$ is a union of relations that map between different

1641

domains, then this assumption no longer holds.

1642

In particular, when an entire dependence graph is encoded

1643

in a single relation, as is done by, e.g.,

1644

\shortciteN[Section~6.1]{Barthou2000MSE}, then it does not make

1645

sense to look at differences between iterations of different domains.

1646

Now, arguably, a modified Floyd-Warshall algorithm should

1647

be applied to the dependence graph, as advocated by

1648

\shortciteN{Kelly1996closure}, with the transitive closure operation

1649

only being applied to relations from a given domain to itself.

1650

However, it is also possible to detect disjoint domains and ranges

1651

and to apply Floyd-Warshall internally.

\linesnumbered

\begin{algorithm}

\caption{The modified Floyd-Warshall algorithm of

1656

\protect\shortciteN{Kelly1996closure}}

1657

\label{a:Floyd}

1658

\SetKwInput{Input}{Input}

1659

\SetKwInput{Output}{Output}

1660

\Input{Relations $R_{pq}$, $0 \le p, q < n$}

1661

\Output{Updated relations $R_{pq}$ such that each relation

1662

$R_{pq}$ contains all indirect paths from $p$ to $q$ in the input graph}

%

\BlankLine

\SetVline

\dontprintsemicolon

1667

%

1668

\For{$r \in [0, n-1]$}{

1669

$R_{rr} \coloneqq R_{rr}^+$ \nllabel{l:Floyd:closure}\;

1670

\For{$p \in [0, n-1]$}{

1671

\For{$q \in [0, n-1]$}{

1672

\If{$p \ne r$ or $q \ne r$}{

1673

$R_{pq} \coloneqq R_{pq} \cup \left(R_{rq} \circ R_{pr}\right)

1674

\cup \left(R_{rq} \circ R_{rr} \circ R_{pr}\right)$

1675

\nllabel{l:Floyd:update}

}

}

}

}

\end{algorithm}

Let the input relation $R$ be a union of $m$ basic relations $R_i$.

1683

Let $D_{2i}$ be the domains of $R_i$ and $D_{2i+1}$ the ranges of $R_i$.

1684

The first step is to group overlapping $D_j$ until a partition is

1685

obtained. If the resulting partition consists of a single part,

1686

then we continue with the algorithm of \autoref{s:power}.

1687

Otherwise, we apply Floyd-Warshall on the graph with as vertices

1688

the parts of the partition and as edges the $R_i$ attached to

1689

the appropriate pairs of vertices.

1690

In particular, let there be $n$ parts $P_k$ in the partition.

1691

We construct $n^2$ relations

1692

$$

1693

R_{pq} \coloneqq \bigcup_{i \text{ s.t. } \domain R_i \subseteq P_p \wedge

1694

\range R_i \subseteq P_q} R_i

1695

,

1696

$$

1697

apply \autoref{a:Floyd} and return the union of all resulting

1698

$R_{pq}$ as the transitive closure of $R$.

1699

Each iteration of the $r$-loop in \autoref{a:Floyd} updates

1700

all relations $R_{pq}$ to include paths that go from $p$ to $r$,

1701

possibly stay there for a while, and then go from $r$ to $q$.

1702

Note that paths that ``stay in $r$'' include all paths that

1703

pass through earlier vertices since $R_{rr}$ itself has been updated

1704

accordingly in previous iterations of the outer loop.

1705

In principle, it would be sufficient to use the $R_{pr}$

1706

and $R_{rq}$ computed in the previous iteration of the

1707

$r$-loop in Line~\ref{l:Floyd:update}.

1708

However, from an implementation perspective, it is easier

1709

to allow either or both of these to have been updated

1710

in the same iteration of the $r$-loop.

1711

This may result in duplicate paths, but these can usually

1712

be removed by coalescing (\autoref{s:coalescing}) the result of the union

1713

in Line~\ref{l:Floyd:update}, which should be done in any case.

1714

The transitive closure in Line~\ref{l:Floyd:closure}

1715

is performed using a recursive call. This recursive call

1716

includes the partitioning step, but the resulting partition will

1717

usually be a singleton.

1718

The result of the recursive call will either be exact or an

1719

overapproximation. The final result of Floyd-Warshall is therefore

1720

also exact or an overapproximation.

\begin{figure}

\begin{center}

\begin{tikzpicture}[x=1cm,y=1cm,>=stealth,shorten >=3pt]

1725

\foreach \x/\y in {0/0,1/1,3/2} {

1726

\fill (\x,\y) circle (2pt);

1727

}

1728

\foreach \x/\y in {0/1,2/2,3/3} {

1729

\draw (\x,\y) circle (2pt);

1730

}

1731

\draw[->] (0,0) -- (0,1);

1732

\draw[->] (0,1) -- (1,1);

1733

\draw[->] (2,2) -- (3,2);

1734

\draw[->] (3,2) -- (3,3);

1735

\draw[->,dashed] (2,2) -- (3,3);

1736

\draw[->,dotted] (0,0) -- (1,1);

1737

\end{tikzpicture}

1738

\end{center}

1739

\caption{The relation (solid arrows) on the right of Figure~1 of

1740

\protect\shortciteN{Beletska2009} and its transitive closure}

\label{f:COCOA:1}

\end{figure}

\begin{example}

Consider the relation on the right of Figure~1 of

1745

\shortciteN{Beletska2009},

1746

reproduced in \autoref{f:COCOA:1}.

1747

This relation can be described as

1748

$$

1749

\begin{aligned}

1750

\{\, (x, y) \to (x_2, y_2) \mid {} & (3y = 2x \wedge x_2 = x \wedge 3y_2 = 3 + 2x \wedge x \ge 0 \wedge x \le 3) \vee {} \\

1751

& (x_2 = 1 + x \wedge y_2 = y \wedge x \ge 0 \wedge 3y \ge 2 + 2x \wedge x \le 2 \wedge 3y \le 3 + 2x) \,\}

.

\end{aligned}

$$

Note that the domain of the upward relation overlaps with the range

1756

of the rightward relation and vice versa, but that the domain

1757

of neither relation overlaps with its own range or the domain of

1758

the other relation.

1759

The domains and ranges can therefore be partitioned into two parts,

1760

$P_0$ and $P_1$, shown as the white and black dots in \autoref{f:COCOA:1},

respectively.

Initially, we have

$$

\begin{aligned}

R_{00} & = \emptyset

1766

\\

1767

R_{01} & =

1768

\{\, (x, y) \to (x+1, y) \mid

1769

(x \ge 0 \wedge 3y \ge 2 + 2x \wedge x \le 2 \wedge 3y \le 3 + 2x) \,\}

1770

\\

1771

R_{10} & =

1772

\{\, (x, y) \to (x_2, y_2) \mid (3y = 2x \wedge x_2 = x \wedge 3y_2 = 3 + 2x \wedge x \ge 0 \wedge x \le 3) \,\}

1773

\\

1774

R_{11} & = \emptyset

.

\end{aligned}

$$

In the first iteration, $R_{00}$ remains the same ($\emptyset^+ = \emptyset$).

1779

$R_{01}$ and $R_{10}$ are therefore also unaffected, but

1780

$R_{11}$ is updated to include $R_{01} \circ R_{10}$, i.e.,

1781

the dashed arrow in the figure.

1782

This new $R_{11}$ is obviously transitively closed, so it is not

1783

changed in the second iteration and it does not have an effect

1784

on $R_{01}$ and $R_{10}$. However, $R_{00}$ is updated to

1785

include $R_{10} \circ R_{01}$, i.e., the dotted arrow in the figure.

1786

The transitive closure of the original relation is then equal to

1787

$R_{00} \cup R_{01} \cup R_{10} \cup R_{11}$.

1788

\end{example}

1789

1790

\subsection{Incremental Computation}

1791

\label{s:incremental}

1792

1793

In some cases it is possible and useful to compute the transitive closure

1794

of union of basic relations incrementally. In particular,

1795

if $R$ is a union of $m$ basic maps,

$$

R = \bigcup_j R_j

,

$$

then we can pick some $R_i$ and compute the transitive closure of $R$ as

1801

\begin{equation}

1802

\label{eq:transitive:incremental}

R^+ = R_i^+ \cup

\left(

\bigcup_{j \ne i}

R_i^* \circ R_j \circ R_i^*

\right)^+

.

\end{equation}

For this approach to be successful, it is crucial that each

1811

of the disjuncts in the argument of the second transitive

1812

closure in \eqref{eq:transitive:incremental} be representable

1813

as a single basic relation, i.e., without a union.

1814

If this condition holds, then by using \eqref{eq:transitive:incremental},

1815

the number of disjuncts in the argument of the transitive closure

1816

can be reduced by one.

1817

Now, $R_i^* = R_i^+ \cup \identity$, but in some cases it is possible

1818

to relax the constraints of $R_i^+$ to include part of the identity relation,

1819

say on domain $D$. We will use the notation

1820

${\cal C}(R_i,D) = R_i^+ \cup \identity_D$ to represent

1821

this relaxed version of $R^+$.

1822

\shortciteN{Kelly1996closure} use the notation $R_i^?$.

1823

${\cal C}(R_i,D)$ can be computed by allowing $k$ to attain

1824

the value $0$ in \eqref{eq:transitive:Q} and by using

1825

$$

1826

P \cap \left(D \to D\right)

1827

$$

1828

instead of \eqref{eq:transitive:approx}.

1829

Typically, $D$ will be a strict superset of both $\domain R_i$

1830

and $\range R_i$. We therefore need to check that domain

1831

and range of the transitive closure are part of ${\cal C}(R_i,D)$,

1832

i.e., the part that results from the paths of positive length ($k \ge 1$),

1833

are equal to the domain and range of $R_i$.

1834

If not, then the incremental approach cannot be applied for

1835

the given choice of $R_i$ and $D$.

1836

1837

In order to be able to replace $R^*$ by ${\cal C}(R_i,D)$

1838

in \eqref{eq:transitive:incremental}, $D$ should be chosen

1839

to include both $\domain R$ and $\range R$, i.e., such

1840

that $\identity_D \circ R_j \circ \identity_D = R_j$ for all $j\ne i$.

1841

\shortciteN{Kelly1996closure} say that they use

1842

$D = \domain R_i \cup \range R_i$, but presumably they mean that

1843

they use $D = \domain R \cup \range R$.

1844

Now, this expression of $D$ contains a union, so it not directly usable.

1845

\shortciteN{Kelly1996closure} do not explain how they avoid this union.

1846

Apparently, in their implementation,

1847

they are using the convex hull of $\domain R \cup \range R$

1848

or at least an approximation of this convex hull.

1849

We use the simple hull (\autoref{s:simple hull}) of $\domain R \cup \range R$.

1850

1851

It is also possible to use a domain $D$ that does {\em not\/}

1852

include $\domain R \cup \range R$, but then we have to

1853

compose with ${\cal C}(R_i,D)$ more selectively.

1854

In particular, if we have

1855

\begin{equation}

1856

\label{eq:transitive:right}

1857

\text{for each $j \ne i$ either }

1858

\domain R_j \subseteq D \text{ or } \domain R_j \cap \range R_i = \emptyset

\end{equation}

and, similarly,

\begin{equation}

\label{eq:transitive:left}

1863

\text{for each $j \ne i$ either }

1864

\range R_j \subseteq D \text{ or } \range R_j \cap \domain R_i = \emptyset

1865

\end{equation}

1866

then we can refine \eqref{eq:transitive:incremental} to

$$

R_i^+ \cup

\left(

\left(

\bigcup_{\shortstack{$\scriptstyle\domain R_j \subseteq D $\\

1872

$\scriptstyle\range R_j \subseteq D$}}

1873

{\cal C} \circ R_j \circ {\cal C}

\right)

\cup

\left(

\bigcup_{\shortstack{$\scriptstyle\domain R_j \cap \range R_i = \emptyset$\\

1878

$\scriptstyle\range R_j \subseteq D$}}

\!\!\!\!\!

{\cal C} \circ R_j

\right)

\cup

\left(

\bigcup_{\shortstack{$\scriptstyle\domain R_j \subseteq D $\\

1885

$\scriptstyle\range R_j \cap \domain R_i = \emptyset$}}

\!\!\!\!\!

R_j \circ {\cal C}

\right)

\cup

\left(

\bigcup_{\shortstack{$\scriptstyle\domain R_j \cap \range R_i = \emptyset$\\

1892

$\scriptstyle\range R_j \cap \domain R_i = \emptyset$}}

\!\!\!\!\!

R_j

\right)

\right)^+

.

$$

If only property~\eqref{eq:transitive:right} holds,

we can use

$$

R_i^+ \cup

\left(

\left(

R_i^+ \cup \identity

\right)

\circ

\left(

\left(

\bigcup_{\shortstack{$\scriptstyle\domain R_j \subseteq D $}}

R_j \circ {\cal C}

\right)

\cup

\left(

\bigcup_{\shortstack{$\scriptstyle\domain R_j \cap \range R_i = \emptyset$}}

\!\!\!\!\!

R_j

\right)

\right)^+

\right)

,

$$

while if only property~\eqref{eq:transitive:left} holds,

we can use

$$

R_i^+ \cup

\left(

\left(

\left(

\bigcup_{\shortstack{$\scriptstyle\range R_j \subseteq D $}}

{\cal C} \circ R_j

\right)

\cup

\left(

\bigcup_{\shortstack{$\scriptstyle\range R_j \cap \domain R_i = \emptyset$}}

\!\!\!\!\!

R_j

\right)

\right)^+

\circ

\left(

R_i^+ \cup \identity

\right)

\right)

.

$$

It should be noted that if we want the result of the incremental

1949

approach to be transitively closed, then we can only apply it

1950

if all of the transitive closure operations involved are exact.

1951

If, say, the second transitive closure in \eqref{eq:transitive:incremental}

1952

contains extra elements, then the result does not necessarily contain

1953

the composition of these extra elements with powers of $R_i$.

1954

1955

\subsection{An {\tt Omega}-like implementation}

1956

1957

While the main algorithm of \shortciteN{Kelly1996closure} is

1958

designed to compute and underapproximation of the transitive closure,

1959

the authors mention that they could also compute overapproximations.

1960

In this section, we describe our implementation of an algorithm

1961

that is based on their ideas.

1962

Note that the {\tt Omega} library computes underapproximations

1963

\shortcite[Section 6.4]{Omega_lib}.

1964

1965

The main tool is Equation~(2) of \shortciteN{Kelly1996closure}.

1966

The input relation $R$ is first overapproximated by a ``d-form'' relation

1967

$$

1968

\{\, \vec i \to \vec j \mid \exists \vec \alpha :

1969

\vec L \le \vec j - \vec i \le \vec U

1970

\wedge

1971

(\forall p : j_p - i_p = M_p \alpha_p)

\,\}

,

$$

where $p$ ranges over the dimensions and $\vec L$, $\vec U$ and

1976

$\vec M$ are constant integer vectors. The elements of $\vec U$

1977

may be $\infty$, meaning that there is no upper bound corresponding

1978

to that element, and similarly for $\vec L$.

1979

Such an overapproximation can be obtained by computing strides,

1980

lower and upper bounds on the difference set $\Delta \, R$.

1981

The transitive closure of such a ``d-form'' relation is

1982

\begin{equation}

1983

\label{eq:omega}

1984

\{\, \vec i \to \vec j \mid \exists \vec \alpha, k :

1985

k \ge 1 \wedge

1986

k \, \vec L \le \vec j - \vec i \le k \, \vec U

1987

\wedge

1988

(\forall p : j_p - i_p = M_p \alpha_p)

\,\}

.

\end{equation}

The domain and range of this transitive closure are then

1993

intersected with those of the input relation.

1994

This is a special case of the algorithm in \autoref{s:power}.

1995

1996

In their algorithm for computing lower bounds, the authors

1997

use the above algorithm as a substep on the disjuncts in the relation.

1998

At the end, they say

1999

\begin{quote}

2000

If an upper bound is required, it can be calculated in a manner

2001

similar to that of a single conjunct [sic] relation.

2002

\end{quote}

2003

Presumably, the authors mean that a ``d-form'' approximation

2004

of the whole input relation should be used.

2005

However, the accuracy can be improved by also trying to

2006

apply the incremental technique from the same paper,

2007

which is explained in more detail in \autoref{s:incremental}.

2008

In this case, ${\cal C}(R_i,D)$ can be obtained by

2009

allowing the value zero for $k$ in \eqref{eq:omega},

2010

i.e., by computing

2011

$$

2012

\{\, \vec i \to \vec j \mid \exists \vec \alpha, k :

2013

k \ge 0 \wedge

2014

k \, \vec L \le \vec j - \vec i \le k \, \vec U

2015

\wedge

2016

(\forall p : j_p - i_p = M_p \alpha_p)

\,\}

.

$$

In our implementation we take as $D$ the simple hull

2021

(\autoref{s:simple hull}) of $\domain R \cup \range R$.

2022

To determine whether it is safe to use ${\cal C}(R_i,D)$,

2023

we check the following conditions, as proposed by

2024

\shortciteN{Kelly1996closure}:

2025

${\cal C}(R_i,D) - R_i^+$ is not a union and for each $j \ne i$

2026

the condition

2027

$$

2028

\left({\cal C}(R_i,D) - R_i^+\right)

\circ

R_j

\circ

\left({\cal C}(R_i,D) - R_i^+\right)

=

R_j

$$

holds.

nexmon – Blame information for rev 1