前言

本文是南京大学李樾、谭添老师的《软件分析》课程的笔记。内容来自课程ppt以及课程讲解。

课程主页：Static Program Analysis | Tai-e (pascal-lab.net)

授课视频：视频南京大学《软件分析》课程01（Introduction）_哔哩哔哩_bilibili

课程实验repo：pascal-lab/Tai-e-assignments: Tai-e assignments for static program analysis (github.com)

Iterative Algorithm, Another View
Partial Order
Upper and Lower Bounds
Lattice, Semilattice, Complete and Product Lattice
Data Flow Analysis Framework via Lattice
Monotonicity and Fixed Point Theorem
Relate Iterative Algorithm to Fixed Point Theorem
May/Must Analysis, A Lattice View
MOP and Distributivity
Constant Propagation
Worklist Algorithm

Iterative Algorithm, Anthor View

Given a CFG (program) with k nodes, the iterative algorithm updates OUT[n] for every node n in each iteration.
Assume the domain of the values in data flow analysis is V, then we can define a k-tuple

$(OUT[n_1], OUT[n_2], …, OUT[n_k])$

as an element of set $(V_1 × V_2 × … × V_3)$denoted as $V^k$ , to hold the values of the analysis after each iteration
Each iteration can be considered as taking an action to map an element of Vk to a new element of Vk , through applying the transfer functions and control-flow handing, abstracted as a function$F: V^k \rightarrow V^k$
Then the algorithm outputs a series of k-tuples iteratively until a k-tuple is the same as the last one in two consecutive iterations

Given a CFG (program) with $k$ nodes, the iterative algorithm updates $OUT[n]$ for every node n in each iteration.

Each iteration takes an action$F: V^k \rightarrow V^k$

init → $(\bot, \bot …, \bot) = X_0$
iter 1 → $(v_1^1, v_2^1, …, v_k^1) = X_1 = F(X_0)$
iter 2 → $(v_1^2, v_2^2, …, v_k^2) = X_2 = F(X_1)$
…
iter i → $(v1^i, v_2^i, …, v_k^i) = X_i = F(X{i-1})\because Xi=X{i+1}$
iter i+1 → $(v1^i, v_2^i, …, v_k^i) = X{i+1} = F(X{i})\therefore X_i=X{i+1}=F(X_i)$

X is a fied of function F if $X=F(X)$

The iterative algorithm reaches a fixed point

The iterative algorithm (or the IN/OUT equation system) produces a solution to a data flow analysis

Is the algorithm guaranteed to terminate or reach the fixed point, or does it always have a solution?
If so, is there only one solution or only one fixed point? If more than one, is our solution the best one (most precise)?
When will the algorithm reach the fixed point, or when can we get the solution?

Partial Order

Definition

We define poset as a pair (P, ⊑) where ⊑ is a binary relation that defines a partial ordering over P, and ⊑ has the following properties:

∀x ∈ P, x ⊑ x (Reflexivity)
∀x, y ∈ P, x ⊑ y ∧ y ⊑ x ⟹ x = y (Antisymmetry)
∀x, y, z ∈ P, x ⊑ y ∧ y ⊑ z ⟹ x ⊑ z (Transitivity)

Example

Example 1

Is (S, ⊑) a poset where S is a set of integers and ⊑ represents ≤ (less than or equal to)?

Reflexivity 1 ≤ 1， 2 ≤ 2 √
Antisymmetry x ≤ y ∧ y ≤ x then x = y √
Transitivity 1 ≤ 2 ∧ 2 ≤ 3 then 1 ≤ 3 √

Example 2

Is (S, ⊑) a poset where S is a set of integers and ⊑ represents < (less than)?

(1) Reflexivity 1 < 1, 2< 2 ×

Example 3

Is (S, ⊑) a poset where S is a set of English words and ⊑ represents the substring relation, i.e., s1 ⊑ s2 means s1 is a substring of s2?

Reflexivity √
Antisymmetry √
Transitivity √

Example 4

Is (S, ⊑) a poset where S is the power set of set {a,b,c} and ⊑ represents ⊆ (subset)?

Reflexivity √
Antisymmetry √
Transitivity √

Upper and Lower Bounds

Given a poset (P, ⊑) and its subset S that S ⊆ P, we say that u ∈ P is an upper bound of S, if ∀x ∈ S, x ⊑ u. Similarly, l ∈ P is an lower bound of S, if ∀x ∈ S, l ⊑ x.

We define the least upper bound (lub or join) of S, written ⊔S, if for every upper bound of S, say u, ⊔S ⊑ u. Similarly, We define the greatest lower bound (glb, or meet) of S, written ⊓S, if for every lower bound of S, say l, l ⊑ ⊓S.

Usually, if S contains only two elements a and b (S = {a, b}), then ⊔S can be written a ⊔ b (the join of a and b) ⊓S can be written a ⊓ b (the meet of a and b)

Some Properties

Not every poset has lub or glb
But if a poset has lub or glb, it will be unique

Proof:

assume g1 and g2 are both glbs of poset P

according to the definition of glb

g1 ⊑ (g2 = ⊓P) and g2 ⊑ (g1 = ⊓P)

by the antisymmetry of partial order ⊑

g1 = g2

Lattice, Semilattice, Complete and Product Lattice

Lattice

Given a poset (P, ⊑), ∀a, b ∈ P, if a ⊔ b and a ⊓ b exist, then (P, ⊑) is called a lattice

A poset is a lattice if every pair of its elements has a least upper bound and a greatest lower bound