# Circuit Placement:

2000-Caldwell, Kahng, Markov; 2002-Kennings, Markov; 2006-Kennings, Vorwerk

Andrew A. Kennings, Univ. of Waterloo, Canada, http://gibbon.uwaterloo.ca/~akenning/ Igor L. Markov, Univ. of Michigan, USA, http://www.eecs.umich.edu/~imarkov/

**INDEX TERMS:** VSLI CAD, circuit, hypergraph, algorithm, physical design, placement, large-scale optimization, combinatorial optimization, partitioning, network flow, linear programming, non-linear optimization. **SYNONYMS:** EDA, netlist, layout, min-cut placement, min-cost max-flow, analytical placement, mathematical programming.

## **1 PROBLEM DEFINITION**

This problem is concerned with efficiently determining constrained positions of objects while minimizing a measure of interconnect between the objects, as in physical layout of integrated circuits, commonly done in 2-dimensions. While most formulations are NP-hard, modern circuits are so large that practical algorithms for placement must have near-linear runtime and memory requirements, but not necessarily produce optimal solutions. While early software for circuit placement was based on Simulated Annealing, research in algorithms identified more scalable techniques which are now being adopted in the Electronic Design Automation industry.

One models a circuit by a hypergraph  $G_h(V_h, E_h)$  with (i) vertices  $V_h = \{v_1, \ldots, v_n\}$  representing logic gates, standard cells, larger modules, or fixed I/O pads and (ii) hyperedges  $E_h = \{e_1, \ldots, e_m\}$ representing connections between modules. Every incident pair of a vertex and a hyperedge connect through a *pin* for a total of P pins in the hypergraph. Each vertex  $v_i \in V_h$  has width  $w_i$ , height  $h_i$ and area  $A_i$ . Hyperedges may also be weighted. Given  $G_h$ , circuit placement seeks center positions  $(x_i, y_i)$  for vertices that optimize a hypergraph-based objective subject to constraints (see below). A placement is captured by  $\mathbf{x} = (x_1, \cdots, x_n)$  and  $\mathbf{y} = (y_1, \cdots, y_n)$ .

**Objective:** Let  $C_k$  be the index set of the hypergraph vertices incident to hyperedge  $e_k$ . The total half-perimeter wire length (HPWL) of the circuit hypergraph is given by HPWL( $G_h$ ) =  $\sum_{e_k \in E_h} \text{HPWL}(e_k) = \sum_{e_k \in E_h} [\max_{i,j \in C_k} |x_i - x_j| + \max_{i,j \in C_k} |y_i - y_j|]$ . HPWL is piece-wise linear, separable in the x and y directions, convex, but not strictly convex. Among many objectives for circuit placement, it is the simplest and most common.

#### **Constraints:**

- 1. No overlap. The area occupied by any two vertices cannot overlap; i.e., either  $|x_i x_j| \ge \frac{1}{2}(w_i + w_j)$  or  $|y_i y_j| \ge \frac{1}{2}(h_i + h_j), \forall v_i, v_j \in V_h$ .
- 2. Fixed outline. Each vertex  $v_i \in V_h$  must be placed entirely within a specified rectangular region bounded by  $x_{\min}$  ( $y_{\min}$ ) and  $x_{\max}$  ( $y_{\max}$ ) which denote the left (bottom) and right (top) boundaries of the specified region.
- 3. **Discrete slots**. There is only a finite number of discrete positions, typically on a grid. However, in large-scale circuit layout, slot constraints are often ignored during *global placement*, and enforced only during *legalization* and *detail placement*.

Other constraints may include alignment, minimum and maximum spacing, etc. Many placement techniques temporarily relax overlap constraints into density constraints to avoid vertices clustered in small regions. A  $m \times n$  regular bin structure B is superimposed over the fixed outline and vertex area is assigned to bins based on the positions of vertices. Let  $D_{ij}$  denote the density of bin  $B_{ij} \in B$ , defined as the total cell area assigned to bin  $B_{ij}$  divided by its capacity. Vertex overlap is limited implicitly by satisfying  $D_{ij} \leq K$ ,  $\forall B_{ij} \in B$ , for some  $K \leq 1$  (density target).

#### Problem 1 (Circuit Placement).

INPUT: Circuit hypergraph  $G_h(V_h, E_h)$  and a fixed outline for the placement area. OUTPUT: Positions for each vertex  $v_i \in V_h$  such that (1) wire length is minimized and (2) the area-density constraints  $D_{ij} \leq K$  are satisfied for all  $B_{ij} \in B$ .

### 2 KEY RESULTS

An unconstrained optimal position of a single placeable vertex connected to fixed vertices can be found in linear time as the median of adjacent positions [7]. Unconstrained HPWL minimization for multiple placeable vertices can be formulated as a linear program [8,11]. For each  $e_k \in E_h$ , upper and lower bound variables  $U_k$  and  $L_k$  are added. The cost of  $e_k$  (x-direction only) is the difference between  $U_k$  and  $L_k$ . Each  $U_k(L_k)$  comes with  $p_k$  inequality constraints that restricts its value to be larger (smaller) than the position of every vertex  $i \in C_k$ . A hypergraph with n vertices and m hyperedges is represented by a linear program with n + 2m variables and 2P constraints.

Linear programming has poor scalability and integrating constraint-tracking into optimization is difficult. Other approaches include non-linear optimization and partitioning-based methods.

#### 2.1 Combinatorial techniques for wire length minimization

The no-overlap constraints are not convex and cannot be directly added to the linear program for HPWL minimization. Such a program is first solved directly or by casting its dual as an instance of the min-cost max-flow problem [12]. Vertices often cluster in small regions of high density. One can lower-bound the distance between closely-placed vertices with a single linear constraint that depends on the relative placement of these vertices [11]. The resulting optimization problem is incrementally re-solved, and the process repeats until the desired density is achieved.

The min-cut placement technique is based on balanced min-cut partitioning of hypergraphs and is more focused on density constraints [10]. Vertices of the initial hypergraph are first partitioned in two similar-sized groups. One of them is assigned to the left half of the placement region, and the other one to the right half. Partitioning is performed by the Multi-level Fiduccia-Mattheyses (MLFM) heuristic [9] to minimize connections between the two groups of vertices (the net-cut objective). Each half is partitioned again, but takes into account the connections to the other half [10]. At the large scale, ensuring the similar sizes of bi-partitions corresponds to density constraints and cut minimization corresponds to HPWL minimization. When regions become small and contain < 10 vertices, optimal positions can be found with respect to discrete slot constraints by branch-and-bound [2]. Balanced hypergaph partitioning is NP-hard [3], but the MLFM heuristic takes  $O((V + E) \log V)$  time. The entire min-cut placement procedure takes  $O((V + E)(\log V)^2)$ time and can process hypergraphs with millions of vertices in several hours.

A special case of interest is that of one-dimensional placement. When all vertices have identical width and none of them are fixed, one obtains the NP-hard MINIMUM LINEAR ARRANGEMENT problem [3] which can be approximated in polynomial time within  $O(\log V)$  and solved exactly for trees in  $O(V^3)$  time as shown by Yannakakis. The min-cut technique described above also works well for the related NP-hard MINIMUM-CUT LINEAR ARRANGEMENT problem [3].

#### 2.2 Nonlinear optimization

Quadratic and generic non-linear optimization may be faster than linear programming, while reasonably approximating the original formulation. The hypergraph is represented by a weighted graph where  $w_{ij}$  represents the weight on the 2-pin edge connecting vertices  $v_i$  and  $v_j$  in the weighted graph. When an edge is absent,  $w_{ij} = 0$ , and in general  $w_{ii} = -\sum_{i \neq j} w_{ij}$ .

**Quadratic placement:** A quadratic placement (*x*-direction only) is given by

$$\Phi(x) = \sum_{i,j} w_{ij} \left[ (x_i - x_j)^2 \right] = \frac{1}{2} \mathbf{x}^T \mathbf{Q} \mathbf{x} + \mathbf{c}^T \mathbf{x} + \text{const.}$$
(1)

The global minimum of  $\Phi(x)$  is found by solving  $\mathbf{Qx} + \mathbf{c} = \mathbf{0}$  which is a sparse, symmetric positive definite system of linear equations (assuming  $\geq 1$  fixed vertex), efficiently solved to sufficient accuracy using any number of iterative solvers. Quadratic placement may have different optima depending on the model (clique or star) used to represent hyperedges. However, for a k-pin hyperedge, if the weight on the 2-pin edges introduced is set to  $W_c$  in the clique mode and  $kW_c$  in the star model, then the models are equivalent in quadratic placement [8].

**Linearized quadratic placement:** Quadratic placement can produce lower quality placements. To approximate the linear objective, one can iteratively solve Equation 1 with  $w_{ij} = 1/|x_i - x_j|$  computed at every iteration. Alternatively, one can solve a single  $\beta$ -regularized optimization problem given by  $\Phi^{\beta}(\mathbf{x}) = \min_x \sum_{i,j} w_{ij} \sqrt{(x_i - x_j)^2 + \beta}$ ,  $\beta > 0$ , e.g., using a Primal-Dual Newton method with quadratic convergence [1].

Half-perimeter wire length placement: HPWL can be provably approximated by strictly convex and differentiable functions. For 2-pin hyperedges,  $\beta$ -regularization can be used [1]. For an m-pin hyperedge ( $m \geq 3$ ), one can rewrite HPWL as the maximum ( $l_{\infty}$ -norm) of all m(m-1)/2 pairwise distances  $|x_i - x_j|$  and approximate the  $l_{\infty}$ -norm by the  $l_p$ -norm (p-th root of the sum of p-th powers). This removes all non-differentiabilities except at **0** which is then removed with  $\beta$ -regularization. The resulting HPWL approximation is given by

$$\mathrm{HPWL}_{p-\beta-reg}(G_h) = \sum_{e_k \in E_h} \left( \sum_{i,j \in C_k} |x_i - x_j|^p + \beta \right)^{1/p}$$
(2)

which overestimates HPWL with arbitrarily small relative error as  $p \to \infty$  and  $\beta \to 0$  [8]. Alternatively, HPWL can be approximated via the log-sum-exp formula given by

$$\mathrm{HPWL}_{log-sum-exp}(G_h) = \alpha \sum_{e_k \in E_h} \left[ \ln(\sum_{i \in C_k} \exp(\frac{x_i}{\alpha})) + \ln(\sum_{v_i \in C_k} \exp(\frac{-x_i}{\alpha})) \right]$$
(3)

where  $\alpha > 0$  is a smoothing parameter [6]. Both approximations can be optimized using conjugate gradient methods.

#### 2.3 Analytic techniques for target density constraints

The target density constraints are non-differentiable and are typically handled by approximation.

**Force-based spreading:** The key idea is to add constant forces **f** that pull vertices always from overlaps, and recompute the forces over multiple iterations to reflect changes in vertex distribution. For quadratic placement, the new optimality conditions are  $\mathbf{Qx} + \mathbf{c} + \mathbf{f} = \mathbf{0}$  [7]. The constant force can perturb a placement in any number of ways to satisfy the target density constraints. The force **f** is computed using a discrete version of Poisson's equation.

**Fixed-point spreading:** A fixed point f is a pseudo-vertex with zero area, fixed at  $(x_f, y_f)$ , and connected to one vertex H(f) in the hypergraph through the use of a pseudo-edge with weight  $w_{f,H(f)}$ . Quadratic placement with fixed points is given by  $\Phi(x) = \sum_{i,j} w_{i,j}(x_i - x_j)^2 + \sum_f w_{f,H(f)}(x_{H(f)} - x_f)^2$ . Each each fixed point f introduces a quadratic term  $w_{f,H(f)}(x_{H(f)} - x_f)^2$ . By manipulating the positions of fixed points, one can perturb a placement to satisfy the target density constraints. Fixed points generalize constant forces [5]. Fixed points improve the controllability and stability of a placement iteration.

**Generalized force-directed spreading:** The Helmholtz equation models a diffusion process and makes it ideal for spreading vertices [4]. The Helmholz equation is given by

$$\frac{\partial^2 \phi(x,y)}{\partial x^2} + \frac{\partial^2 \phi(x,y)}{\partial y^2} - \epsilon \phi(x,y) = D(x,y), \quad (x,y) \in \mathbf{R}$$

$$\frac{\partial \phi}{\partial y} = 0, \qquad (x,y) \text{ on the boundary of } R$$
(4)

where  $\epsilon > 0$ , v is an outer unit normal, R represents the fixed outline, and D(x, y) represents the continuous density function. The boundary conditions,  $\frac{\partial \phi}{\partial v} = 0$ , specify that forces pointing outside of the fixed outline be set to zero —this is a key difference with the Poisson method which assumes that forces become zero at infinity. The value  $\phi_{ij}$  at the center of each bin  $B_{ij}$  is found by discretization of Equation 4 using finite differences. The density constraints are replaced by  $\phi_{ij} = \hat{K}, \forall B_{ij} \in B$  where  $\hat{K}$  is a scaled representative of the density target K. Wire length minimization subject to the smoothed density constraints can be solved via Uzawa's algorithm. For quadratic wire length, this algorithm is a generalization of force-based spreading.

**Potential function spreading:** Target density constraints can also be satisfied via a penalty function. The area assigned to bin  $B_{ij}$  by vertex  $v_i$  is represented by Potential $(v_i, B_{ij})$  which is a bell-shaped function. The use of piecewise quadratic functions make the potential function non-convex, but smooth and differentiable [6]. The penalty term given by

$$Penalty = \sum_{B_{ij} \in B} \left( \sum_{v_i \in V_h} Potential(v_i, B_{ij}) - K \right)^2$$
(5)

can be combined with a wire length approximation to arrive at an unconstrained optimization problem which is solved using an efficient conjugate gradient method [6].

### 3 APPLICATIONS

Practical applications involve more sophisticated interconnect objectives, such as circuit delay, routing congestion, power dissipation, power density, and maximum thermal gradient. The above techniques are adapted to handle multi-objective optimization. Many such extensions are based on heuristic assignment of net weights that encourage the shortening of some (e.g., timing-critical and frequently-switching) connections at the expense of other connections. To moderate routing congestion, predictive congestion maps are used to decrease the maximal density contraint for placement in congested regions. Another application is in physical synthesis, where incremental placement is used to evaluate changes in circuit topology.

### 4 EXPERIMENTAL RESULTS

Circuit placement has been actively studied for the past 30 years and a wealth of experimental results are reported throughout the literature. A 2003 result demonstrated that placement tools could produce results as much as  $1.41 \times$  to  $2.09 \times$  known optimal wire lengths on average (advances

have been made since this study). A 2005 placement contest found that a set of tools produced placements with wire lengths that differed by as much as  $1.84 \times$  on average. A 2006 placement contest found that a set of tools produced placements that differed by as much as  $1.39 \times$  on average when the objective was the simultaneous minimization of wire length, routability and run time. Placement run times range from minutes for smaller instances to hours for larger instances.

## 5 DATA SETS

Benchmarks include the ICCAD '04 suite (http://vlsicad.eecs.umich.edu/BK/ICCAD04bench/), the ISPD '05 suite (http://www.sigda.org/ispd2005/contest.htm) and the ISPD '06 suite (http://www.sigda.org/ispd2006/contest.htm). Instances in these benchmark suites contain between 10K to 2.5M placeable objects. Other common suites can be found, including large scale placement problems with known optimal solutions (http://cadlab.cs.ucla.edu/~pubbench).

## 6 CROSS REFERENCES: Floorplanning and Circuit Partitioning

## 7 RECOMMENDED READING

- ALPERT, C. J., CHAN, T., KAHNG, A. B., MARKOV, I. L. AND MULET, P., Faster minimization of linear wirelength for global placement. *IEEE Trans. CAD* 17(1), (1998), 3–13.
- [2] CALDWELL, A. E., KAHNG, A. B. AND MARKOV, I. L., Optimal partitioners and end-case placers for standard-cell layout *IEEE Trans. CAD*, 19(11), (2000), 1304–1314.
- [3] CRESCENZI, P. AND KANN, V., A compendium of NP optimization problems Springer, 1998.
- [4] CHAN, T., CONG, J. AND SZE, K., Multilevel generalized force-directed method for circuit placement. In *Proc. Intl. Symp. Physical Design* (2005), 185–192.
- [5] HU, B. AND MAREK-SADOWSKA, M., Multilevel fixed-point-addition-based VLSI placement. *IEEE Trans. CAD* 24(8), (2005), 1188–1203.
- [6] KAHNG, A. B. AND WANG, Q., Implementation and extensibility of an analytic placer. IEEE Trans. CAD 24(5), (2005), 734–747.
- [7] KENNINGS, A. AND VORWERK, K., Force-directed methods for generic placement. *IEEE Trans. CAD* 25(10), (2006), 2076–2087.
- [8] KENNINGS, A. AND MARKOV, I. L., Smoothing max-terms and analytical minimization of half-perimeter wirelength. VLSI Design 14(3), (2002), 229–237.
- [9] PAPA, D. A. AND MARKOV, I. L., Hypergraph partitioning and clustering in *Approximation* algorithms and metaheuristics, T. Gonzalez, ed.; CRC Press, 2007.
- [10] ROY, J. A., ADYA, S. N., PAPA, D. A. AND MARKOV, I. L., Min-cut floorplacement *IEEE Trans. CAD*, 25(7), (2006), 1313–1326.
- [11] REDA, S., AND CHOWDHARY, A., Effective linear programming based placement methods. Intl. Symp. on Physical Design, 2006, 186–191.
- [12] TANG, X., TIAN, R., AND WONG, M. D. F., Optimal redistribution of white space for wire length minimization. In Proc. Asia South Pac. Design Autom. Conf., 2005, 412–417.