Distributed stochastic power control in ad hoc networks: a nonconvex optimization case

Yang, Lei; Sagduyu, Yalin E; Zhang, Junshan; Li, Jason H

doi:10.1186/1687-1499-2012-231

Research
Open access
Published: 24 July 2012

Distributed stochastic power control in ad hoc networks: a nonconvex optimization case

Lei Yang¹,
Yalin E Sagduyu²,
Junshan Zhang¹ &
…
Jason H Li²

EURASIP Journal on Wireless Communications and Networking volume 2012, Article number: 231 (2012) Cite this article

2451 Accesses
5 Citations
Metrics details

Abstract

Signal-to-interference-plus-noise-based power allocation in wireless ad hoc networks is inherently a nonconvex optimization problem because of the global coupling induced by the co-channel interference. To tackle this challenge, we first show that the globally optimal point lies on the boundary of the feasible region. This property is utilized to transform the utility maximization problem into an equivalent max–min problem with more structure. By using extended duality theory, penalty multipliers are introduced for penalizing the constraint violations, and the minimum weighted utility maximization problem is then decomposed into subproblems for individual users to devise a distributed stochastic power control algorithm, where each user stochastically adjusts its target utility to improve the total utility by simulated annealing (SA). The proposed distributed power control algorithm can guarantee global optimality at the cost of slower convergence due to SA involved in the global optimization. The geometric cooling scheme with suitable choice of penalty parameters is then used to improve the convergence rate. Next, by integrating the stochastic power control approach with the back-pressure algorithm, we develop a joint scheduling and power allocation policy to stabilize the queueing systems under random packet traffic. Finally, we generalize the above distributed power control algorithms to multicast communications, and show their global optimality for multicast traffic.

Introduction

The broadcast nature of wireless transmissions makes wireless networks susceptible to interference, which deteriorates quality of service (QoS) provisioning. Power control is considered as a promising technique to mitigate interference. One primary objective of power control is to maximize the system utility that can achieve a variety of fairness objectives among users[1–4]. However, maximizing the system utility, under the physical interference model, often involves nonconvex optimization and it is known to be NP-hard, due to the complicated coupling among users through mutual interference effects[5].

Due to the nonconvex nature of the power control problem, it is challenging to find the globally optimal power allocation in a distributed manner. Notably, the authors of[6–9] devised distributed power control algorithms to find power allocations that can only satisfy the local optimality conditions, but global optimality could not be guaranteed in general, except for some special convexifiable cases (e.g., with strictly increasing log-concave utility functions). Another thread of work applied game-theoretic approaches to power control by treating it as a noncooperative game among transmitters[10, 11]. However, distributed solutions that converge to a Nash equilibrium may be suboptimal in terms of maximizing the total system utility. Different from these approaches, the authors of[12] transformed the power control problem into a DC (difference of convex functions) optimization problem[13]. Then, the global optimal solution can be solved in a centralized manner with the branch-and-bound algorithm. Recent study[14] proposed a globally optimal power control scheme, named MAPEL, by exploiting the monotonic nature of the underlying optimization problem. However, the complexity and the centralized nature of MAPEL hinder its applicability in practical scenarios, and thus it can be treated rather as a benchmark for performance evaluation in distributed networks.

To find the globally optimal power allocation in a distributed setting, recent study[15] has proposed the SEER algorithm based on Gibbs sampling[16], which can approach the globally optimal solution in an asymptotic sense when the control parameter in Gibbs sampling tends to infinity. Notably, for each iteration in the SEER algorithm, each user utilizes Gibbs sampling to compute its transition probability distribution for updating its transmission power, where the requirement for message passing and computing the transition probability distribution in each iteration can be demanding when applied to ad hoc communications without centralized control.

A challenging task in distributed power control in ad hoc networks is to reduce the amount of message passing while preserving the global optimality. To tackle this challenge, we first show that the globally optimal point lies on the boundary of the feasible region. This property is utilized to transform the utility maximization problem into an equivalent max–min problem with more structure, which can be solved by combining recent advances in extended duality theory (EDT)[17] with simulated annealing (SA)[18]. Compared with the classical duality theory with nonzero duality gap for nonconvex optimization problems, EDT can guarantee zero duality gap between the primal and dual problems by utilizing nonlinear Lagrangian functions. This property allows for solving the nonconvex problem by its extended dual while preserving the global optimality with distributed implementation. Furthermore, as will be shown in Section “Power control for unicast communications”, for the subproblem of each individual user, the extended dual can then be solved through stochastic search with SA. In particular, we first transform the original utility maximization problem into an equivalent max–min problem. This step is based on the key observation that in the case with continuous and strictly increasing utility functions, the globally optimal solution is always on the boundary of the feasible (utility) region. Then, appealing to EDT and SA, we develop a distributed stochastic power control (DSPC) algorithm that stochastically searches for the optimal power allocation in the neighborhood of the feasible region’s boundary, instead of bouncing around in the entire feasible region.

Specifically, we first show that DSPC can achieve the global optimality in the underlying nonconvex optimization problem, although the convergence rate can be slow (but this is clearly due to the slow convergence nature of SA with logarithmic cooling schedule). Then, to improve the convergence rate of DSPC, we propose an enhanced DSPC (EDSPC) algorithm that employs the geometric cooling schedule[19] and performs a careful selection of penalty parameters. As a benchmark for performance evaluation, we also develop a centralized algorithm to search for the globally optimal solution over simplices that cover the feasible region. The performance gain is further verified by comparing our distributed algorithms with MAPEL[14], SEER[15], and ADP[6] algorithms. Worth noting is that the proposed DSPC and EDSPC algorithms do not require any knowledge of channel gains, which is typically needed in existing algorithms, and instead they need only the standard feedback of signal-to-interference-plus-noise (SINR) for adaptation.

Next, we integrate the proposed distributed power control approach with the back-pressure algorithm[20] and devise a joint scheduling and power allocation policy for improving the queue stability in the presence of dynamic packet arrivals and departures. This policy fits into the dynamic back-pressure and resource allocation framework and enables distributed utility maximization under stochastic packet traffic[21, 22]. Then, we generalize the study to consider multicast communications, where a single transmission may simultaneously deliver packets to multiple recipients[23, 24]. Specifically, we extend DSPC and EDSPC algorithms to multicast communications with distributed implementation, and show that these algorithms can also achieve the global optimality in terms of jointly maximizing the minimum rates on bottleneck links in different multicast groups.

The rest of the article is organized as follows. In the following section, we first introduce the system model, establish the equivalence between the utility maximization problem and its max–min form, and then develop both centralized and distributed algorithms for the max–min problem. Next, building on these power control algorithms, we develop in Section “Joint scheduling and power control for stability of queueing systems” a joint scheduling and power allocation policy to stabilize queueing systems. The generalization to multicast communications is presented in Section “Power control for multicast communications”. We conclude the article in “Conclusion” Section.

Power control for unicast communications

System model

We consider an ad hoc wireless network with a set $L = {1, \dots, L}$ of links, where the channel is interference-limited, and all L links treat interference as noise, as illustrated in Figure1. Such a model of communication is readily applicable to cellular networks[1]. Each link consists of a dedicated transmitter-receiver pair.^a We denote by h_lk the fixed channel gain between user l’s transmitter and user k’s receiver, and by p_l the transmission power of link l with $P_{l}^{max}$ being its maximum power constraint. It follows that the received SINR for the l th user with a matched filter receiver is given by

γ_{l} (p) = \frac{h_{ll} p_{l}}{n_{l} + \sum_{k \neq l} h_{kl} p_{k}},

(1)

where p = (p₁,…,p_L) is a vector of the users’ transmission powers and n_lis the noise power. Accordingly, the l th user receives the utility U_l(γ_l), where U_l(·) is continuous and strictly increasing. We assume that each user l’s utility is zero when γ_l= 0, i.e., U_l(0) = 0. For ease of reference, the key notation of this article is listed in Table1.^b

Table 1 Summary of the notations and definitions

Full size table

Network utility maximization

We seek to find the optimal power allocation p^∗ that maximizes the overall system utility subject to the individual power constraints, given by the following optimization problem^c:

\begin{array}{l} maximize \sum_{l \in L} U_{l} (γ_{l} (p)) \\ subject to 0 \leq p_{l} \leq P_{l}^{max}, \forall l \in L \\ variables {p} . \end{array}

(2)

In general, (2) is a nonconvex optimization problem.^d In particular, if the utility function is the Shannon rate achievable over Gaussian flat fading channels, namely $U_{l} (γ_{l} (p)) = w_{l} log (1 + γ_{l} (p))$ , where w_l> 0 is a weight associated with user l, (2) boils down to the weighted sum rate maximization problem, which is known to be nonconvex and NP-hard[5]. Note that the weights in (2) can serve as the fairness measures[25] for different scenarios. In particular, in queueing systems, packet queues for arrival rates within the stability region can be stabilized by solving this weighted sum rate maximization problem, where the instantaneous queue lengths are chosen as the weights. In Section “Joint scheduling and power control for stability of queueing systems”, we will discuss how to stabilize the packet queues by integrating our distributed power control algorithms with the back-pressure algorithm.

Let $F$ denote the feasible utility region, where for each point U = (U₁,…,U_L) in $F$ , there exists a power vector p such that U_l= U_l(γ_l(p)) for all $l \in L$ . The feasible utility region $F$ is nonconvex, and in general, finding the globally optimal solution to (2) in $F$ is challenging. In the following example, we illustrate the geometry of $F$ for the utility function $U_{l} (γ_{l} (p)) = w_{l} log (1 + γ_{l} (p))$ and evaluate the solutions to (2) given by some existing power control approaches discussed in Section “Introduction”.

Example

For the case with two links, Figure2 illustrates the nonconvex feasible utility region $F$ for different system parameters. We compare the performance of the existing approaches[3, 6, 14, 15] in Table2.

Table 2 The performance of the existing approaches for Case I and II

Full size table

Remarks

The solutions to (2) given by the authors of[3, 6, 14] are either distributed but suboptimal or optimal but centralized. In particular, Chiang et al.[3] solve (2) by using geometric programming (GP) under the high-SINR assumption, which yields a suboptimal solution to (2) when this assumption does not hold (e.g., this is the case in the example above). The ADP algorithm[6] can guarantee only local optimality^e in a distributed manner. The MAPEL algorithm[14] can achieve the globally optimal solutions but it is centralized with high computational complexity. Compared with these algorithms, the SEER algorithm[15] can guarantee global optimality in a distributed manner but message passing needed in each iteration can be demanding, i.e., each link needs the knowledge of the channel gains, the receiver SINR and the signal power of all the other links. It is worth noting that the performance of SEER hinges heavily on the control parameter that can be challenging to choose on the fly.

From network utility maximization to minimum weighted utility maximization

In order to devise low-complexity distributed algorithms that can guarantee global optimality, we first study the basic properties for the solutions to (2), before transforming (2) into a more structured max–min problem.

Lemma 1

The optimal solution to (2) is on the boundary of the feasible utility region $F$ .

Proof

Let U^∗ denote a globally optimal solution to (2) over $F$ , and γ^∗ denote the corresponding SINR that supports U^∗. Since U_l(·) is continuous and strictly increasing, proving that U^∗ is on the boundary of $F$ is equivalent to showing that γ^∗ is on the boundary of the feasible SINR region. Suppose that γ^∗ is not on the boundary of the feasible SINR region, which indicates that there exists some point $\hat{γ}$ such that ${\hat{γ}}_{l} \geq γ_{l}^{*}$ for all $l \in L$ and ${\hat{γ}}_{l^{'}} > γ_{l^{'}}^{*}$ for some $l^{'} \in L$ . Since U_l(·) for any $l \in L$ is strictly increasing in γ_l, we have $U_{l} ({\hat{γ}}_{l}) \geq U_{l} (γ_{l}^{*})$ for all $l \in L$ and $U_{l} ({\hat{γ}}_{l^{'}}) > U_{l} (γ_{l^{'}}^{*})$ for some $l^{'} \in L$ , which contradicts the fact that γ^∗ is a globally optimal solution. Hence, Lemma 1 follows. □

Based on Lemma 1, if we can characterize the boundary of $F$ , then it is possible to solve (2) efficiently. Thus motivated, we first establish, by introducing a “contribution weight” for each user, the equivalence between (2) and the minimum weighted utility maximization problem.

Lemma 2

Problem (2) is equivalent to the following minimum weighted utility maximization:

\begin{array}{l} maximize & min_{l \in L} \frac{U_{l} (γ_{l} (p))}{x_{l}} \\ subject to & 0 \leq p_{l} \leq P_{l}^{max}, \forall l \in L \\ 0 \leq x_{l} \leq 1, \forall l \in L \\ \sum_{l \in L} x_{l} = 1 \\ variables & {p, x} . \end{array}

(3)

Proof

Let $t = \sum_{l \in L} U_{l} (γ_{l} (p))$ denote the total utility. Since U_l(·) is nonnegative, we define x_l∈[0, 1] as a ratio for the contribution of user l’s utility to t. Therefore, U_l(γ_l(p)) = t x_land $\sum_{l \in L} x_{l} = 1$ . Then (2) can be rewritten as

\begin{array}{l} maximize & t \\ subject to & t = \frac{U_{l} (γ_{l} (p))}{x_{l}}, \forall l \in L \\ 0 \leq p_{l} \leq P_{l}^{max}, \forall l \in L \\ 0 \leq x_{l} \leq 1, \forall l \in L \\ 0 \leq t, \sum_{l \in L} x_{l} = 1 \\ variables & {p, x, t} . \end{array}

(4)

In order to maximize t, it suffices then to relax $t = \frac{U_{l} (γ_{l} (p))}{x_{l}}$ in (4) as $t \leq \frac{U_{l} (γ_{l} (p))}{x_{l}}, \forall l \in L$ , which is equivalent to $t \leq min_{l \in L} \frac{U_{l} (γ_{l} (p))}{x_{l}}$ . Therefore, (4) can be treated as the hypograph form of (3), i.e., (4) and (3) are equivalent[26], thereby concluding the proof. □

By transforming (2) to this more structured max–min problem (3), the problem is reduced to finding a globally optimal x^∗, given which we can efficiently obtain a globally optimal solution, i.e., the tangent point of the hyperplane and $F$ , as illustrated in Figure3. Intuitively speaking, x represents a search direction. Once we find the best search direction x^∗p^∗ can be obtained efficiently by searching along the direction of x^∗. Actually, for given x, (3) is quasi-convex.^f By introducing an auxiliary variable t, we obtain the following equivalent formulation:

\begin{array}{l} maximize & t \\ subject to & U_{l}^{- 1} (t x_{l}) (n_{l} + \sum_{k \neq l} h_{kl} p_{k}) \leq h_{ll} p_{l} \\ 0 \leq p_{l} \leq P_{l}^{max}, \forall l \in L, 0 \leq t \\ variables & {p, t}, \end{array}

(5)

which can be solved in polynomial time through binary search on t[26]. However, the optimal search direction x^∗ is difficult to find due to the nonconvex nature of the problem. In the following section, we study how to find the globally optimal search direction x^∗.

Centralized versus distributed algorithms

In this section, we study algorithms achieving global optimality for (3). First, we propose a centralized algorithm for (3), which will serve as a benchmark for performance comparison. Then, by using EDT and SA, we propose a distributed algorithm, DSPC, for the problem (3). Building on this, we propose an EDSPC algorithm to improve the convergence rate of DSPC.

A centralized algorithm

Based on Lemmas 1 and 2, we develop a centralized algorithm (Algorithm 1) to solve the max–min optimization problem (3) under consideration. Roughly speaking, by dividing the simplex $S = {x | \sum_{l \in L} x_{l} = 1, 0 \leq x_{l} \leq 1, \forall l \in L}$ into many small simplices, the algorithm can find the optimal point on the boundary of $F$ . Figure4 illustrates how the simplex cutting is performed for the case with three links. Compared with the MAPEL algorithm[14], Algorithm 1 directly computes the points on the boundary, instead of constructing a series of polyblocks to approximate the boundary of the feasible region.

Algorithm 1

Initialization: Choose the approximation factor ε > 0, and construct the initial simplex $S$ with the vertex set V = {v₁,…,v_L}, where v_l= e_l and e_l is the l th unit coordinate vector. Let $v_{c} = \frac{1}{L} \sum_{l \in L} v_{l}$ be the center of S. Compute p^∗ by solving (5) at the point x = v_c. Denote $δ (S) = max_{v \in V} | v_{c} - v |$ as the diameter of $S$ .

Repeat

1.
Divide each simplex $S_{i}$ by using bisection method, which chooses the midpoint of one of the longest edges of the simplex $S_{i}$ , i.e., $v_{m} = \frac{1}{2} (v_{r} + v_{s})$ , where v _rand v _sare the end points of a longest edge of the simplex. In this case, the simplex $S_{i}$ is subdivided into two simplices $S_{i_{1}}$ and $S_{i_{2}}$ .
2.
For each new simplex $S_{i_{j}}$ , compute the diameter $δ (S_{i_{j}})$ and p ^∗ by solving (5) at x given by the center point of the simplex.
3.
Find the current best solution to (3) and the maximal diameter δ ^min these new subdivided simplices.

Until δ^m< ε.

Proposition 1

Algorithm 1 converges monotonically to the globally optimal solution to (3) as the approximation factor ε approaches zero.

Proof

For given ε, Algorithm 1 divides the simplex $S = {x | \sum_{l \in L} x_{l} = 1, 0 \leq x_{l} \leq 1, \forall l \in L}$ until the maximal diameter of the subdivided simplices δ^mis less than ε. Let d(ε) denote the maximum distance between the optimal solution x^∗ and the center point of the simplex that contains x^∗. Obviously, d(ε) is bounded by δ^m. Since δ^m decreases with the decreasing of ε, therefore d(ε) decreases monotonically with the decreasing of ε, i.e., the solution given by Algorithm 1 monotonically converges to x^∗. As ε approaches zero, Algorithm 1 exhaustively searches every point in the simplex $S$ , thereby concluding the proof. □

Remarks

Algorithm 1 can be used to obtain an ε-optimal solution with |x − x^∗| ≤ ε. That is to say, by controlling ε, one can strike a balance between the optimality and the computation time. Since finding the globally optimal solution requires centralized implementation, Algorithm 1 will be used only as a benchmark for performance evaluation of distributed algorithms.

DSPC algorithm

Next, we devise a DSPC algorithm based on EDT[17] and SA[18]. Compared to the classical duality theory with nonzero duality gap for nonconvex optimization problems, EDT can guarantee zero duality gap between the primal and dual problems by utilizing nonlinear Lagrangian functions. This property allows for solving the nonconvex problem by its extended dual while preserving the global optimality in distributed implementation. To this end, we first introduce auxiliary variables and use EDT to transform (3) with the auxiliary variables into an unconstrained problem. Then, we solve the unconstrained problem by using the SA mechanism. Specifically, we define $t_{l} = \frac{U_{l} (γ_{l} (p))}{x_{l}}$ and rewrite (3) as

\begin{array}{l} minimize & - min_{l \in L} t_{l} \\ subject to & t_{l} x_{l} \leq U_{l} (γ_{l} (p)), \forall l \in L \\ \sum_{l \in L} x_{l} = 1 \\ 0 \leq p_{l} \leq P_{l}^{max}, \forall l \in L \\ 0 \leq t_{l}, 0 \leq x_{l} \leq 1, \forall l \in L \\ variables & {p, x, t} . \end{array}

(6)

Next, we use EDT to write the Lagrangian function for (6) as

\begin{align} L (p, x, t, α, β) & = & - min_{l \in L} t_{l} + α | \sum_{l \in L} x_{l} - 1 | \\ + \sum_{l \in L} β_{l} {(t_{l} x_{l} - U_{l} (γ_{l} (p)))}^{+}, \end{align}

(7)

where ${(y)}^{+} = max (0, y)$ , and $α \in R$ and $β \in R^{L}$ are the penalty multipliers for penalizing the constraint violations. Based on EDT[17], there exist finite α^∗ ≥ 0 and $β_{l}^{*} \geq 0$ , for all $l \in L$ , such that, for any α > α^∗ and $β_{l} > β_{l}^{*}$ $\forall l \in L$ , the solution to (7) is the same as (6). Note that (7) does not include the constraints of p_l, x_l, and t_lfor each user, and there will be no constraint violation when each user updates these variables locally. Therefore, the minimization of (7) with respect to the primal variables (p, x, and t) can be carried out individually by each user in a distributed fashion.

The next key step is to perform a stochastic local search by each user based on SA. Let t_l, x_l, and p_l denote the primal values of the l th user, and $t_{l}^{'}$ and $x_{l}^{'}$ denote the new values randomly chosen by the l th user. Accordingly, $t_{l}^{'} x_{l}^{'}$ can be treated as a new target utility for the l th user. To achieve this target utility, the l th user updates $p_{l}^{'}$ by

p_{l}^{'} = min (\frac{U_{l}^{- 1} (t_{l}^{'} x_{l}^{'})}{γ_{l}} p_{l}, P_{l}^{max}),

(8)

where γ_l is the current SINR measured at the l th user’s receiver. Note that (8) does not need any information of channel gains, except the feedback of SINR γ_l. Since (8) corresponds to the distributed power control algorithm of standard form as described in[27],^g it converges geometrically fast to the target utility. Thus, we assume that each user l updates p_l at a faster time-scale than t_l and x_l such that p_l always converges before the next update of t_l and x_l. Next, we use SA to update t_l and x_l in a stochastic operation. By using the analogy with annealing in metallurgy, SA was proposed in[18] to mimic the behavior of the microscopic constituents in heating and controlled cooling of a material. By allowing occasional uphill moves, SA is able to escape from the local optimal points. In particular, let Δ denote the difference between L(p_l, x_l, t_l|p_−l, x_−l, t_−l, α, β) and $L (p_{l}^{'}, x_{l}^{'}, t_{l}^{'} | p_{- l}^{'}, x_{- l}, t_{- l}, α, β)$ , where y_−l is the vector y without the l th user’s variable. If Δ ≥ 0, i.e., $t_{l}^{'}$ , $x_{l}^{'}$ and $p_{l}^{'}$ reduce Lagrangian (7), then they are accepted with probability 1; otherwise, they are accepted with probability $exp (\frac{Δ}{T})$ , where T is a control parameter and sometimes it is called temperature. Note that, as T decreases, the acceptance of uphill move becomes less and less probable, and therefore a fine-grained search is needed. It has been shown that, as T approaches 0 according to a logarithmic cooling schedule, SA converges to a globally optimal point[16, 19]. To compute Δ locally by each user l, user l needs to broadcast the terms t_l, x_l and $β_{l} {(t_{l} x_{l} - U_{l} (γ_{l} (p)))}^{+}$ , whenever any of these terms changes.

Note that the target utility t_lx_l may not be feasible, i.e., the target utility cannot be achieved even though the user transmits at the maximum power. In this case, it can be shown that the power of those users with feasible target utilities will converge to a feasible solution, whereas the other users that cannot achieve the target utility will continue to transmit at maximum power[1]. If some target utility is not feasible as T tends to 0, based on EDT, the current values of α and β do not satisfy α > α^∗ or $β_{l} > β_{l}^{*}$ for all $l \in L$ . Therefore, each user l also needs to update α and β_l. In particular, if any constraint is violated, α and β_l are updated as follows:

\begin{align} α \leftarrow α + σ | \sum_{l \in L} x_{l} - 1 |, \\ β_{l} \leftarrow β_{l} + ϱ_{l} {(t_{l} x_{l} - U_{l} (γ_{l}))}^{+}, \forall l \in L, \end{align}

(9)

where σ and ϱ_l are used to control the rate of updating α and β_l. A detailed description of DSPC algorithm is given in Algorithm 2.

Remarks:

(1)
In Algorithm 2, each user randomly picks $t_{l}^{'} \in [U_{l}^{max}, U_{tot}^{max}]$ , where $U_{l}^{max}$ denotes the maximum utility of the user l when the user l transmits at the maximum power while the other users do not transmit, and $U_{tot}^{max} = \sum_{l \in L} U_{l}^{max}$ . Note that $U_{l}^{max}$ can be computed by each user locally. Further, we assume that each user broadcasts $U_{l}^{max}$ before running the algorithm so that $U_{tot}^{max}$ is also known by each user.
(2)
In practice, after initialization, α and β _lincrease in proportion to the violation of the corresponding constraint, which may lead to excessively large penalty values. Since it is beneficial to periodically scale down the penalty values to ease the unconstrained optimization, α and β _lare scaled down by multiplying with a random value (it can be chosen between 0.7 and 0.95 according to [17]), if the penalty decrease condition is satisfied, i.e., the maximum violation of constraints is not decreased after consecutively running Step 1 in Algorithm 2 several times, e.g., five times in [17].
(3)
In Algorithm 2, each user requires the knowledge of T and time epochs {τ ₁, τ ₂,…} to update t _land x _l, which can be determined and informed to each user offline.

Algorithm 2 DSPC

Initialization: Choose ε > 0. Let α = 0, β_l= 0, $\forall l \in L$ , and randomly choose p, x and t.

Step 1: update primal variables Set T = T₀, and select a sequence of time epochs {τ₁, τ₂,…} in continuous time.

Repeat for each user l

1.
Randomly pick $t_{l}^{'} \in [U_{l}^{max}, U_{tot}^{max}]$ and $x_{l}^{'} \in [0, 1]$ , and update $p_{l}^{'}$ according to (8).
2.
Keep sensing the change of $β_{l} {(t_{l} x_{l} - U_{l} (γ_{l} (p)))}^{+}$ broadcast by other users.
3.
Compute Δ, and accept $t_{l}^{'}$ , $x_{l}^{'}$ , and $p_{l}^{'}$ with probability 1, if Δ≥0, or with probability $exp (\frac{Δ}{T})$ , otherwise.
4.
Broadcast $t_{l}^{'}$ and $x_{l}^{'}$ , if $t_{l}^{'}$ and $x_{l}^{'}$ are updated.
5.
For each time epoch τ _i, update $T = T_{0} / log (i + 1)$ .

Until T < ε.

Step 2: update penalty variables

For each user l,

1.
Update α and β _laccording to (9), and scale down α and β _l, if the penalty decrease condition is satisfied.
2.
Go to Step 1 until no constraint is violated.

Proposition 2

The DSPC algorithm (Algorithm 2) converges almost surely to a globally optimal solution to (3), as temperature T in SA decreases to zero.

Proof

To show that Algorithm 2 converges almost surely to a globally optimal solution to (3), we only need to show that when α > α^∗ and $β_{l} > β_{l}^{*}$ for all $l \in L$ , Algorithm 2 can converge almost surely to a globally optimal solution to (6), since (3) is equivalent to (6), and if the solution does not satisfy the constraints of (6), α and β_l will increase in proportion to the violation of the corresponding constraint. Since Algorithm 2 uses SA with the logarithmic cooling schedule, based on[16, 19] it can converge almost surely to a globally optimal solution to (7), which is also a globally optimal solution to (6) based on EDT when α > α^∗ and $β_{l} > β_{l}^{*}$ for all $l \in L$ [17]. Hence, Proposition 2 follows. □

Remarks

The DSPC algorithm can guarantee global optimality in a distributed manner without the need of channel information. In particular, it needs the information of t_l and x_l, and can adapt to channel variations by utilizing the SINR feedback. However, the convergence rate of DSPC is slow due to the use of logarithmic cooling schedule.

EDSPC algorithm

It can be seen from Algorithm 2 that it is critical to find the optimal penalty variables α and β for computing (7). Moreover, a logarithmic cooling schedule is used to ensure convergence to a global optimum. To improve the convergence rate, we propose next an EDSPC algorithm by empirically choosing the initial penalty values α₀ and β₀ and employing a geometric cooling schedule[18], which reduces the temperature T in SA by T = ξT, 0 < ξ < 1, at each time epoch. Compared with the logarithmic cooling schedule, T converges to 0 much faster under the geometric cooling schedule, which in turn improves the convergence rate beyond DSPC. The resulting solution is given in Algorithm 3.

We note that although EDSPC converges much faster than DSPC, it may yield only near-optimal solutions. Based on EDT, we choose $α_{0} > α^{*}$ and $β_{0 l} > β_{l}^{*}, \forall l \in L$ , to satisfy the optimality conditions for penalty variables. Obviously, by choosing large α₀ and β_0l, these conditions can be always satisfied. Nevertheless, very large penalties introduce heavy costs for constraint violations such that EDSPC may end up with a feasible but suboptimal solution. Therefore, the selection of initial penalty values plays a critical role in the performance of EDSPC and deserves more attention in future work. In practice, we can choose the initial penalties based on the maximum value of the constraint that is associated with each of the penalty variables. This choice performs well in the simulations. For example, we can choose $β_{0 l} = U_{l}^{max}$ for the constraint t_lx_l ≤ U_l(γ_l(p)).

Algorithm 3 EDSPC

Initialization: Choose ε > 0. Let α = α₀, β_l= β_0l, $\forall l \in L$ , and randomly choose p, x and t.Set T = T₀, and select a sequence of time epochs {τ₁, τ₂,…} in continuous time.

Repeat for each user l

1.
Randomly pick $t_{l}^{'} \in [U_{l}^{max}, U_{tot}^{max}]$ and $x_{l}^{'} \in [0, 1]$ , and update $p_{l}^{'}$ according to (8).
2.
Keep sensing the change of $β_{l} {(t_{l} x_{l} - U_{l} (γ_{l} (p)))}^{+}$ broadcast by other users.
3.
Compute Δ, and accept $t_{l}^{'}$ , $x_{l}^{'}$ , and $p_{l}^{'}$ with probability 1, if Δ ≥ 0, or with probability $exp (\frac{Δ}{T})$ , otherwise.
4.
Broadcast $t_{l}^{'}$ and $x_{l}^{'}$ , if $t_{l}^{'}$ and $x_{l}^{'}$ are updated.
5.
For each time epoch τ _i, update T = ξT.

Until T < ε.

Performance evaluation

In this section, we evaluate the utility and convergence performance of Algorithms 2 and 3 (DSPC^h and EDSPC). We consider a wireless network with six links randomly distributed on a 10-by-10 m² square area. The channel gains h_lk are equal to $d_{lk}^{- 4}$ , where d_lk represents the distance between the transmitter of user l and the receiver of user k. We assume $U_{l} (γ_{l} (p)) = log (1 + γ_{l} (p))$ , $P_{l}^{max} = 1$ and $n_{l} = 1 0^{- 4}$ for all $l \in L$ , and consider one randomly generated realization of channel gains given by

H = [\begin{array}{c} 0.3318 & 0.0049 & 0.0141 & 0.0021 & 0.0016 & 0.0007 \\ 0.0031 & 0.9554 & 0.0063 & 0.0140 & 0.0012 & 0.0025 \\ 0.0155 & 0.0042 & 0.6166 & 0.0046 & 0.0108 & 0.0018 \\ 0.0017 & 0.2188 & 0.0340 & 0.6754 & 0.0062 & 0.0215 \\ 0.0020 & 0.0017 & 0.2216 & 0.0042 & 0.2955 & 0.0028 \\ 0.0007 & 0.0079 & 0.0254 & 0.2553 & 0.0404 & 0.3025 \end{array}] .

Figure5 shows how the total utility in the EDSPC algorithm converges over time, when we choose all the initial penalty values equal to 10. Also, we choose ξ = 0.9, ρ = 1, and ϱ = 1, and use Algorithm 1 as a benchmark to evaluate the optimal performance. As shown in Figure5, the EDSPC algorithm approaches the optimal utility, when the initial penalty values are carefully chosen. Moreover, the convergence rate of the EDSPC algorithm is much faster than DSPC, since DSPC continues updating the penalty values even after the optimal solution is found for the current penalty values. Figure6 illustrates the average performance (with confidence interval) of DSPC, EDSPC, SEER, and ADP under 100 random initializations, with the same system parameters as used in Figure5. As shown in Figure6, both DSPC and EDSPC are robust against the variations of initial values.

Figures5 and6 compare the proposed algorithms with the SEER and ADP. As mentioned in Section “Introduction”, ADP can only guarantee local optimality. Therefore, for nonconvex problems (e.g., in this example), ADP may converge to a suboptimal solution. As noted in[15], the performance of SEER heavily hinges on the control parameter that can be challenging to choose in online operation. In contrast, DSPC can approach the globally optimal solution regardless of the initial parameter selection, but the convergence rate may be slower. Furthermore, EDSPC improves the convergence rate, but in this case the initial penalty values would impact how close it can approach the optimal point. In terms of message passing, our algorithms do not require individual links to know the channel gains (including its own channel gain), the receiver SINR of the other links and the signal power of the other links, which are all needed in the SEER algorithm.

Joint scheduling and power control for stability of queueing systems

In Section “Power control for unicast communications”, we studied the distributed power allocation, by using DSPC and EDSPC, for utility maximization in the saturated case with uninterrupted packet traffic. In this section, we generalize the study by considering a queueing system with dynamic packet arrivals and departures. Specifically, we develop a joint scheduling and power allocation policy to stabilize packet queues by integrating our power control algorithms with the celebrated back-pressure algorithm[20].

Stability region and throughput optimal power allocation policy

Consider the same wireless network model with L links as in Section “Power control for unicast communications”. We assume that there are S classes of users in the system, and that the traffic brought by users of class s follows ${A_{sl} (t)}_{t = 1}^{\infty}$ , which are i.i.d. sequences of random variables for all l = 1,…,L and s = 1,…,S, where A_sl(t) denotes the amount of traffic generated by users of class s that enters the link l in slot t. We assume that the second moments of the arrival process ${A_{sl} (t)}_{t = 1}^{\infty}$ are finite. Let $Q_{T (l)}^{s} (t)$ and $Q_{R (l)}^{s} (t)$ denote the current backlog in the queue of class s in slot t on the transmitter and receiver sides of link l, respectively. The queue length $Q_{T (l)}^{s} (t)$ evolves over time as

\begin{align} Q_{T (l)}^{s} (t + 1) = max (Q_{T (l)}^{s} (t) - r_{l}^{s} (t), 0) + A_{sl} (t) \\ + \sum_{{m | T (l) = R (m), m \in L}} r_{m}^{s} (t), \end{align}

(10)

where $r_{l}^{s} (t)$ denotes the transmission rate of link l for users of class s. The third term in (10) denotes the traffic from the other links. The queue length process ${Q_{T (l)}^{s} (t)}_{t = 1}^{\infty}$ forms a Markov chain.

Let ψ_s denote the first moment of ${A_{sl} (t)}_{t = 1}^{\infty}$ , i.e., the load brought by users of class s. As is standard[20, 21, 28], the stability region is defined as follows.

Definition 1

The stability region ∧ is the closure of the set of all ${ψ_{s}}_{s = 1}^{S}$ for which there exists some feasible power allocation policy under which the system is stable, i.e., $\land = ⋃_{p \in P} \land (p)$ , where $\land (p) = {{ψ_{s}}_{s = 1}^{S} | \sum_{s = 1}^{S} E_{sl} ψ_{s} < r_{l} (p), \forall l}$ , $P$ denotes the set of feasible power allocation, r_l(p) denotes the rate of link l under power allocation p, and E_sl = 1 is the indicator that the path of users of class s uses link l, and E_sl = 0, otherwise.

For the sake of comparison, the throughput regionⁱ $F$ of the corresponding saturated case is defined as the set of all feasible link rates, i.e., $F = {r | r_{l} = r_{l} (p), p \in P}$ . In general, the throughput region $F$ may be different from the stability region ∧, except for some special cases (e.g., in slotted ALOHA systems the throughput region and the stability region are the same[29] for two links and in a multiple-access channel the information theoretic capacity region is equivalent to its stability region under specific feedback assumptions[30]).

The queueing system is stable if the arrival rates of packet queues are less than the service rates such that the queue lengths do not grow to infinity[31]. In order to stabilize packet queues, it is critical to find the optimal scheduling and power allocation policy that maximizes the weighted sum rate given by (11). By integrating our power control algorithms with the back-pressure algorithm, we propose a joint scheduling and power allocation policy presented in Algorithm 4 to stabilize the queueing system.

Proposition 3

The joint scheduling and power allocation policy (Algorithm 4) can stabilize the system such that ${lim sup}_{t \to \infty} \frac{1}{t} \sum_{τ = 0}^{t - 1} \sum_{l, s} E {Q_{l}^{s} (τ)} < \infty$ , when the traffic load ${ψ_{s}}_{s = 1}^{S}$ is strictly interior to the stability region ∧, i.e., there exists some ε > 0 such that ${ψ_{s} + ε}_{s = 1}^{S} \in \land$ .

The proof is similar to that in[21, 32], and is omitted for brevity.

Note that Algorithm 4 can be viewed as a dynamic back-pressure and resource allocation policy[32], crafted towards solving the weighted sum rate maximization problem (11). Specifically, by using the DSPC algorithm, Algorithm 4 can be implemented distributively to find the globally optimal resource allocation. We should caution that EDSPC can be applied to improve the convergence rate of Stage 2 in Algorithm 4 but it may render a suboptimal schedule (i.e., it can not stabilize all possible ${ψ_{s}}_{s = 1}^{S}$ within ∧), due to the fact that EDSPC may not always find the global optimal power allocation.

To reduce the complexity, we can consider a policy that computes (11) periodically every few slots, and it can be shown that this policy can also stabilize the system, when ${ψ_{s}}_{s = 1}^{S}$ is strictly interior to the stability region ∧[33, 34].

Algorithm 4 Joint scheduling and power allocation policy

Stage 1: For each link l, select a link weight according to $w_{l} (t) = max_{s = 1, \dots, S} D_{l}^{s} (t)$ , where the difference of queue lengths of class s is $D_{l}^{s} (t) = max (Q_{T (l)}^{s} (t) - Q_{R (l)}^{s} (t), 0)$ , if the receiver of link l is not the destination of class s’s traffic, and $D_{l}^{s} (t) = Q_{T (l)}^{s}$ , otherwise.

Stage 2: Compute the optimal power allocation p^∗ in each slot t by solving the following problem with DSPC algorithm

p^{*} = \underset{p}{arg max} \sum_{l = 1}^{L} w_{l} (t) r_{l} (p) .

(11)

Thus, the transmission rate of link l in slot t is given by $r_{l} (p^{*}) = log (1 + γ_{l} (p^{*}))$ .

Stage 3: Let $s_{l}^{*} = arg max_{s = 1, \dots, S} D_{sl} (t)$ denote the class scheduled in slot t; if multiple classes satisfy this condition, then $s_{l}^{*}$ is randomly chosen as one of these classes. Then, schedule these classes according to the solution given by Stage 2.

Performance evaluation

In this section, we present numerical results to illustrate the use of Algorithm 4 for stabilizing a queueing system. We consider a one-hop network (i.e., E = {E_sl} is the identity matrix) with two users (classes), where the channel gains are h₁₁ = 0.3, h₁₂ = 0.5, h₂₁ = 0.03, and h₂₂ = 0.8, and the noise power is 0.1 for each link. The maximum transmission power is set to 1 and 2 for links 1 and 2, respectively. Besides, we assume that the users of class s arrive at the network according to a Poisson process with rate λ_s, and that the size of packet batch for users of class s follows an exponential distribution with mean ν_s. The load brought by users of class s is then ψ_s= λ_sν_s. Figure7 shows the stability region ∧ and compares it with the throughput region $F$ of the corresponding saturated case. The stability region follows from the union of link rates that are conditioned on whether the other link is backlogged or not[29, 30]. First, we derive the stability region for the given power allocation. Then, we vary power allocation in the feasible region, and by taking the envelope of these regions, we obtain the overall stability region shown in Figure7. However, different from the previous cases, where the throughput region is the same as the stability region, e.g., in a slotted ALOHA system with two links[29] and in a multiple-access channel[30], the throughput region $F$ under the SINR model is strictly smaller than the stability region (due to the underlying nonconvex optimization problem), as observed from Figure7, which is the convex hull of $F$ , i.e., $Co (F)$ , achievable by timesharing across different transmission modes.^j

Then, we vary the arrival rate λ and the average batch size ν to change the traffic intensity ψ = λν. Assuming that the arrival rate and the average batch size of each user are the same, we compare in Figure8 the sample paths of each user’s queue length for ψ = 1 (λ = 1, ν = 1) with ψ = 1.5 (λ = 1.5, ν = 1). When ψ = 1, which falls in the stability region shown in Figure7, the system is stabilized by using Algorithm 4. On the other hand, the system becomes unstable when ψ = 1.5, which is outside the stability region. Figure9 illustrates the average delay of the system as a function of the arrival rates. The delay is finite for small loads and grows unbounded when the loads are outside the stability region.

Power control for multicast communications

Due to wireless multicast advantage[23], multicasting enables efficient data delivery to multiple recipients with a single transmission. In this section, we extend the DSPC algorithms in Section “Power control for unicast communications” to support multicast communications.

System model

Beyond the model described in Section “Power control for unicast communications”, we consider that each user l has one transmitter and a set $M_{l}$ of receivers. The corresponding transmission rate, r_l, is determined by the bottleneck link among these transmitter–receiver pairs, i.e., $r_{l} = min_{m \in M_{l}} r_{lim}$ , where r_lm denotes the link rate between the transmitter of user l and its receiver m, and it is calculated from the Shannon rate $log (1 + γ_{lim} (p))$ for Gaussian, flat fading channels. Here, we do not consider the general broadcast capacity region but rather focus on maximizing the bottleneck link rates.

Network utility maximization

We seek to find the optimal power allocation p^∗ that maximizes the overall system utility subject to the power constraints in multicast communications, as follows:

\begin{array}{l} maximize & \sum_{l \in L} U_{l} (r_{l}) \\ subject to & r_{l} = min_{m \in M_{l}} r_{lim}, \forall l \in L \\ r_{lim} = log (1 + γ_{lim} (p)), \forall l \in L, m \in M_{l} \\ 0 \leq p_{l} \leq P_{l}^{max}, \forall l \in L \\ variables & {p, {r_{l}}, {r_{lim}}} . \end{array}

(12)

Similar to (2), (12) is nonconvex due to the complicated interference coupling between individual links. Different from the techniques used in Section “Power control for unicast communications”, we relax $r_{l} = min_{m \in M_{l}} r_{lim}$ in (12) as $r_{l} \leq log (1 + γ_{lim} (p)), \forall l \in L, m \in M_{l}$ , in order to devise distributed algorithms solving (12). Thus, (12) can be rewritten as

\begin{array}{l} maximize & \sum_{l \in L} U_{l} (r_{l}) \\ subject to & r_{l} \leq log (1 + γ_{lim} (p)), \forall l \in L, m \in M_{l} \\ 0 \leq p_{l} \leq P_{l}^{max}, \forall l \in L \\ variables & {p, r} . \end{array}

(13)

Distributed global optimization algorithms

We develop next distributed algorithms that can find the globally optimal solutions to (13) based on EDT and SA. To this end, we first rewrite the optimization problem (13) as

\begin{array}{l} minimize & - \sum_{l \in L} U_{l} (r_{l}) \\ subject to & r_{l} \leq log (1 + γ_{lim} (p)), \forall l \in L, m \in M_{l} \\ 0 \leq p_{l} \leq P_{l}^{max}, \forall l \in L \\ variables & {p, r} . \end{array}

(14)

Next, we use EDT to write the Lagrangian function for (14) as

\begin{array}{l} L (p, r, {α_{lim}}) = - \sum_{l \in L} U_{l} (r_{l}) \\ + \sum_{l \in L, m \in M_{l}} α_{lim} {(r_{l} - log (1 + γ_{lim} (p)))}^{+}, \end{array}

(15)

where $α_{lim} \in R$ are the penalty multipliers. From EDT, there exist finite $α_{lim}^{*} \geq 0$ for all $l \in L, m \in M_{l}$ such that, for any $α_{lim} > α_{lim}^{*}$ $\forall l \in L, m \in M_{l}$ , the solution to (15) is the same as (14)[17]. Note that (15) does not include the constraint of p_l for each user. Therefore, there will be no constraint violation when each user updates the transmission power locally, while minimizing (15) in a distributed operation.

As in Section “Power control for unicast communications”, the key step is to let each user perform a local stochastic search based on SA. Let r_l and p_l denote the primal values of the l th user, and $r_{l}^{'}$ denote the new value randomly chosen by the l th user, which is treated as a new target transmission rate for the l th user. Different from the unicast communications case, the l th user updates $p_{l}^{'}$ by

p_{l}^{'} = min (\frac{e^{r_{l}^{'}} - 1}{min_{m \in M_{l}} γ_{lim}} p_{l}, P_{l}^{max}),

(16)

where γ_lm is the current SINR measured at the receiver m of user l. Note that (16) does not need any information of the channel gains, except the feedback of SINR γ_lm from the intended receivers. Since (16) is in standard form as described in[27], it converges geometrically fast to the target transmission rate. The steps to update _{r
l} and α_lm are similar to DSPC Algorithm 2 in Section “Power control for unicast communications”. Note that the target transmission rate r_l may not be feasible, i.e., the target utility cannot be achieved even though the user transmits at the maximum power. In this case, it can be shown that the power of those users with feasible target transmission rates will converge to a feasible solution, whereas the other users that cannot achieve the target transmission rate will continue to transmit at maximum power[1]. A detailed description of DSPC algorithm for multicast communications is presented in Algorithm 5.

Remarks

In Algorithm 5, each user randomly picks $r_{l}^{'} \in [0, r_{l}^{max}]$ , where $r_{l}^{max} = min_{m \in M_{l}} r_{lim}^{max}$ , and $r_{lim}^{max}$ is the maximum link rate between the transmitter of the user l and its receiver m, when the user l transmits at the maximum power while the other users do not transmit.

Proposition 4

The DSPC algorithm for multicast communications (Algorithm 5) converges almost surely to a globally optimal solution to (13), as temperature T in SA approaches zero.

Proof

The proof is based on EDT and SA arguments, and follows similar steps used in the proof of Proposition 2, and it is omitted here for brevity. □

To improve the convergence rate, we also propose an enhanced algorithm for Algorithm 5 by empirically choosing the initial penalty values and employing a geometric cooling schedule. The resulting algorithm is given in Algorithm 6. Similar to the unicast case, Algorithms 5 and 6 do not need any knowledge of channel information (or the bottleneck link) and they are dynamically updated by the SINR feedback from the intended receivers.

Algorithm 5 DSPC for multicast communications

Initialization: Choose ε > 0. Let α_lm= 0, $\forall l \in L, m \in M_{l}$ and randomly choose r and p.

Step 1: update primal variables Set T = T₀, and select a sequence of time epochs {τ₁, τ₂,…} in continuous time.

Repeat for each user l

1.
Randomly pick $r_{l}^{'} \in [0, r_{l}^{max}]$ , and update $p_{l}^{'}$ according to (16).
2.
Keep sensing the change of $\sum_{m \in M_{l}} α_{lim} {(r_{l} - log (1 + γ_{lim} (p)))}^{+}$ broadcast by other users.
3.
Let Δ be the difference between L(p, r _l|r _−l, {α _lm}) and $L (p^{'}, r_{l}^{'} | r_{- l}, {α_{lim}})$ , and accept $r_{l}^{'}$ and $p_{l}^{'}$ with probability 1, if Δ ≥ 0, or with probability $exp (\frac{Δ}{T})$ , otherwise.
4.
Broadcast $U_{l} (r_{l}^{'})$ , if $r_{l}^{'}$ is accepted.
5.
For each time epoch τ _i, update $T = T_{0} / log (i + 1)$ .

Until T < ε.

Step 2: update penalty variables

For each user l,

1.
Update $α_{lim} \leftarrow α_{lim} + ϱ_{lim} {(r_{l} - log (1 + γ_{lim} (p)))}^{+}$ , and scale down α _lm, if the condition of penalty decrease is satisfied.
2.
Go to Step 1 until no constraint is violated.

Algorithm 6 EDSPC for multicast communications

Initialization: Choose ε > 0. Let $α_{lim} = α_{lim}^{0}$ , $\forall l \in L, m \in M_{l}$ and randomly choose r and p.

Set T = T₀, and select a sequence of time epochs {τ₁, τ₂, …} in continuous time.Repeat for each user l

1.
Randomly pick $r_{l}^{'} \in [0, r_{l}^{max}]$ , and update $p_{l}^{'}$ according to (16).
2.
Keep sensing the change of $\sum_{m \in M_{l}} α_{lim} {(r_{l} - log (1 + γ_{lim} (p)))}^{+}$ broadcast by other users.
3.
Let Δ be the difference between L(p, _{r
l}|r _−l, {α _lm}) and $L (p^{'}, r_{l}^{'} | r_{- l}, {α_{lim}})$ , and accept $r_{l}^{'}$ and $p_{l}^{'}$ with probability 1, if Δ ≥ 0, or with probability $exp (\frac{Δ}{T})$ , otherwise.
4.
Broadcast $U_{l} (r_{l}^{'})$ , if $r_{l}^{'}$ is accepted.
5.
For each time epoch τ _i, update T = ξT.

Until T < ε.

Performance evaluation

In this section, we evaluate the performance of Algorithms 5 and 6 for multicast communications. We consider a wireless network with four transmitters and each transmitter has two receivers. These transmitters and receivers are randomly placed on a 10-by-10 m² square area. The channel gains h_lm are equal to $d_{lim}^{- 4}$ , where d_lm represents the distance between the transmitter l and the receiver m. The channel gains h_lm are equal to $d_{lim}^{- 4}$ , where d_lmrepresents the distance between the transmitter l and the receiver m. We assume U_l(r_l) = r_l, $P_{l}^{max} = 1$ , and $n_{lim} = 1 0^{- 4}$ for all $l \in L$ and $m \in M_{l}$ . Figure10 illustrates the fast convergence performance of Algorithms 5 and 6 in multicast communications.^k Besides, we examine the average performance (with confidence interval) of DSPC and EDSPC for multicast communications under 100 random initializations with the same system parameters as in the case shown in Figure10. As illustrated in Figure11, both Algorithms 5 and 6 are robust against the initial value variations.

Conclusion

We studied the distributed power control problem of optimizing the system utility as a function of the achievable rates in wireless ad hoc networks. Based on the observation that the global optimum lies on the boundary of the feasible region for unicast communications, we focused on the equivalent but more structured problem in the form of maximizing the minimum weighted utility. Appealing to EDT, we decomposed the minimum weighted utility maximization problem into subproblems by using penalty multipliers for constraint violations. We then proposed a DSPC algorithm to seek a globally optimal solution, where each user stochastically announces its target utility to improve the total system utility via SA. In spite of the nonconvexity of the underlying problem, the DSPC algorithm can guarantee global optimality, but only with a slow convergence rate. Therefore, we proposed an EDSPC to improve the convergence rate with geometric cooling schedule in SA. We then compared DSPC and EDSPC with the existing power control algorithms and verified the optimality and complexity reduction.

Next, we proposed the joint scheduling and power allocation policy for queueing systems by integrating our distributed power control algorithms with the back-pressure algorithm. The stability region was evaluated, which is shown to be strictly greater than the throughput region in the corresponding saturated case. Beyond unicast communications, we generalized our power control algorithms to multicast communications by jointly maximizing the minimum rates on bottleneck links in different multicast groups. Our DSPC approach guarantees global optimality without the need of channel information, while reducing the computation complexity, in general systems with unicast and multicast communications, and applies to both backlogged and random traffic patterns.

Endnotes

^a We use the terms “user” and “link” interchangeably throughout the article.

^b We use bold symbols (e.g., p) to denote vectors and calligraphic symbols (e.g., $L$ ) to denote sets.

^c The QoS constraint for each link can be incorporated in (2), and the proposed algorithms in the following section can easily be adapted to this case at the cost of added notational complexity.

^d For some special utility functions U_l(·), (2) can be transformed into a convex problem[3]. In this article, we focus on the nonconvex case that cannot be transformed to a convex problem by change of variables.

^e The local optimal solution found by ADP matches the globally optimal solution only in one of the cases that are illustrated in Table2.

^f By definition, a function $f : R^{n} \to R$ is quasi-convex, if its domain dom f and all its sublevel sets $S_{c} = {x \in dom f | f (x) \leq c}$ , for $c \in R$ , are convex[26].

^g A power control algorithm is of standard form, if the interference function (the effective interference each link must overcome) is positive, monotonic and scalable in power allocation[27].

^h The geometric cooling schedule is employed to accelerate the convergence rate of DSPC in the simulation. DSPC updates penalty values until they satisfy the threshold-based optimality condition.

ⁱ Note that the feasible utility region $F$ defined in Section “Power control for unicast communications” is the throughput region, when the utility function is the same as the rate function.

^j The transmission mode is defined as the transmission rate pair within the throughput region $F$ .

^k The other existing algorithms have specifically been designed for unicast communications; therefore, they are excluded here from the performance comparison.

References

Chiang M, Hande P, Lan T, Tan CW: Power control in wireless cellular networks. Found. Trends Netw 2008, 2(4):381-553.
Article Google Scholar
Julian D, Chiang M, O’Neill D, Boyd S: Qos and fairness constrained convex optimization of resource allocation for wireless cellular and ad hoc networks. Proc. IEEE INFOCOM, vol. 2 2002, 477-486. (New York, USA)
Google Scholar
Chiang M, Tan CW, Palomar D, O’Neill D, Julian D: Power control by geometric programming. IEEE Trans. Wirel. Commun 2007, 1(7):2640-2651.
Article Google Scholar
Xiao M, Shroff NB, Chong EKP: A utility-based power control scheme in wireless cellular systems. IEEE/ACM Trans. Netw 2003, 11(2):210-221. 10.1109/TNET.2003.810314
Article Google Scholar
Luo Z-Q, Zhang S: Dynamic spectrum management: complexity and duality. IEEE J. Sel. Topics Signal Process 2008, 2(1):57-73.
Article Google Scholar
Huang J, Berry R, Honig M: Distributed interference compensation for wireless networks. IEEE J. Sel. Areas Commun 2006, 24(5):1074-1084.
Article Google Scholar
Hande P, Rangan S, Chiang M, Wu X: Distributed uplink power control for optimal SIR assignment in celluar data networks. IEEE/ACM Trans. Netw 2008, 16(6):1420-1433.
Article Google Scholar
Papandriopoulos J, Dey S, Evans J: Optimal and distributed protocols for cross-layer design of physical and transport layers in MANETs. IEEE/ACM Trans. Netw 2008, 16(6):1392-1405.
Article Google Scholar
Papandriopoulos J, Evans JS: SCALE: a low-complexity distributed protocol for spectrum balancing in multiuser DSL networks. IEEE Trans. Inf. Theory 2009, 55(8):3711-3724.
Article MathSciNet Google Scholar
Saraydar CU, Mandayam NB, Goodman DJ: Efficient power control via pricing in wireless data networks. IEEE Trans. Commun 2002, 50(2):291-303. 10.1109/26.983324
Article Google Scholar
Alpcan T, Basar T, Srikant R, Altman E: Wirel. Netw. 2002, 8(6):659-670.
Google Scholar
Xu Y, Le-Ngoc T, Panigrahi S: Global concave minimization for optimal spectrum balancing in multi-user DSL networks. IEEE Trans. Signal Process 2008, 56(7):2875-2885.
Article MathSciNet Google Scholar
Horst R, Tuy H: Global Optimization—Deterministic Approaches. (Springer, New York, 2006)
Qian L, Zhang YJ, Huang J: MAPEL: achieving global optimality for a non-convex power control problem. IEEE Trans. Wirel. Commun 2009, 8(3):1553-1563.
Article Google Scholar
Qian L, Zhang YJ, Chiang M: Globally optimal distributed power control for nonconcave utility maximization. Proc. IEEE GLOBECOM 2010, 1-6. (Miami, USA)
Google Scholar
Geman S, Geman D: Stochastic relaxation, Gibbs distributions, and the Bayesian restoration of images. IEEE Trans. Pattern Anal. Mach. Intell 1984, 6(6):721-741.
Article MATH Google Scholar
Chen Y, Chen M: Extended duality for nonlinear programming. Comput. Optim. Appl 2010, 47(1):33-59. 10.1007/s10589-008-9208-3
Article MathSciNet MATH Google Scholar
Kirkpatrick S, Gelatt CD, Vecchi JMP: Optimization by simulated annealing. Science 1983, 220(4598):671-680. 10.1126/science.220.4598.671
Article MathSciNet MATH Google Scholar
Hajek B: Cooling schedules for optimal annealing. Math. Oper. Res 1988, 13(2):311-329. 10.1287/moor.13.2.311
Article MathSciNet MATH Google Scholar
Tassiulas L, Ephremides A: Stability properties of constrained queueing systems and scheduling policies for maximum throughput in multihop radio networks. IEEE Trans. Autom. Control 1992, 37(12):1936-1948. 10.1109/9.182479
Article MathSciNet MATH Google Scholar
Neely MJ, Modiano E, Rohrs CE: Dynamic power allocation and routing for time varying wireless networks. IEEE J. Sel. Areas Commun 2005, 23(1):89-103.
Article Google Scholar
Lee H-W, Modiano E, Le LB: Distributed throughput maximization in wireless networks via random power allocation. IEEE Trans. Mob. Comput 2011, 11(4):577-590.
Google Scholar
Wieselthier JE, Nguyen GD, Ephremides A: On construction of energy-efficient broadcast and multicast trees in wireless networks. Proc. IEEE INFOCOM, vol. 2 2000, 585-594. (Tel Aviv, Israel)
Google Scholar
Wang K, Chiasserini CF, Proakis JG, Rao RR: Joint scheduling and power control supporting multicasting in wireless ad hoc networks. 2006, 4(4):532-546.
Google Scholar
Mo J, Walrand J: Fair end-to-end window-based congestion control. IEEE/ACM Trans. Netw 2000, 8(5):556-567. 10.1109/90.879343
Article Google Scholar
Boyd S, Vandenberghe L: Convex Optimization. (Cambridge University Press, Cambridge, UK, 2004)
Yates RD: A framework for uplink power control in cellular radio systems. IEEE J. Sel. Areas Commun 1995, 13(7):1341-1347. 10.1109/49.414651
Article MathSciNet Google Scholar
Lin X, Shroff NB, Srikant R: On the connection-level stability of congestion-controlled communication networks. IEEE Trans. Inf. Theory 2008, 54(5):2317-2338.
Article MathSciNet MATH Google Scholar
Rao R, Ephremides A: On the stability of interacting queues in a multiple-access system. IEEE Trans. Inf. Theory 1988, 34(5):918-930. 10.1109/18.21216
Article MathSciNet MATH Google Scholar
ParandehGheibi A, Medard M, Ozdaglar A, Eryilmaz A: Information theory vs. queueing theory for resource allocation in multiple access channels. Proc. IEEE Pers. Ind. Mob. Radio Commun 2008, 1-5. (Nice, France), pp. 1–5
Google Scholar
Loynes R: The stability of a queue with non-interdependent inter-arrival and service times. Proc. Camb. Philos. Soc 1962, 58: 497-520. 10.1017/S0305004100036781
Article MathSciNet MATH Google Scholar
Georgiadis L, Neely MJ, Tassiulas L: Resource allocation and cross-layer control in wireless networks. Found. Trends Netw 2006, 1(1):1-149.
Article MATH Google Scholar
Chaporkar P, Sarkar S: Stable scheduling policies for maximizing throughput in generalized constrained queueing systems. IEEE Trans. Autom. Control 2008, 53(8):1913-1931.
Article MathSciNet Google Scholar
Yi Y, Zhang J, Chiang M: Delay and effective throughput of wireless scheduling in heavy traffic regimes: vacation model for complexity. Proc. ACM Mobihoc 2009, 55-64. (New York, USA)
Google Scholar

Download references

Acknowledgements

This study was supported in part by the DoD MURI Project FA9550-09-1-0643 and the AFOSR grants FA9550-10-C-0026 and FA9550-11-C-0006.

Author information

Authors and Affiliations

School of ECEE, Arizona State University, Tempe, AZ, 85287, USA
Lei Yang & Junshan Zhang
Intelligent Automation, Inc, Rockville, MD, 20855, USA
Yalin E Sagduyu & Jason H Li

Authors

Lei Yang
View author publications
You can also search for this author in PubMed Google Scholar
Yalin E Sagduyu
View author publications
You can also search for this author in PubMed Google Scholar
Junshan Zhang
View author publications
You can also search for this author in PubMed Google Scholar
Jason H Li
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Lei Yang.

Additional information

Competing interests

The authors declare that they have no competing interests.

Authors’ original submitted files for images

Below are the links to the authors’ original submitted files for images.

Authors’ original file for figure 1

Authors’ original file for figure 2

Authors’ original file for figure 3

Authors’ original file for figure 4

Authors’ original file for figure 5

Authors’ original file for figure 6

Authors’ original file for figure 7

Authors’ original file for figure 8

Authors’ original file for figure 9

Authors’ original file for figure 10

Authors’ original file for figure 11

Rights and permissions

Open Access This article is distributed under the terms of the Creative Commons Attribution 2.0 International License (https://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

Reprints and permissions

About this article

Cite this article

Yang, L., Sagduyu, Y.E., Zhang, J. et al. Distributed stochastic power control in ad hoc networks: a nonconvex optimization case. J Wireless Com Network 2012, 231 (2012). https://doi.org/10.1186/1687-1499-2012-231

Download citation

Received: 15 February 2012
Accepted: 04 July 2012
Published: 24 July 2012
DOI: https://doi.org/10.1186/1687-1499-2012-231

Distributed stochastic power control in ad hoc networks: a nonconvex optimization case

Abstract

Introduction

Power control for unicast communications

System model

Network utility maximization

Example

Remarks

From network utility maximization to minimum weighted utility maximization

Lemma 1

Proof

Lemma 2

Proof

Centralized versus distributed algorithms

A centralized algorithm

Algorithm 1

Proposition 1

Proof

Remarks

DSPC algorithm

Algorithm 2 DSPC

Proposition 2

Proof

Remarks

EDSPC algorithm

Algorithm 3 EDSPC

Performance evaluation

Joint scheduling and power control for stability of queueing systems

Stability region and throughput optimal power allocation policy

Definition 1

Proposition 3

Algorithm 4 Joint scheduling and power allocation policy

Performance evaluation

Power control for multicast communications

System model

Network utility maximization

Distributed global optimization algorithms

Remarks

Proposition 4

Proof

Algorithm 5 DSPC for multicast communications

Algorithm 6 EDSPC for multicast communications

Performance evaluation

Conclusion

Endnotes

References

Acknowledgements

Author information

Authors and Affiliations

Corresponding author

Additional information

Competing interests

Authors’ original submitted files for images

Rights and permissions

About this article

Cite this article

Share this article

Keywords