A Stochastic Multiobjective Optimization Framework for Wireless Sensor Networks
© Shibo He et al. 2010
Received: 31 October 2009
Accepted: 17 February 2010
Published: 20 May 2010
Skip to main content
© Shibo He et al. 2010
Received: 31 October 2009
Accepted: 17 February 2010
Published: 20 May 2010
In wireless sensor networks (WSNs), there generally exist many different objective functions to be optimized. In this paper, we propose a stochastic multiobjective optimization approach to solve such kind of problem. We first formulate a general multiobjective optimization problem. We then decompose the optimization formulation through Lagrange dual decomposition and adopt the stochastic quasigradient algorithm to solve the primal-dual problem in a distributed way. We show theoretically that our algorithm converges to the optimal solution of the primal problem by using the knowledge of stochastic programming. Furthermore, the formulation provides a general stochastic multiobjective optimization framework for WSNs. We illustrate how the general framework works by considering an example of the optimal rate allocation problem in multipath WSNs with time-varying channel. Extensive simulation results are given to demonstrate the effectiveness of our algorithm.
The layered architecture approach has achieved great success in traditional wired network design by dividing the whole architecture into several modules, called layers, each of which performs a separate functionality. As each layer design only needs some interface variables from the layer below, the complexity of other layers can be hidden. The layered architecture approach suggests that the network design can be scalable, evolvable, and implementable. However, it may have limitations in improvement of efficiency and fairness, and suffer potential risks of manageability , which motivates the optimization of network design. Chiang et al. [1, 2] propose an optimization decomposition technique to systematically understand the network architecture, known as "layering as optimization decomposition". They model the network as an optimization problem and decompose the problem into many subproblems. They classify the decompositions into vertical decomposition and horizontal decomposition. Vertical decomposition layers the network architecture into several modules and horizontal decomposition provides distributed algorithms to fulfill the functionality within the modules. According to the requirements of the applications, the decomposition may be different, yielding different layers and distributed algorithms. There are usually two steps in the process of layering as optimization decomposition: ( ) modeling the network problem as a specific NUM problem, and ( ) exploring the alternative decompositions to design different modules and distributed algorithms. Most existing efforts have been put to the second step and simply assume that the network problems can be modeled by a unified utility function at the first step [3–6]. However, not all network problems can be modeled by a unified utility function in a tractable way since there may exist many objectives to be achieved, such as guaranteeing fairness, maximizing throughput, reducing packet dropping and delay, prolonging the network lifetime, and so forth. It may not be possible to integrate all these objectives into a single unified utility function, that is, network problems should be formulated as multiobjective optimization problems.
While the performance of the network can be greatly enhanced by adopting the NUM approach, the corresponding cost of algorithm implementation also increases. As we usually design and implement an algorithm for a specific application from scratch, the implementation can hardly be transplantable to other applications. This is especially aggravated in wireless sensor networks (WSNs) due to the application-oriented and infrastructureless nature of these networks. For example, if we design an efficient algorithm for events monitoring, the network lifetime is the main concern and the propagation delay can be tolerant, but it is difficult to apply such algorithm to online query applications, where the query delay is the primary objective.
In WSNs, some parameters (e.g., the topology of the networks or channel condition) are time-varying. In , Lee et al. demonstrated that the state of the network can be more efficiently utilized to improve the performance of the network (e.g., increasing the throughput and reducing packet delay), by appropriately exploiting the variability of the time-varying channels. Also there are measurement errors in the implementation of distributed algorithms, such as the noisy feedback  or lossy links . Therefore, we also characterize these random factors in our model. Our contributions in this paper are summarized as follows.
We formulate a general multiobjective stochastic optimization problem for WSNs. We decompose the optimization problem through Lagrange dual decomposition and adopt the stochastic quasigradient method to solve the primal-dual problem. In other words, we transform the multiple objectives of WSNs into the multiple objectives of each individual sensor node. The global optimal solution can be obtained when each sensor node maximizes its own objective vector function. Therefore, our approach provides a general framework for multiobjective optimization for WSNs.
We study the stability of the algorithm by using the knowledge of stochastic programming, and show that our algorithm for stochastic multiobjective optimization problem (ASMOP) can converge to the optimal solution of the primal problem.
We demonstrate how the general framework can be applied to different applications, by considering the rate allocation problem as an example. We introduce three multiobjective optimization methods: ( ) constraint method, ( ) linear weighted method, and ( ) hierarchical sequence method. The three paradigms show that although different requirements may lead to different models [6, 10], we can solve them in the general framework.
The remainder of the paper is organized as follows: in Section 2, we discuss related work regarding the NUM problem and stochastic network utility maximization (SNUM). We formulate a general mathematical model and design a distributed algorithm to solve the problem in Section 3, and the stability of the algorithm is also discussed. We provide three paradigms in Section 4 to demonstrate the general framework for different applications. Simulation results are given in Section 5. We conclude the paper in Section 6.
There are several research works in the literature studying the NUM problem [4, 11–14]. Kelly et al. were the first to propose the optimization approach, which provides a mathematical foundation for NUM problem . In , Chiang adopted the NUM approach to obtain a cross-layer design including the physical layer and the transport layer. Zhu et al.  considered the energy model in the cross-layer design. In , Chiang et al. provided a mathematical theory of network architectures. Wang et al. studied joint interference-aware routing and TDMA link scheduling to improve the throughput in multihop wireless networks . Zhang et al.  elaborated on the impact of the feedback in the implementation of distributed NUM algorithms. Since feedback is often collected using error-prone measurement mechanisms, for example, biased estimator or unbiased estimator, they adopted the knowledge of stochastic approximation and proved stability of the algorithms of single-time scale and two-time scale. Lee et al. utilized the variation of channels to guide power and rate control in cross-layer design . In this paper, we formulate a more general mathematical model by considering stochastic multiple objectives in objective functions. We apply our approach to rate allocation problem in multipath WSNs with time-varying channels. Rate allocation is a fundamental problem and has been extensively investigated [16–19]. Low and Lapsley  first introduced the Lagrange dual method to decompose the problem and proposed two algorithms under synchronous and asynchronous scenarios. A multipath formulation for rate control in multi-cast networks was proposed in , and three distributed algorithms were proposed to solve the problem. The goal is to maximize the aggregate utility. In , Srinivasan et al. considered two objectives: utility maximization and guaranteeing prespecified network lifetime for multipath wireless ad hoc networks. In , Zhu et al. also focused on the network lifetime and application performance (utility), and employed the linear weighted method from the multiobjective optimization to transform these two objective functions into a single one which was named to the utility-lifetime tradeoff function.
Throughout the paper, we will denote sets by capital letters, variables by lowercase letters, vectors by bold lowercase letters, and matrices by bold capital letters. For a vector , we denote its th component by and its transpose by . We use capital letters for both the sets and the cardinality of sets.
where is the th decoupled objective function, and is the corresponding vector of auxiliary variables.
In our formulation, the objective functions are deterministic, taking the advantage that each sensing node can be obtained from the network. The constraint set contains the random factors of the networks, such as message exchange, and environmental effect. If we know the distribution, , , of , we can transform the problem into a deterministic one, by calculating the expectation. However, in WSNs, there is often no prior knowledge about the randomness from the networks themselves and the environmental effect. Therefore, we develop an algorithm without this prior knowledge, which can be achieved by the stochastic quasigradient method .
where is the objective vector function of sensing node . It is a formal expression which can be transformed into different objective functions for different applications.
where and are the stochastic quasigradients of .
where is the state of at iteration .
We summarize our algorithm for the general formulation of stochastic multiobjective optimization problem (ASMOP) in the Algorithm 1.
Algorithm 1: ASMOP.
( ) Price update algorithm: At times , decoupled prices are updated according to
( ) Sensor node 's Algorithm: At time , each sensing node updates its variables
To prove that the algorithm can converge to the optimal solution of the primal problem, we make the following assumptions.
as well as the objective vector function (which can be transformed into a single function in applications), , are twice continuous differentiable concave functions.
, , are convex and twice continuous differentiable functions in , for all .
If (1) hold, then from an arbitrary point of , and , the sequence generated by (7), (9), and (10) converges. Every limit point of the sequence is primal-dual optimal.
It can be seen that , , satisfy (14). From ; we know that and from (12) and (13) also satisfy (15) and (16).
From assumptions (1) and (2), the primal function is concave and the dual function is convex in and for a fixed . From (7), (9), (10), (11), (12), and (13), we can conclude that the sequence converges to the optimal solution by solving the dual problem . As the primal problem is a convex optimization problem, there is no gap between the primal and dual problems. So the sequence generated by the algorithm is primal-dual optimal.
Because of multipath routing, the problem, , may not be strictly concave even if is strictly concave. This may lead to oscillation of the sequences generated by the algorithms. There are several ways to cope with this problem. For example, we can first add some augmented variables to and adopt the first-order Lagrangian method to solve it .
The main difference of our proposed approach is that we adopt the knowledge of multiobjective optimization and provide some potential interfaces for each layer. In this way, we can take the advantages of both the layered architectures and cross-layer design. In other words, we can implement different algorithms in each module according to specific applications. In Figure 1, and act as the interface variables between different modules and sensor nodes. Through and , the network architectures can be decomposed into different modules and each module fulfills corresponding functionality distributively. From (7), we can transform the multiple objectives of the whole network into the multiple objectives of each sensor node. Optimizing the objective vector function of each sensor node can achieve the global optimal solution. Therefore, it is very convenient to implement algorithms in each module to solve the objective vector function independently according to different requirements.
In the proposed general framework, in is a vector function and can be transformed into a single function according to different requirements. Therefore, solving is application-dependent, which provides the flexibility of solving a class of applications by the general framework.
In this section, we consider the rate allocation problem as an example and show how the general framework works. Rate allocation problem is a well-investigated problem , and has different requirements for different applications. There are usually three methods to cope with the requirements: ( ) Constraint Method , ( ) Linear Weighted Method , and ( ) Hierarchical Sequence Method. While these methods are extensively studied in existing works, we can integrate these methods together into the general framework. Hence, our approach can be applied to a class of applications with different background, which will offer significant convenience to the designers.
In this section, we give a brief introduction to the three multiobjective methods.
, , are constant constraints imposed by applications.
, , are weight coefficients.
In this section, we consider the rate allocation problem with two objectives: ( ) maximizing aggregate utility and ( ) prolonging the network lifetime.
Assume the sensing nodes can transmit their rates to the sink nodes over a set of links, each of which has capacity . Each sensing node can transmit its rate through of the routes. Each route traverses over a set of links with a rate . Let be the rate vector of sensing node , the set of sensing nodes using link and the subset of routes used by sensing node to traverse over link . We denote the set of sensing nodes that use sensing node as an interim relay node by (not including the sensing node itself). Let be the subset of routes which use sensing node as a relay node and the relay nodes used by route . Let be finite number of state that the channels have and the probability of the state , . Each sensing node is characterized by three parameters , where is a strictly concave utility function, and which are the required minimum and maximum transmission rates for each sensing node , respectively.
where describes the distribution of the states of link channel condition and is the capacity of link under state . We will establish algorithms that can guarantee convergence without prior knowledge of the underlying probability distribution of the system channel state.
where and are the energy consumptions at sensing node for transmitting or receiving unit data flow over route , respectively.
Here, parameter scales the values of the two objective functions into the same order of magnitude.
where and .
where is the objective vector function for sensing node .
So far, we have introduced interface variables ( and ) for a fully distributed implementation and provided a framework for rate allocation in WSNs with time-varying channels. Next, we show how to apply the framework to applications under different methods.
where , is the weight coefficient.
It is sufficient to employ optimization methods to solve (37), (39), or (40), and (41) for different applications, while and updates are kept unchanged (according to (34) and (35)).
We have two objectives: maximizing the aggregate utility and the network lifetime. For the utility objective, we set for each sensor node , where . From , the function can have a ratio higher than 0.95 to approximate the original lifetime problem when . In our simulations, we use . The link capacities vary from time to time according to a uniform distribution with the expected capacities of links 1–13 to be 2000, 2000, 2200, 2000, 2000, 2500, 2800, 3500, 3500, 2000, 3000, 2800 (bit/s). For the energy consumption model, from , is a constant and , where is the length of the outgoing link of sensing node for transmitting rate of route . We set nJ/bit, pJ/bit/ , , nJ/bit. The initial energy of the sensing nodes 1–9 is set to be (J) and the sink node (node 10) is assumed to have enough energy. The minimum and maximum rates of each sensing node are set to be and , respectively.
We further set the threshold of the network lifetime to be 800 h in the simulation for the constraint method and for linear weighted method. These two methods mainly target the energy-constraint in WSNs. For the hierarchical sequence method, we focus on the utility of the network. From Figures 3, 4, and 5, it can seen that the rates in Figure 5 are much larger than those in Figures 3 and 4. On the other hand, as the rates in each sensor node become large, the energy consumption increases. So the network lifetime under the hierarchical sequence method is less than that under the constraint method or linear weighted method. In addition, different multiobjective methods obtain different network performances. The results of the three simulations also demonstrate the efficiency and convenience of our proposed framework.
In this paper, we have proposed a general stochastic multiobjective optimization framework for WSNs. Our approach inherits advantages of both layered architectures and cross-layer design. Therefore, even the requirements and objectives are changed, it is not necessary to redesign the optimization framework but to have minor modifications of specific modules to meet the corresponding requirements. Although there may be uncertainty in WSNs, our approach can still achieve desired performance. In our future work, we will focus on investigating the general multiobjective optimization problem, instead of transforming the multiple objectives into a single one. We will study the distributed algorithms to optimize the objective vector function under some criteria, for example, Pareto optimality.
The research was supported in parts by NSFC Guangdong joint Project grant U0735003, NSFC grants 60736021 and 60974122, China 863 High-Tech Project 2007AA041201, the Fundamental Research Funds for the Central Universities grant 2009QNA5007.
This article is published under license to BioMed Central Ltd. This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.