Rationally Inattentive Path-Planning via RRT*
Abstract
We consider a path-planning scenario for a mobile robot traveling in a configuration space with obstacles under the presence of stochastic disturbances. A novel path length metric is proposed on the uncertain configuration space and then integrated with the existing RRT* algorithm. The metric is a weighted sum of two terms which capture both the Euclidean distance traveled by the robot and the perception cost, i.e., the amount of information the robot must perceive about the environment to follow the path safely. The continuity of the path length function with respect to the topology of the total variation metric is shown and the optimality of the Rationally Inattentive RRT* algorithm is discussed. Three numerical studies are presented which display the utility of the new algorithm.
I Introduction
As robots are designed to be more self-reliant in navigating complex and stochastic environments, it is sensible for the strategic execution of perception/cognition tasks to be included in the theory which governs their path-planning [1, 2, 3]. Even though the body of work surrounding motion planning techniques has greatly expanded recently, a technological gap remains in the integration of perception concerns into planning tasks [4]. Mitigating this gap is paramount to missions which require robots to autonomously complete tasks when sensing actions carry high costs (battery power, computing constraints, etc).
Path-planning is typically followed by feedback control design, which is executed during the path following phase. In the current practice, path-planning and path-following are usually discussed separately (notable exceptions include [5, 6, 7]), and the cost of feedback control (perception cost in particular) is not incorporated in the path-planning phase. The first objective of this work is to fill this gap by introducing a novel path cost function which incorporates the expected perception cost accrued during path-following into the planning phase. This cost jointly penalizes the amount of sensing needed to follow a path and the distance traveled. Our approach is closely related to the concept of rationally inattentive (RI) control [8] (topic from macroeconomics which has recently been applied in control theory [9, 10]). The aim of rationally inattentive control is to jointly design the control and sensing policies such that the least amount of information (measured in bits) is collected about the environment in order to achieve the desired control.
The second objective of this work is to integrate the proposed path length function with an existing sampling-based algorithm, such as Rapidly-Exploring Random Trees (RRT) [11]. The RRT algorithm is suited for this problem as it has been shown to find feasible paths in motion planning problems quickly. A modified version of this algorithm, RRT* [12], will be utilized as it has the additional property of being asymptotically optimal. We develop an RRT*-like algorithm incorporating the proposed path length function (called the RI-RRT* algorithm) and demonstrate its effectiveness.
While the practical utility of the proposed framework must be thoroughly studied in the future, its expected impact is displayed in Fig. 1. This figure shows the example of a robot moving through the two-dimensional, obstacle-filled environment. Path A (red) represents the path from the origin to target location which minimizes the Euclidean distance. However, this path requires a large number of sensor actuations to keep the robot’s spatial uncertainty small and avoid colliding with obstacles. Alternatively, Path B (blue) allows for the covariance to safely grow more along the path. Although the Path B travels a greater Euclidean distance to reach the target, it is cheaper in the information-theoretic sense as it requires fewer sensing actions. Therefore, if the perception cost is weighed more than the travel cost, Path B is characterized as the shortest path in the proposed path planning framework. We will demonstrate this effect in a numerical simulation in Section V-C.
The proposed concept of rationally inattentive path-planning provides insight into the mathematical modeling of human experts’ skills in path planning [13], especially in terms of an efficiency-simplicity trade-off. Several path-planning algorithms have been proposed in the literature that are capable of enhancing path simplicity; this list includes potential field approaches [14], multi-resolution perception and path-planning [15, 16], and safe path-planning [17, 18]. The information-theoretic distance function we introduce in this paper can be thought of as an alternative measure of path simplicity, which may provide a suitable modeling of the human intuition for simplicity in planning. In our standard, a path which requires less sensor information during the path-following phase is more “simple;” Path B in Fig. 1 is simpler than Path A, and the simplest path is that which is traceable by an open-loop control policy.
The contributions of this paper are summarized as follows:
-
•
A novel path cost (RI cost) is formulated which jointly accounts for travel distance and perception cost.
-
•
The continuity of the path cost with respect to the topology of the total variation metric is shown in the single dimensional case, which is a step forward to guaranteeing the asymptotic optimality of sampling-based algorithms.
-
•
An RRT*-like algorithm is produced implementing the RI path-planning concept.
Notation: For the purpose of this work, the following definitions for vectors (lower case) and matrices (upper case) hold: , , and bold symbols such as represent random variables. The vector 2-norm is and is Frobenius norm. The maximum singular values of a matrix is denoted by .
II Preliminary Material
In this paper, we consider a path-planning problem for a mobile robot with dynamics given by model (1). Let be a -valued random process representing the robot’s position at time , given by the controlled Ito process:
(1) |
with and . Here, is the velocity input command, is the -dimensional standard Brownian motion, and is a given positive definite matrix used in modeling the process noise intensity. We assume that the robot is commanded to travel at a unit velocity (i.e., ). Let be a partition of , which must not necessarily be of equal spacing. Time discretization of (1) based on the Euler-Maruyama method [19] yields:
(2) |
where and . Introducing a new control input and applying the constraint , (2) can be written as:
(3) |
with . Due to the unit velocity assumption above, the time intervals are determined once the command sequence is formalized. Since the physical times do not play significant roles in our theoretical development in the sequel, it is convenient to rewrite (3) as the main dynamics model of this work:
(4) |
Let the probability distributions of the robot position at a given time step be parametrized by a Gaussian model , where is the nominal position and is the associated covariance matrix (with being the dimension of the configuration space). In this paper, we consider a path-planning framework in which the sequence is scheduled. Following [20, 21], the product space is called the uncertain configuration space. In what follows, the problem of finding the shortest path in the uncertain configuration space with respect to a novel information-theoretic path length function is formulated.
First, an appropriate directed distance function from a point to another is introduced. This function is interpreted as the cost of steering the random state variable to in the next time step under the dynamics provided by (4). In order to implement the rational inattention concept, we formulate this cost as a weighted sum of the control cost and the information cost in achieving each state transition.
II-A Control Cost
The control cost is simply the commanded travel distance in the Euclidean metric:
(5) |
II-B Information Cost
Jointly accounting for both the control efficiency and sensing simplicity in planning necessitates the formulation of a metric that captures the information acquisition cost required for path following. We utilize the information gain (entropy reduction) for this purpose.
Assume that the control input is applied to (4). The propagation of the prior covariance during the movement of the robot, over the time interval , is denoted as . At time , the covariance is “reduced” to by utilizing a sensor input. The minimum information gain (minimum number of bits that must be contained in the sensor data) for this transition is:
(6) |
The notion of an “optimal” sensing signal which reduces to has been previously discussed in [22] in the context of optimal sensing in filtering theory. The information cost function in (6) is well-defined for the pairs satisfying . For those pairs which do not satisfy , we generalize (6) as:
(7) |
Notice that (7) takes a non-negative value for any given transition from an origin to destination . However, (7) is an implicit function involving a convex optimization problem in its expression (more precisely, the max-det problem [23]). To see why (7) is an appropriate generalization of (6), consider a two-step procedure to update the prior covariance to the posterior covariance . In the first step, the uncertainty is “reduced” from to satisfy both and . The associated information gain (the amount of telemetry data) is . In the second step, the covariance is “increased” to . This step incurs no information cost, since the location uncertainty can be increased simply by “deteriorating” the prior knowledge. The max-det problem (7) can then be interpreted as finding the optimal intermediate step which minimizes the information gain in the first step.
II-C Total Cost
The cost to steer a random state variable to is a weighted sum of and . Introducing , the total RI cost is:
(8) |
By increasing , more weight is placed on the amount of information which must be gained compared to the distance traversed. Note that the information cost is an asymmetric function, so that transitioning does not return the same cost as .
III Problem Formulation
Having introduced the RI cost function (8), it is now appropriate to introduce the notion of path length. Let , be a path. The RI length of a path is defined as:
where the supremum is over the space of partitions of . If is differentiable and , then it can be shown that:
III-A Topology on the path space
In this subsection, we introduce a topology for the space of paths , which is necessary to discuss continuity of . The space of all paths can be thought of as a subset (convex cone) of the space of generalized paths . The space of generalized paths is a vector space on which addition and scalar multiplication exist and are defined as and for , respectively. Assuming that a path can be partitioned such that , the variation of a generalized path with respect to the choice of is given by:
where , and . Utilizing the above definition for the variation of a path, the total variation of a generalized path corresponds to the partition which results in the supremum of the variation:
Notice that defines a norm on the space of generalized paths. The following relationship holds between and
Lemma 1
[24, Lemma 13.2] For a given path with partitioning the following inequality holds:
Proof:
See Appendix A for proof. ∎
In what follows, we assume on the space of generalized paths the topology of total variation metric , which is then inherited to the space of paths . We denote by the space of paths such that . In the next subsection, we discuss the continuity of the RI path cost in the space .
III-B Continuity of RI Cost Function
The continuity of RI path cost function plays a critical role in determining the theoretical guarantees we can provide when we use sampling-based algorithms to find the shortest RI path. Specifically, the asymptotic optimality (the convergence to the path with the minimum cost as the number of nodes is increased) of RRT* algorithms [12], the main numerical method we use in this paper, expects the continuity of the path cost function. Showing that the RI cost function (8) is continuous requires additional derivation which is shown via Theorem 1.
Theorem 1
When , the path cost function is continuous in the sense that for every , , and for every , there exists such that
Proof:
See Appendix B for proof. ∎
Before discussing the required modifications for implementing RRT* algorithm with RI cost in Section IV, we first characterize the shortest RI path in the obstacle-free space, and then formally define the shortest RI path problem in obstacle-filled spaces in the following subsections.
III-C Shortest Path in Obstacle-Free Space
In obstacle-free space, it can be shown that the optimal path cost between and is equal to . In other words, the triangular inequality holds. This means it is optimal for the robot to follow the direct path from to without sensing, and then make a measurement at to shrink the uncertainty from to . In what follows, we call such a motion plan the “move-and-sense” strategy. The optimality of the move-and-sense path for one-dimensional geometric space is shown in Appendix C. We confirm this optimality by simulation in Section V-A, where the move-and-sense path is the wedge-shaped path depicted in Fig. 3 (a).
III-D Shortest Path Formulation
The utility of path-planning algorithms is made non-trivial by the introduction of obstacles in the path space. Let be a closed subset of spatial points representing obstacles. The initial configuration of the robot is defined as , while is a given closed subset representing the target region which the robot desires to attain. Given a confidence level parameter , the shortest RI path problem can be formulated as:
(9) |
The term in the constraints of (9) is implemented to provide a confidence bound on probability that a robot with position will not be in contact with an obstacle .
IV RI-RRT* Algorithm
IV-A RRT*
The RRT algorithm [25] constructs a tree of nodes (state realizations) through random sampling of the feasible state-space and then connects these nodes with edges (tree branches). A user-defined cost is utilized to quantify the length of the edges, which are in turn summed to form path lengths. Each new node is connected via a permanent edge to the existing node which provides the shortest path between the new node and the initial node of the tree. Although the RRT algorithm is known to be probabilistically complete (the algorithm finds a feasible path if one exists), it does not achieve asymptotic optimality (path cost does not converge to the optimal one as the number of nodes is increased) [12]. The RRT* algorithm [12] attains asymptotic optimality by including an additional “re-wiring” step that re-evaluates if the path length for each node can be reduced via a connection to the newly created node. This paper utilizes RRT* as a numerical approach to the shortest path problem (9).
IV-B Algorithm
Provided below is a Rationally Inattentive RRT* (RI-RRT*) algorithm for finding a solution to (9). Like the original RRT* algorithm, the RI-RRT* algorithm constructs a graph of state nodes and edges () in spaces with or without obstacles.
In Algorithm 1, the function creates a new point by randomly sampling a spatial location () and covariance (). Notice that for a -dimensional configuration space, the corresponding uncertain configuration space has dimensions from which the samples are generated. The function finds the nearest point (), in metric , between the newly generated state and an existing state in the set . Using the metric , the function linearly shifts the generated point () to a new location as:
where is a user-defined constant. In addition to generating , the Scale function also ensures that its covariance region does not interfere with any obstacles.

The function ensures that transition from state to does not intersect with the obstacles. More precisely, we assume the transition follows the move-and-sense path, introduced in Section III-C, and the ObsCheck function returns if all state pairs along the move-and-sense path, shown by blue ellipses in Fig. 2, has covariance regions that are non-interfering with obstacles.
The function returns the sub-set of nodes described as . This set is then evaluated for the presence of obstacles via the ObsCheck function of the previous paragraph. Note that in this instance, the function is evaluating obstacle interference along the continuous path of state-covariance pairs from . Lines 14-17 of Algorithm 1 connect the new node to the existing graph in an identical manner to RRT*, where the denote the cost of the path from the to node through the edges of . Line 18 creates a new edge between the new node and the existing nodes from the neighbor group which results in the minimum . The calculation of RI path cost in Line 14 utilizes (8).
Lines 19-24 are the tree re-wiring steps of Algorithm 1. In line 20, the ObsCheck function is called again. This is because the move-and-sense path is direction-dependent, and thus does not necessarily imply . Finally, for each rewired node , its cost (i.e., ) and the cost of its descendants are updated via function in line 25.
To increase the computational efficiency of RI-RRT* algorithm we deploy a branch-and-bound technique as detailed in [26]. For a given tree , let be the node that has the lowest cost along the nodes of within . As discussed in Section III-C, is a lower-bound for the cost of transitioning from to . The branch-and-bound algorithm periodically deletes the nodes . This elimination of the non-optimal nodes speeds up the RI-RRT* algorithm.
IV-C Properties of RI-RRT*
The question regarding the asymptotic optimality of RI-RRT* naturally arises. Recall that the proof of the asymptotic optimality of the RRT* algorithm [27] is founded on four main assumptions:
-
1.
additivity of the cost function,
-
2.
the cost function is monotonic,
-
3.
there exists a finite distance between all points on the optimal path and the obstacle space,
- 4.
The proofs of the first three assumptions are trivial for the RI cost (8). However, Theorem 1 does not suffice to guarantee that the RI cost meets the fourth condition for . For this reason, currently the asymptotic optimality of the RI-RRT* algorithm cannot be guaranteed, while the numerical simulations of Section V do show that the proposed algorithm does have merit in rationally inattentive path-planning.
V Simulation Results
V-A One-Dimensional Simulation


The first study is the case of a robot which is allowed to travel at a constant velocity in a one-dimensional geometric space from a predetermined initial position and covariance , specified by the red dot in Fig. 3 (a). The robot has a goal of reaching some final state within the blue box representing a goal region which is a sub-set of the reachable space. Note that the goal region contains acceptable bounds on both location and uncertainty. Although, in one-dimensional setting, the strategy which minimizes the control cost is obviously the one that moves directly toward the target region, we utilize the RI-RRT* algorithm to solve the non-trivial measurement scheduling problem.
In Fig. 3 (a), the blue curve represents the path generated by the RI-RRT* algorithm with nodes, which is sufficiently close to the shortest path obtainable via the RI-distance depicted as the red curve. These wedge-shaped optimal paths are created by the “move-and-sense” strategy integrated in the RI-cost, where it has a section of covariance propagation followed by an instantaneous reduction of covariance as discussed in Section III-C. For example, if the robot were an autonomous ground vehicle with GPS capabilities, then this path signifies the robot driving the total distance without any GPS updates, followed by a reduction its spatial uncertainty with a single update once the goal region is reached. The minimum path cost at the end of each iteration of the for-loop in Algorithm 1 is depicted in Fig. 3 (b), where the red curve represents the average of independent simulations. The path cost of each simulation approaches to the optimal cost.
V-B Two-Dimensional Asymmetric Simulation

The asymmetric characteristic of the RI-cost is demonstrated via a simulation in the two-dimensional configuration space with a diagonal wall, as seen in Fig. 4. The initial position and covariance of the robot is depicted as a red dot, while the target region is illustrated as the black rectangle at the upper-right corner.
The path is generated by the RI-RRT* algorithm by sampling nodes. The corresponding sampled covariance ellipses are shown in black where the blue ellipses represent covariance propagation. As shown in Fig. 4, there are two options; path A requires the robot to move into a funnel-shaped corridor, while the path B moves out of a funnel. In this setting, the RI-cost prefers the path B even though both A and B have the same Euclidean distance. This asymmetric behavior results from the fact that as the robot approaches the goal region path B requires a less severe uncertainty reduction compared to path A. Similarly, by exchanging the start and goal positions, the RI-cost prefers path A over path B, thus displaying the directional dependency of our efficient sensing strategy.
V-C Two-Dimensional Simulation with Multiple Obstacles



In a final demonstration, the RI-RRT* algorithms with and are implemented in a two-dimensional configuration space containing multiple obstacles in order to illustrate the effects of varying the information cost. All three paths in panels of Fig. 5 are generated from 4,000 nodes.
As seen in Fig. 5 (a), when the algorithm simply finds a path that has the shortest Euclidean distance, even if that path requires frequent sensing actions. In contrast, the RI-RRT* algorithm with does not take a constrained pathway, and thus requires fewer sensor actuations in order to avoid obstacle collisions. As a result, the algorithm deviates from the shortest Euclidean distance path and allows the covariance to propagate safely, as seen in Fig. 5 (c). A moderate path, illustrated in Fig. 5 (b), can be also obtained by choosing .
VI Conclusion
In this work, a novel RI cost for utilization in path-planning algorithms is presented. The cost accounts for both the path distance traversed (efficiency) and the amount of information which must be perceived by the robot. Information gained from perception is important in that it allows the robot to safely navigate obstacle-filled environments with confidence that collisions will be avoided. This method, in balancing path distance and perception costs, provides a simplicity-based path which can be tailored to mimic the results potentially generated by an expert human path-planner. Three numerical simulations were provided demonstrating these results.
Currently, the preliminary version of the RI-RRT* algorithm is optimized for computational efficiency by the aid of a branch-and-bound technique. The authors note that utilizing other RRT* improvement methods, such as k-d trees could further improve the computational speed of Algorithm 1 and should be considered in future work. In the same vein of future work, the authors note the importance of quantifying the impact that the user-defined constants, such as the distances which signify which nodes are neighbors, have on the results of the RI-RRT* algorithm. Also, the topic of path refinement should be further explored as RRT*-like algorithms converge asymptotically.
It should be noted that once the RI-RRT* algorithm finds an initial feasible path, there exist iterative methods for path “smoothing” which do not require additional node sampling. In a theorized hybrid method, RI-RRT* is first utilized to find some initial path and then an iterative method takes the initial path and “smooths” towards the optimal path. Convergence benefits of iterative methods are often improved as the initial path guess is more similar to the optimal path, and a trade-off could be found in computational efficiency which results in the best time to switch between algorithms.
Appendix A Proof of Lemma 1
For every , set a partition . Then
Appendix B Proof of Theorem 1
The proof is based on the following lemma:
Lemma 2
Assume . For each satisfying , there exists a constant such that the inequality:
holds for all
such that
Proof:
For simplicity, we assume , but the extension of the following proof to general cases is straightforward. In what follows, we write
We consider four different cases depending on the signs of and .
Case 1: First, we consider the case with
In this case:
(10) |
Using the fact that for , we have:
Noticing that the arguments in the logarithmic terms in (10) are , and using the fact that , we have:
Therefore,
(11) |
Case 2: Next, we consider the case with
(12a) | ||||
(12b) |
Notice that (12b) implies . In this case:
(13a) | |||
(13b) | |||
(13c) | |||
(13d) | |||
(13e) | |||
(13f) | |||
(13g) |
In step (13b), we have used the fact that the difference between the two logarithmic terms is positive, because of the hypothesis (12a). In step (13c), we used the fact that , for .
Case 3: Next, we consider the case with
(14a) | ||||
(14b) |
The first hypothesis (14a) implies:
(15) |
Using
one can deduce from (10) that:
(16) |
This results in:
(17a) | |||
(17b) | |||
(17c) | |||
(17d) |
In (17b) we used the fact that the difference between the two logarithmic terms is positive. In (17c) we used the fact that , for . Finally, the inequality (16) was used in step (17d).
Proof of Theorem 1
Suppose and . In what follows, we consider the choice:
(19) |
Since , we have . In particular, for each , we have and . Moreover:
Therefore, , we have both and . Let be a partition and define:
For any partition , the following chain of inequalities holds:
(20a) | |||
(20b) | |||
(20c) | |||
(20d) | |||
(20e) | |||
(20f) | |||
(20g) | |||
(20h) |
The inequality (20c) follows from Lemma 2. Let and be sequences of partitions such that:
(21) |
and let be the sequence of partitions such that for each , is a common refinement of and . Since both
hold for each , (21) implies
(22) |
Now, since the chain of inequalities (20) holds for any partitions,
holds for all . This results in:
(23) |
Appendix C One-Dimensional Problem Optimal Path
Consider taking the single perception optimal path in Fig. 3 (a):
and dividing it into the combination of two sub-paths :
where is a constant which denotes where in the additional sensing action takes place. The combination of the divided sub-paths have in the same initial () and ending () states as the original path, but also achieve an intermediate state ().
Path has a total RI cost:
where, . Likewise, the path has a length which is the summation of two information gains and while transitioning the same distance as .
By comparing the costs between and , it is possible to achieve:
where . Last inequality follows the facts that and is a decreasing function ().
References
- [1] S. Pendleton, H. Andersen, X. Du, X. Shen, M. Meghjani, Y. Eng, D. Rus, and M. Ang, “Perception, planning, control, and coordination for autonomous vehicles,” Machines, vol. 5, no. 1, p. 6, 2017.
- [2] M. Pfeiffer, M. Schaeuble, J. Nieto, R. Siegwart, and C. Cadena, “From perception to decision: A data-driven approach to end-to-end motion planning for autonomous ground robots,” in Proc. IEEE Int. Conf. Robot. Autom., 2017, pp. 1527–1533.
- [3] L. Carlone and S. Karaman, “Attention and anticipation in fast visual-inertial navigation,” IEEE Trans. Robot., vol. 35, no. 1, pp. 1–20, 2019.
- [4] R. Alterovitz, S. Koenig, and M. Likhachev, “Robot planning in the real world: research challenges and opportunities,” AI Magazine, vol. 37, no. 2, pp. 76–84, 2016.
- [5] Y. Kuwata, J. Teo, S. Karaman, G. Fiore, E. Frazzoli, and J. How, “Motion planning in complex environments using closed-loop prediction,” AIAA Guid. Navi. Control Conf. Exhibit, 2008.
- [6] J. Van Den Berg, P. Abbeel, and K. Goldberg, “LQG-MP: Optimized path planning for robots with motion uncertainty and imperfect state information,” Int. J. Robot. Res., vol. 30, no. 7, pp. 895–913, 2011.
- [7] A.-A. Agha-Mohammadi, S. Chakravorty, and N. M. Amato, “FIRM: Sampling-based feedback motion-planning under motion uncertainty and imperfect measurements,” Int. J. Robot. Res., vol. 33, no. 2, pp. 268–304, 2014.
- [8] C. A. Sims, “Implications of rational inattention,” J. Monetary Economics, vol. 50, no. 3, pp. 665–690, 2003.
- [9] E. Shafieepoorfard, M. Raginsky, and S. P. Meyn, “Rationally inattentive control of markov processes,” SIAM J. Control Optimization, vol. 54, no. 2, pp. 987–1016, 2016.
- [10] E. Shafieepoorfard and M. Raginsky, “Rational inattention in scalar lqg control,” in Proc. Conf. Decision Control, 2013, pp. 5733–5739.
- [11] S. M. LaValle, Planning algorithms. Cambridge University Press, 2006.
- [12] S. Karaman and E. Frazzoli, “Incremental sampling-based algorithms for optimal motion planning,” in Proc. Robot.: Sci. Syst., 2010.
- [13] J. J. Marquez and M. L. Cummings, “Design and evaluation of path planning decision support for planetary surface exploration,” J. Aerosp. Comput., Info., Comm., vol. 5, no. 3, pp. 57–71, 2008.
- [14] Y. K. Hwang and N. Ahuja, “A potential field approach to path planning,” IEEE Trans. Robot. Autom., vol. 8, no. 1, pp. 23–32, 1992.
- [15] S. Kambhampati and L. Davis, “Multiresolution path planning for mobile robots,” IEEE J. Robot. Autom., vol. 2, no. 3, pp. 135–145, 1986.
- [16] F. Hauer, A. Kundu, J. M. Rehg, and P. Tsiotras, “Multi-scale perception and path planning on probabilistic obstacle maps,” in Proc. IEEE Int. Conf. Robot. Autom., 2015, pp. 4210–4215.
- [17] A. Lambert and D. Gruyer, “Safe path planning in an uncertain-configuration space,” in Proc. IEEE Int. Conf. Robot. Autom., vol. 3, 2003, pp. 4185–4190.
- [18] R. Pepy and A. Lambert, “Safe path planning in an uncertain-configuration space using RRT,” in Proc. IEEE/RSJ Int. Conf. Intell. Robots Syst., 2006, pp. 5376–5381.
- [19] P. Kloeden and E. Platen, Numerical Solution of Stochastic Differential Equations. Berlin: Springer, 1992.
- [20] A. Lambert and D. Gruyer, “Safe path planning in an uncertain-configuration space,” in Proc. IEEE Int. Conf. Robot. Autom., vol. 3, 2003, pp. 4185–4190.
- [21] R. Pepy and A. Lambert, “Safe path planning in an uncertain-configuration space using RRT,” in Proc. IEEE/RSJ Int. Conf. Intell. Robots Syst., 2006, pp. 5376–5381.
- [22] T. Tanaka, K.-K. Kim, P. A. Parrilo, and S. K. Mitter, “Semidefinite programming approach to Gaussian sequential rate-distortion trade-offs,” IEEE Trans. Automat. Control, vol. 62, no. 4, pp. 1896–1910, 2016.
- [23] L. Vandenberghe, S. Boyd, and S.-P. Wu, “Determinant maximization with linear matrix inequality constraints,” SIAM J. Matrix Analysis and Applications, vol. 19, no. 2, pp. 499–533, 1998.
- [24] N. L. Carothers, Real analysis. Cambridge University Press, 2000.
- [25] S. M. LaValle and J. J. Kuffner, “Randomized kinodynamic planning,” Int. J. Robot. Res., vol. 20, no. 5, pp. 378–400, May 2001.
- [26] S. Karaman, M. R. Walter, A. Perez, E. Frazzoli, and S. Teller, “Anytime motion planning using the RRT,” in Proc. IEEE Int. Conf. Robot. Autom., 2011, pp. 1478–1483.
- [27] S. Karaman and E. Frazzoli, “Sampling-based algorithms for optimal motion planning,” Int. J. Robot. Res., vol. 30, no. 7, pp. 846–894, 2011.