본문

서브메뉴

Efficient and Reconfigurable Approximate Value Functions for Task Scheduling, Path Planning, and Control.
Содержание
Efficient and Reconfigurable Approximate Value Functions for Task Scheduling, Path Planning, and Control.
자료유형  
 학위논문
Control Number  
0017162958
International Standard Book Number  
9798384340515
Dewey Decimal Classification Number  
330
Main Entry-Personal Name  
Washington, Patrick Henry.
Publication, Distribution, etc. (Imprint  
[S.l.] : Stanford University., 2024
Publication, Distribution, etc. (Imprint  
Ann Arbor : ProQuest Dissertations & Theses, 2024
Physical Description  
149 p.
General Note  
Source: Dissertations Abstracts International, Volume: 86-03, Section: A.
General Note  
Includes supplementary digital materials.
General Note  
Advisor: Schwager, Mac.
Dissertation Note  
Thesis (Ph.D.)--Stanford University, 2024.
Summary, Etc.  
요약Task scheduling, path planning, and control are all problems in robotics that involve choosing the best action to take given the current state of the system. For task scheduling, this means observing the condition of the robots and deciding which robot should do which task and when. Path planning involves choosing how the robot should navigate the space and control deals with choosing inputs to best follow that plan. The concept that links these ideas is that of a value function. This thesis addresses two key challenges in value functions, efficiency and reconfigurability.First, when dealing with teams of robots, the number of state variables grows quickly and we run into what is known as the curse of dimensionality, where traditional methods of dynamic programming have trouble finding exact solutions because of the number of possible states in the system. This is exemplified in task scheduling problems, where there are teams of robots that need to manage various tasks. The task scheduling problem presented in this thesis is the persistent surveillance problem. It involves several robots deciding when to charge while maintaining surveillance coverage over a region. With each robot having a position and battery level along with the surveillance position moving over time, it is intractable to find an exact value function and corresponding policy. We first demonstrate a modified Monte Carlo Tree Search algorithm that estimates the value of actions through model predictive control ideas. This method uses the idea that in a scenario where most action sequences fail, searching for any successful performance can outperform searching for the best expected performance. We then present Reduced State Value Iteration, an algorithm that builds off of other approximate value iteration ideas to simplify the decision making state space and efficiently solve for an approximate value function. It leverages knowledge of the problem's structure to vastly reduce the number of states relevant to decision making.Second, many algorithms that develop value functions for path planning and control suffer from an inability to be reconfigured in the event of a change to the problem. Sometimes, this comes in the form of finding a single trajectory from predefined start and end points and building a feedback law to follow that trajectory. However, if either point changes or the robot strays too far from the desired trajectory, the plan is useless and needs to be recomputed. On the other hand, algorithms that solve for feedback policies over the entire state space are generally slow and unable to adapt to new state or control constraints, requiring extensive recomputation. We introduce GrAVITree, a graph-based planning algorithm that builds a value function and feedback policy for simultaneous path planning and control. It samples backwards in time from the goal and maintains a graph that stores state and control information. One advantage of this is that by sampling backwards in time to branch, we only explore regions of the state space that can actually reach the goal, rather than solving over the whole state space. By storing the graph, we can also easily change the goal, the cost function, and the constraints after solving by editing the graph and reselecting the optimal edges to account for the new changes, thus it is a reconfigurable approximate value function. Another key feature is that we build the graph one step at a time, bypassing the need for a complex local controller to connect points in the graph, instead using derivative-free sampling methods to make local connections. This enables us to use black box dynamics models, such as those represented by trained neural networks, where we are unable to leverage the structure of the model to aid in building the value function.Finally, we introduce a method that augments GrAVITree by determining the valid region of the dynamics model for image-based systems. Such systems use autoencoders to map image outputs into low-dimensional latent states. These state spaces are convenient due to their dimension but are not easily interpreted by humans. As such, there is not always a natural way to determine what latent states are actually valid and do not violate a constraint or correspond to an image.
Subject Added Entry-Topical Term  
Aircraft.
Subject Added Entry-Topical Term  
Scheduling.
Subject Added Entry-Topical Term  
Dynamic programming.
Subject Added Entry-Topical Term  
Planning.
Subject Added Entry-Topical Term  
Decision making.
Subject Added Entry-Topical Term  
Neural networks.
Subject Added Entry-Topical Term  
Robots.
Subject Added Entry-Topical Term  
Military bases.
Subject Added Entry-Topical Term  
Drones.
Subject Added Entry-Topical Term  
Surveillance.
Subject Added Entry-Topical Term  
National parks.
Subject Added Entry-Topical Term  
Robotics.
Subject Added Entry-Topical Term  
Computer science.
Subject Added Entry-Topical Term  
Military studies.
Added Entry-Corporate Name  
Stanford University.
Host Item Entry  
Dissertations Abstracts International. 86-03A.
Electronic Location and Access  
로그인을 한후 보실 수 있는 자료입니다.
Control Number  
joongbu:657367
New Books MORE
최근 3년간 통계입니다.

Подробнее информация.

  • Бронирование
  • 캠퍼스간 도서대출
  • 서가에 없는 책 신고
  • моя папка
материал
Reg No. Количество платежных Местоположение статус Ленд информации
TQ0033544 T   원문자료 열람가능/출력가능 열람가능/출력가능
마이폴더 부재도서신고

* Бронирование доступны в заимствований книги. Чтобы сделать предварительный заказ, пожалуйста, нажмите кнопку бронирование

해당 도서를 다른 이용자가 함께 대출한 도서

Related books

Related Popular Books

도서위치