본문

서브메뉴

Leveraging Cross-Task Transfer in Sequential Decision Problems.
Inhalt Info
Leveraging Cross-Task Transfer in Sequential Decision Problems.
자료유형  
 학위논문
Control Number  
0017162492
International Standard Book Number  
9798383057247
Dewey Decimal Classification Number  
004
Main Entry-Personal Name  
Zentner, K. R.
Publication, Distribution, etc. (Imprint  
[S.l.] : University of Southern California., 2024
Publication, Distribution, etc. (Imprint  
Ann Arbor : ProQuest Dissertations & Theses, 2024
Physical Description  
111 p.
General Note  
Source: Dissertations Abstracts International, Volume: 85-12, Section: B.
General Note  
Advisor: Sukhatme, Gaurav S.
Dissertation Note  
Thesis (Ph.D.)--University of Southern California, 2024.
Summary, Etc.  
요약The past few years have seen an explosion of interest in using machine learning to make robots capable of learning a diverse set of tasks. These robots use Reinforcement Learning to learn detailed sub-second interactions, but consequently require large amounts of data for each task. In this thesis we explore how Reinforcement Learning can be combined with Transfer Learning to re-use data across tasks. We begin by reviewing the start of Multi-Task and Meta RL and describe the motivations for using Transfer Learning. Then, we describe a basic framework for using Transfer Learning to efficiently learn multiple tasks, and show how it requires predicting how effectively transfer can be performed across tasks. Next, we present a simple rule, based in information theory, for predicting the effectiveness of Cross-Task Transfer. We discuss the theoretical implications of that rule, and show various quantitative evaluations of it. Then, we show two directions of work making use of our insights to perform efficient Transfer Reinforcement Learning. The first of these directions uses Cross-Task Co-Learning and Plan Conditioned Behavioral Cloning to share skill representations produced by a Large Language Model, and it able to learn many tasks from a single demonstration each in a simulated environment. The second of these directions uses Two-Phase KL Penalization to enforce a (potentially off-policy) trust region. These advances in Transfer RL may enable robots to be used in a wider range of applications, and may also inform applying Transfer RL outside of robotics.
Subject Added Entry-Topical Term  
Computer science.
Subject Added Entry-Topical Term  
Robotics.
Subject Added Entry-Topical Term  
Information technology.
Index Term-Uncontrolled  
Imitation learning
Index Term-Uncontrolled  
Large Language Models
Index Term-Uncontrolled  
Machine learning
Index Term-Uncontrolled  
Reinforcement learning
Index Term-Uncontrolled  
Transfer learning
Added Entry-Corporate Name  
University of Southern California Computer Science
Host Item Entry  
Dissertations Abstracts International. 85-12B.
Electronic Location and Access  
로그인을 한후 보실 수 있는 자료입니다.
Control Number  
joongbu:657165
New Books MORE
최근 3년간 통계입니다.

Buch Status

  • Reservierung
  • 캠퍼스간 도서대출
  • 서가에 없는 책 신고
  • Meine Mappe
Sammlungen
Registrierungsnummer callnumber Standort Verkehr Status Verkehr Info
TQ0033386 T   원문자료 열람가능/출력가능 열람가능/출력가능
마이폴더 부재도서신고

* Kredite nur für Ihre Daten gebucht werden. Wenn Sie buchen möchten Reservierungen, klicken Sie auf den Button.

해당 도서를 다른 이용자가 함께 대출한 도서

Related books

Related Popular Books

도서위치