서브메뉴
검색
Optimization Foundations of Reinforcement Learning
Optimization Foundations of Reinforcement Learning
상세정보
- 자료유형
- 학위논문
- Control Number
- 0015760961
- International Standard Book Number
- 9798672189628
- Dewey Decimal Classification Number
- 658
- Main Entry-Personal Name
- Bhandari, Jalaj.
- Publication, Distribution, etc. (Imprint
- [Sl] : Columbia University, 2020
- Publication, Distribution, etc. (Imprint
- Ann Arbor : ProQuest Dissertations & Theses, 2020
- Physical Description
- 205 p
- General Note
- Source: Dissertations Abstracts International, Volume: 82-04, Section: B.
- General Note
- Advisor: Russo, Daniel.
- Dissertation Note
- Thesis (Ph.D.)--Columbia University, 2020.
- Restrictions on Access Note
- This item must not be sold to any third party vendors.
- Subject Added Entry-Topical Term
- Operations research
- Subject Added Entry-Topical Term
- Computer science
- Subject Added Entry-Topical Term
- Statistics
- Index Term-Uncontrolled
- Dynamic programming
- Index Term-Uncontrolled
- Markov decision process
- Index Term-Uncontrolled
- Optimization
- Index Term-Uncontrolled
- Policy gradients
- Index Term-Uncontrolled
- Reinforcement learning
- Index Term-Uncontrolled
- Temporal difference learning
- Added Entry-Corporate Name
- Columbia University Operations Research
- Host Item Entry
- Dissertations Abstracts International. 82-04B.
- Host Item Entry
- Dissertation Abstract International
- Electronic Location and Access
- 로그인을 한후 보실 수 있는 자료입니다.
- Control Number
- joongbu:588748
MARC
008210127s2020 c eng d■001000015760961
■00520210215113541
■020 ▼a9798672189628
■035 ▼a(MiAaPQ)AAI28095595
■040 ▼aMiAaPQ▼cMiAaPQ
■0820 ▼a658
■1001 ▼aBhandari, Jalaj.
■24510▼aOptimization Foundations of Reinforcement Learning
■260 ▼a[Sl]▼bColumbia University▼c2020
■260 1▼aAnn Arbor▼bProQuest Dissertations & Theses▼c2020
■300 ▼a205 p
■500 ▼aSource: Dissertations Abstracts International, Volume: 82-04, Section: B.
■500 ▼aAdvisor: Russo, Daniel.
■5021 ▼aThesis (Ph.D.)--Columbia University, 2020.
■506 ▼aThis item must not be sold to any third party vendors.
■590 ▼aSchool code: 0054.
■650 4▼aOperations research
■650 4▼aComputer science
■650 4▼aStatistics
■653 ▼aDynamic programming
■653 ▼aMarkov decision process
■653 ▼aOptimization
■653 ▼aPolicy gradients
■653 ▼aReinforcement learning
■653 ▼aTemporal difference learning
■690 ▼a0796
■690 ▼a0984
■690 ▼a0463
■71020▼aColumbia University▼bOperations Research.
■7730 ▼tDissertations Abstracts International▼g82-04B.
■773 ▼tDissertation Abstract International
■790 ▼a0054
■791 ▼aPh.D.
■792 ▼a2020
■793 ▼aEnglish
■85640▼uhttp://www.riss.kr/pdu/ddodLink.do?id=T15760961▼nKERIS▼z이 자료의 원문은 한국교육학술정보원에서 제공합니다.
■980 ▼a202102▼f2021