본문

서브메뉴

Optimization Foundations of Reinforcement Learning
Optimization Foundations of Reinforcement Learning

상세정보

자료유형  
 학위논문
Control Number  
0015760961
International Standard Book Number  
9798672189628
Dewey Decimal Classification Number  
658
Main Entry-Personal Name  
Bhandari, Jalaj.
Publication, Distribution, etc. (Imprint  
[Sl] : Columbia University, 2020
Publication, Distribution, etc. (Imprint  
Ann Arbor : ProQuest Dissertations & Theses, 2020
Physical Description  
205 p
General Note  
Source: Dissertations Abstracts International, Volume: 82-04, Section: B.
General Note  
Advisor: Russo, Daniel.
Dissertation Note  
Thesis (Ph.D.)--Columbia University, 2020.
Restrictions on Access Note  
This item must not be sold to any third party vendors.
Subject Added Entry-Topical Term  
Operations research
Subject Added Entry-Topical Term  
Computer science
Subject Added Entry-Topical Term  
Statistics
Index Term-Uncontrolled  
Dynamic programming
Index Term-Uncontrolled  
Markov decision process
Index Term-Uncontrolled  
Optimization
Index Term-Uncontrolled  
Policy gradients
Index Term-Uncontrolled  
Reinforcement learning
Index Term-Uncontrolled  
Temporal difference learning
Added Entry-Corporate Name  
Columbia University Operations Research
Host Item Entry  
Dissertations Abstracts International. 82-04B.
Host Item Entry  
Dissertation Abstract International
Electronic Location and Access  
로그인을 한후 보실 수 있는 자료입니다.
Control Number  
joongbu:588748

MARC

 008210127s2020                                          c    eng  d
■001000015760961
■00520210215113541
■020    ▼a9798672189628
■035    ▼a(MiAaPQ)AAI28095595
■040    ▼aMiAaPQ▼cMiAaPQ
■0820  ▼a658
■1001  ▼aBhandari,  Jalaj.
■24510▼aOptimization  Foundations  of  Reinforcement  Learning
■260    ▼a[Sl]▼bColumbia  University▼c2020
■260  1▼aAnn  Arbor▼bProQuest  Dissertations  &  Theses▼c2020
■300    ▼a205  p
■500    ▼aSource:  Dissertations  Abstracts  International,  Volume:  82-04,  Section:  B.
■500    ▼aAdvisor:  Russo,  Daniel.
■5021  ▼aThesis  (Ph.D.)--Columbia  University,  2020.
■506    ▼aThis  item  must  not  be  sold  to  any  third  party  vendors.
■590    ▼aSchool  code:  0054.
■650  4▼aOperations  research
■650  4▼aComputer  science
■650  4▼aStatistics
■653    ▼aDynamic  programming
■653    ▼aMarkov  decision  process
■653    ▼aOptimization
■653    ▼aPolicy  gradients
■653    ▼aReinforcement  learning
■653    ▼aTemporal  difference  learning
■690    ▼a0796
■690    ▼a0984
■690    ▼a0463
■71020▼aColumbia  University▼bOperations  Research.
■7730  ▼tDissertations  Abstracts  International▼g82-04B.
■773    ▼tDissertation  Abstract  International
■790    ▼a0054
■791    ▼aPh.D.
■792    ▼a2020
■793    ▼aEnglish
■85640▼uhttp://www.riss.kr/pdu/ddodLink.do?id=T15760961▼nKERIS▼z이  자료의  원문은  한국교육학술정보원에서  제공합니다.
■980    ▼a202102▼f2021

미리보기

내보내기

chatGPT토론

Ai 추천 관련 도서


    신착도서 더보기
    최근 3년간 통계입니다.

    소장정보

    • 예약
    • 캠퍼스간 도서대출
    • 서가에 없는 책 신고
    • 나의폴더
    소장자료
    등록번호 청구기호 소장처 대출가능여부 대출정보
    TQ0010257 T   원문자료 열람가능/출력가능 열람가능/출력가능
    마이폴더 부재도서신고

    * 대출중인 자료에 한하여 예약이 가능합니다. 예약을 원하시면 예약버튼을 클릭하십시오.

    해당 도서를 다른 이용자가 함께 대출한 도서

    관련도서

    관련 인기도서

    도서위치