Joongbu University Library

Detailed Information

Optimization Foundations of Reinforcement Learning

Material Type: Thesis/Dissertation

Control Number: 0015760961

International Standard Book Number: 9798672189628

Dewey Decimal Classification Number: 658

Main Entry-Personal Name: Bhandari, Jalaj.

Publication, Distribution, etc. (Imprint): [Sl] : Columbia University, 2020

Publication, Distribution, etc. (Imprint): Ann Arbor : ProQuest Dissertations & Theses, 2020

Physical Description: 205 p

General Note: Source: Dissertations Abstracts International, Volume: 82-04, Section: B.

General Note: Advisor: Russo, Daniel.

Dissertation Note: Thesis (Ph.D.)--Columbia University, 2020.

Restrictions on Access Note: This item must not be sold to any third party vendors.

Subject Added Entry-Topical Term: Operations research

Subject Added Entry-Topical Term: Computer science

Subject Added Entry-Topical Term: Statistics

Index Term-Uncontrolled: Dynamic programming

Index Term-Uncontrolled: Markov decision process

Index Term-Uncontrolled: Optimization

Index Term-Uncontrolled: Policy gradients

Index Term-Uncontrolled: Reinforcement learning

Index Term-Uncontrolled: Temporal difference learning

Added Entry-Corporate Name: Columbia University Operations Research

Host Item Entry: Dissertations Abstracts International. 82-04B.

Host Item Entry: Dissertation Abstract International

Electronic Location and Access: This item is available after logging in.

Control Number: joongbu:588748

■008210127s2020                                          c    eng  d
■001000015760961
■00520210215113541
■020    ▼a9798672189628
■035    ▼a(MiAaPQ)AAI28095595
■040    ▼aMiAaPQ▼cMiAaPQ
■0820  ▼a658
■1001  ▼aBhandari,  Jalaj.
■24510▼aOptimization  Foundations  of  Reinforcement  Learning
■260    ▼a[Sl]▼bColumbia  University▼c2020
■260  1▼aAnn  Arbor▼bProQuest  Dissertations  &  Theses▼c2020
■300    ▼a205  p
■500    ▼aSource:  Dissertations  Abstracts  International,  Volume:  82-04,  Section:  B.
■500    ▼aAdvisor:  Russo,  Daniel.
■5021  ▼aThesis  (Ph.D.)--Columbia  University,  2020.
■506    ▼aThis  item  must  not  be  sold  to  any  third  party  vendors.
■590    ▼aSchool  code:  0054.
■650  4▼aOperations  research
■650  4▼aComputer  science
■650  4▼aStatistics
■653    ▼aDynamic  programming
■653    ▼aMarkov  decision  process
■653    ▼aOptimization
■653    ▼aPolicy  gradients
■653    ▼aReinforcement  learning
■653    ▼aTemporal  difference  learning
■690    ▼a0796
■690    ▼a0984
■690    ▼a0463
■71020▼aColumbia  University▼bOperations  Research.
■7730  ▼tDissertations  Abstracts  International▼g82-04B.
■773    ▼tDissertation  Abstract  International
■790    ▼a0054
■791    ▼aPh.D.
■792    ▼a2020
■793    ▼aEnglish
■85640▼uhttp://www.riss.kr/pdu/ddodLink.do?id=T15760961▼nKERIS▼zThe  full  text  of  this  item  is  provided  by  the  Korea  Education  and  Research  Information  Service.
■980    ▼a202102▼f2021

Holdings
Registration No.	Call No.	Location	Loan Availability	Loan Information
TQ0010257	T	Full-text material	Viewable/Printable	Viewable/Printable
