서브메뉴
검색
Towards Robust and Scalable Large Language Models- [electronic resource]
Towards Robust and Scalable Large Language Models- [electronic resource]
- 자료유형
- 학위논문
- Control Number
- 0016932482
- International Standard Book Number
- 9798380877077
- Dewey Decimal Classification Number
- 004
- Main Entry-Personal Name
- Jain, Paras Jagdish.
- Publication, Distribution, etc. (Imprint
- [S.l.] : University of California, Berkeley., 2023
- Publication, Distribution, etc. (Imprint
- Ann Arbor : ProQuest Dissertations & Theses, 2023
- Physical Description
- 1 online resource(92 p.)
- General Note
- Source: Dissertations Abstracts International, Volume: 85-06, Section: B.
- General Note
- Advisor: Stoica, Ion;Gonzalez, Joseph E.
- Dissertation Note
- Thesis (Ph.D.)--University of California, Berkeley, 2023.
- Restrictions on Access Note
- This item must not be sold to any third party vendors.
- Summary, Etc.
- 요약This dissertation addresses two significant challenges of large language models (LLMs): robustness and scalability. Firstly, we focus on improving large language model robustness through the lens of learning code representations. I highlight our work on ContraCode which learns representations of code that are robust to label-preserving edits. Secondly, we tackle scalability challenges from a systems perspective. We present Checkmate, a system to support training models beyond GPU memory capacity limits through optimal rematerialization. Furthermore, Skyplane, a system that optimizes bulk data transfers between cloud object stores, enables training models on larger pre-training datasets in the cloud. Together, these contributions present a roadmap for enhancing the robustness and scalability of large language models.
- Subject Added Entry-Topical Term
- Computer science.
- Subject Added Entry-Topical Term
- Computer engineering.
- Index Term-Uncontrolled
- Cloud computing
- Index Term-Uncontrolled
- Large language models
- Index Term-Uncontrolled
- Natural language processing
- Index Term-Uncontrolled
- Robustness
- Index Term-Uncontrolled
- Scalability
- Added Entry-Corporate Name
- University of California, Berkeley Electrical Engineering & Computer Sciences
- Host Item Entry
- Dissertations Abstracts International. 85-06B.
- Host Item Entry
- Dissertation Abstract International
- Electronic Location and Access
- 로그인을 한후 보실 수 있는 자료입니다.
- Control Number
- joongbu:640905
Buch Status
- Reservierung
- 캠퍼스간 도서대출
- 서가에 없는 책 신고
- Meine Mappe