Teaching Large Language Models to Use Tools at Scale.
- Material Type
- Dissertation (Thesis)
- Control Number
- 0017161897
- International Standard Book Number
- 9798384453758
- Dewey Decimal Classification Number
- 004
- Main Entry-Personal Name
- Patil, Shishir Girishkumar.
- Publication, Distribution, etc. (Imprint)
- [S.l.] : University of California, Berkeley, 2024
- Publication, Distribution, etc. (Imprint)
- Ann Arbor : ProQuest Dissertations & Theses, 2024
- Physical Description
- 106 p.
- General Note
- Source: Dissertations Abstracts International, Volume: 86-03, Section: B.
- General Note
- Advisor: Gonzalez, Joseph Edgar; Dutta, Prabal.
- Dissertation Note
- Thesis (Ph.D.)--University of California, Berkeley, 2024.
- Summary, Etc.
- Large language models (LLMs) have shown impressive advancements in many complex tasks such as mathematical reasoning and program synthesis. Despite this progress, the ability of LLMs to effectively utilize tools, services, and applications remains limited. To address this gap, we first introduce Gorilla LLM, a finetuning recipe that enhances the ability of LLMs to use tools by invoking APIs. Gorilla also introduces abstract syntax tree (AST)-based metrics to evaluate API hallucination in LLMs. Further, recognizing that evaluating LLMs can be challenging, we develop OpenFunctions, a pre-trained model that does not require retraining and instead relies on retrieval-augmented generation (RAG) to surface relevant APIs. This system allows LLMs to access an updated repository of functions and services, improving their utility without the overhead of constant model retraining. Complementing function calling, RAFT (Retrieval Augmented Fine Tuning) provides a recipe for embedding new domain-specific knowledge into models. By training LLMs to discern and utilize only relevant information from a set of retrieved documents, RAFT improves accuracy and reliability in "open-book" settings across various in-domain datasets. Finally, to enable the autonomous execution of LLM-generated commands, which can be prone to errors, we present the Gorilla Execution Engine (GoEx), a novel runtime system that enforces execution under least privilege by dynamically interpreting user intentions and incorporates "undo" and "damage confinement" abstractions to mitigate risks. GoEx supports post-facto validation, allowing users to verify the correctness of actions after they are executed and to revert any undesired effects. GoEx enables LLMs to operate autonomously while significantly reducing the potential risks associated with their autonomous actions. We believe that together, these developments (Gorilla, OpenFunctions, RAFT, and GoEx) are critical to unlocking the potential for LLM agents to interact with applications and services. (Illustrative sketches of the AST-matching and RAFT data-construction ideas follow this record.)
- Subject Added Entry-Topical Term
- Computer science.
- Subject Added Entry-Topical Term
- Computer engineering.
- Index Term-Uncontrolled
- Large language models
- Index Term-Uncontrolled
- Retriever augmented generation
- Index Term-Uncontrolled
- Tool use
- Index Term-Uncontrolled
- Retrieval-augmented generation
- Added Entry-Corporate Name
- University of California, Berkeley Electrical Engineering & Computer Sciences
- Host Item Entry
- Dissertations Abstracts International. 86-03B.
- Electronic Location and Access
- This material is available after logging in.
- Control Number
- joongbu:656474
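
The abstract above mentions AST-based metrics for detecting API hallucination. As a rough illustration only, and not the dissertation's actual metric, the sketch below compares a generated Python API call against a reference call by parsing both into abstract syntax trees; the example API names are placeholders.

```python
import ast

def call_matches(generated: str, reference: str) -> bool:
    """Compare a generated API call to a reference call via their ASTs
    rather than raw strings."""
    try:
        gen = ast.parse(generated, mode="eval").body
        ref = ast.parse(reference, mode="eval").body
    except SyntaxError:
        return False
    if not (isinstance(gen, ast.Call) and isinstance(ref, ast.Call)):
        return False
    # The called function (e.g. "torch.hub.load") must be identical.
    if ast.dump(gen.func) != ast.dump(ref.func):
        return False
    # Every keyword argument in the reference must appear in the generation
    # with the same value; extra keywords are tolerated in this simplification.
    gen_kwargs = {kw.arg: ast.dump(kw.value) for kw in gen.keywords}
    for kw in ref.keywords:
        if gen_kwargs.get(kw.arg) != ast.dump(kw.value):
            return False
    # Positional arguments are compared pairwise, in order.
    return [ast.dump(a) for a in gen.args] == [ast.dump(a) for a in ref.args]

print(call_matches(
    'torch.hub.load("pytorch/vision", "resnet50", pretrained=True)',
    'torch.hub.load("pytorch/vision", "resnet50", pretrained=True)'))  # True
print(call_matches(
    'torch.hub.load("pytorch/vision", "resnet999", pretrained=True)',
    'torch.hub.load("pytorch/vision", "resnet50", pretrained=True)'))  # False
```

Comparing trees rather than strings means cosmetic differences such as whitespace or keyword-argument order are not flagged, while a hallucinated function name or wrong argument value is.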
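Likewise, RAFT as summarized above trains on "open-book" examples whose context mixes an answer-bearing document with distractors. The sketch below is a guess at the general shape of such data construction, not the dissertation's exact procedure; the oracle-inclusion rate, the number of context documents, and the prompt template are assumptions.

```python
import random

def build_raft_example(question, oracle_doc, distractor_pool, k=4, p_oracle=0.8):
    """Build one RAFT-style training prompt: a question plus k context documents.
    With probability p_oracle the document that actually answers the question
    replaces one distractor; otherwise the context holds only distractors,
    pushing the model to rely on relevant evidence and ignore the rest."""
    docs = random.sample(distractor_pool, k)
    if random.random() < p_oracle:
        docs[random.randrange(k)] = oracle_doc
    random.shuffle(docs)
    context = "\n\n".join(f"[Document {i + 1}]\n{d}" for i, d in enumerate(docs))
    return f"{context}\n\nQuestion: {question}\nAnswer:"

# Hypothetical usage: the training target would be a reference answer that
# cites the oracle document when it is present in the context.
prompt = build_raft_example(
    "Which API loads a pretrained ResNet?",
    "torch.hub.load('pytorch/vision', 'resnet50', pretrained=True) loads ResNet-50.",
    ["Pandas reads CSV files with pandas.read_csv().",
     "NumPy arrays are created with numpy.array().",
     "Matplotlib draws figures with pyplot.plot().",
     "Requests fetches URLs with requests.get()."])
print(prompt)
```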