중부대학교 도서관

본문 바로가기
탑 메뉴 바로가기
주 메뉴 바로가기
하단 바로가기

Inhalt Info

Hybrid Knowledge Architectures for Question Answering- [electronic resource]

자료유형: 학위논문

Control Number: 0016935000

International Standard Book Number: 9798380855600

Dewey Decimal Classification Number: 004

Main Entry-Personal Name: Ma, Kaixin.

Publication, Distribution, etc. (Imprint: [S.l.] : Carnegie Mellon University., 2023

Publication, Distribution, etc. (Imprint: Ann Arbor : ProQuest Dissertations & Theses, 2023

Physical Description: 1 online resource(151 p.)

General Note: Source: Dissertations Abstracts International, Volume: 85-05, Section: B.

General Note: Advisor: Nyberg, Eric.

Dissertation Note: Thesis (Ph.D.)--Carnegie Mellon University, 2023.

Restrictions on Access Note: This item must not be sold to any third party vendors.

Summary, Etc.: 요약Question answering (QA) is a knowledge-intensive task in natural language processing (NLP) that requires the system to provide answers to user queries expressed in natural language. The types of knowledge a QA system is equipped with broadly fall into two categories, namely explicit knowledge and implicit knowledge. The explicit knowledge takes formats that are human readable, e.g. raw text, knowledge graphs, structured tables etc, while the implicit knowledge resides in the model parameters that are learned by training on the explicit knowledge. Due to the complementary nature of these two types of knowledge, most recent QA research has tried to leverage both of them for modeling. However, existing work on this front mostly focuses on building customized models for specific datasets, which do not generalize well to other use cases. Moreover, while using end-to-end models to directly learn to fuse different knowledge is a simple solution and often works well, it's hard to interpret the model's reasoning process, leading to untrustworthy predictions. Finally, most systems are designed without considering memory and computation efficiency, which hinders' their application to real-world use cases.With the aforementioned issues in mind, in this thesis, we present solutions for building generalizable, interpretable, and efficient QA systems. Specifically, we present three solution elements, namely 1) hybrid knowledge fusion, 2) modularized knowledge framework, and 3) modularized knowledge sharing. In the first part, we study various ways of injecting commonsense knowledge into QA systems powered by pretrained language models. Our results show that instance-level late fusion of knowledge subgraphs is promising in a supervised setting and pretraining on transformed knowledge graphs (KGs) provides substantial gains across a diverse set of tasks in a zero-shot setup. These findings show that combining explicit and implicit knowledge is a step towards generalization across different domains of questions. In the second part, we explored two different modularized frameworks for open-domain question answering that bridge the gap across knowledge types and question types. We show that text can serve as a universal knowledge interface for different types of structured knowledge, and decomposing the reasoning process into discrete steps enables a single unified system to solve both single-hop and multi-hop questions. Modularized frameworks not only offer generalization across modalities of knowledge and question types but also bring improved interpretability of the reasoning process. In the third part, we extend the modularized framework from the previous part by allowing implicit knowledge sharing among different modules. Multiple reasoning modules are merged together and learned simultaneously through multi-task learning, and we further add skill-specific specialization for each module to reduce task interference. Such an architecture not only greatly reduced the overall model size but also improved the inference efficiency, therefore achieving all three target properties generalizability, interpretability, and efficiency. Finally, we discuss open challenges and ways forward beyond this thesis.

Subject Added Entry-Topical Term: Computer science.

Subject Added Entry-Topical Term: Linguistics.

Subject Added Entry-Topical Term: Information technology.

Index Term-Uncontrolled: Commonsense reasoning

Index Term-Uncontrolled: Information retrieval

Index Term-Uncontrolled: Knowledge bases

Index Term-Uncontrolled: Natural language processing

Index Term-Uncontrolled: Question answering

Added Entry-Corporate Name: Carnegie Mellon University Language Technologies Institute

Host Item Entry: Dissertations Abstracts International. 85-05B.

Host Item Entry: Dissertation Abstract International

Electronic Location and Access: 로그인을 한후 보실 수 있는 자료입니다.

Control Number: joongbu:640725

New Books MORE

최근 3년간 통계입니다.

Reservierung
캠퍼스간 도서대출
서가에 없는 책 신고
보존서고대출신청
Meine Mappe

Sammlungen
Registrierungsnummer	callnumber	Standort	Verkehr Status	Verkehr Info
TQ0026645	T	원문자료	열람가능/출력가능	열람가능/출력가능 마이폴더 부재도서신고

* Kredite nur für Ihre Daten gebucht werden. Wenn Sie buchen möchten Reservierungen, klicken Sie auf den Button.

본문

서브메뉴

검색

New Books MORE

최근 3년간 통계입니다.

Buch Status

해당 도서를 다른 이용자가 함께 대출한 도서

Related books

Related Popular Books

도서위치

QUICK LINK