본문

서브메뉴

Prosody in Human Communication and Machine Understanding.
コンテンツ情報
Prosody in Human Communication and Machine Understanding.
자료유형  
 학위논문
Control Number  
0017163913
International Standard Book Number  
9798384094975
Dewey Decimal Classification Number  
401
Main Entry-Personal Name  
Ng, Sara B.
Publication, Distribution, etc. (Imprint  
[S.l.] : University of Washington., 2024
Publication, Distribution, etc. (Imprint  
Ann Arbor : ProQuest Dissertations & Theses, 2024
Physical Description  
91 p.
General Note  
Source: Dissertations Abstracts International, Volume: 86-03, Section: B.
General Note  
Advisor: Wright, Richard A.;Ostendorf, Mari.
Dissertation Note  
Thesis (Ph.D.)--University of Washington, 2024.
Summary, Etc.  
요약Speech technology is a ubiquitous part of the modern world, from the voice-enabled assistants in smartphones to bespoke tools used by language researchers. Technological advances and the curation of large speech datasets have enabled these systems to identify words with remarkable quality. However, the black-box nature of large commercial speech understanding systems brings into question the extent to which they can take advantage of cues from prosody.Prosody has great potential as an untapped source of linguistic information for speech understanding that is not surfaced in other aspects of language. Previous work has shown that prosodic information can be exploited computationally to resolve ambiguity for linguistic structures in computational models, and to perform tasks which are considered prosodically significant, such as sarcasm detection. However, computational systems do not benefit from the same social and conversational context that humans have in processing this kind of communication, making such tasks more challenging and further motivating the careful study of prosodic input.This work investigates the hypothesis that explicit encoding of acoustic-prosodic features is a benefit to speech understanding technology. From the domain of punctuation prediction in automatic speech recognition, I show that adding acoustic-prosodic measures can improve the performance of punctuation prediction models for speech transcripts compared to a system that uses only the word sequence. I provide a potential use case for prosodic modeling in the domain of speech entrainment. Finally, I show how computational methods can be used to understand human behavior in prosodically marked speech within the domains of speech timing and regions of presumed hyper-articulation.This work bridges the gap between linguistic questions about prosody, and computational questions about the use of or need for linguistically-motivated acoustic features. Understanding how prosody influences the quality of speech understanding systems is vital in enhancing their utility across various domains and for diverse speakers. The synthesis of the these research strands provides a bird's eye view of the methodologies and challenges that can be involved in computational processing of prosody.
Subject Added Entry-Topical Term  
Linguistics.
Subject Added Entry-Topical Term  
Communication.
Subject Added Entry-Topical Term  
Speech therapy.
Index Term-Uncontrolled  
Entrainment
Index Term-Uncontrolled  
Hyperarticulation
Index Term-Uncontrolled  
Prosody
Index Term-Uncontrolled  
Punctuation
Index Term-Uncontrolled  
Speech recognition
Index Term-Uncontrolled  
Stance
Added Entry-Corporate Name  
University of Washington Linguistics
Host Item Entry  
Dissertations Abstracts International. 86-03B.
Electronic Location and Access  
로그인을 한후 보실 수 있는 자료입니다.
Control Number  
joongbu:654645
New Books MORE
최근 3년간 통계입니다.

詳細情報

  • 予約
  • 캠퍼스간 도서대출
  • 서가에 없는 책 신고
  • 私のフォルダ
資料
登録番号 請求記号 場所 ステータス 情報を貸す
TQ0030567 T   원문자료 열람가능/출력가능 열람가능/출력가능
마이폴더 부재도서신고

*ご予約は、借入帳でご利用いただけます。予約をするには、予約ボタンをクリックしてください

해당 도서를 다른 이용자가 함께 대출한 도서

Related books

Related Popular Books

도서위치