본문

서브메뉴

Modelling Sequence and Structure Towards Functional Protein Design.
Modelling Sequence and Structure Towards Functional Protein Design.
Contents Info
Modelling Sequence and Structure Towards Functional Protein Design.
Material Type  
 학위논문
 
0017164029
Date and Time of Latest Transaction  
20250211152823
ISBN  
9798346567646
DDC  
574
Author  
Paul, Steffanie B.
Title/Author  
Modelling Sequence and Structure Towards Functional Protein Design.
Publish Info  
[S.l.] : Harvard University., 2024
Publish Info  
Ann Arbor : ProQuest Dissertations & Theses, 2024
Material Info  
210 p.
General Note  
Source: Dissertations Abstracts International, Volume: 86-05, Section: B.
General Note  
Advisor: Marsk, Debora S.
학위논문주기  
Thesis (Ph.D.)--Harvard University, 2024.
Abstracts/Etc  
요약Millenia of evolutionary experiments have produced an extensive universe of natural macromolecular machines - proteins - that perform the variety of complex functions needed to make up a cell. In the last few decades, advances in protein engineering technologies, including the adoption of machine learning methods, have enabled us to bend and reform nature's designs towards our human needs. The advent of generative machine learning models trained on evolutionary data has enabled us to leverage nature's experiments along with years of domain knowledge to significantly move the needle on what kinds of proteins and functions we can possibly design. While these models have massive promise, there is still much to understand about i) what these models are learning, ii) how well they are learning, and iii) which models are useful for which design task. This thesis provides tools and insights to the field to shed light on these questions and thus advance our ability to engineer proteins for the functions we want.We begin with the need to identify what models perform better than others and what biological design tasks they may be useful for. Towards this, chapter 1 details our curation of the largest benchmarking dataset for generative protein models for fitness prediction, which we used to identify functional advantages for particular classes of generative models. This evaluation paradigm relies on functional measurements, which may not be available for any given protein an engineer is interested in. Thus, in chapter 2 we develop novel, statistically motivated kernel-based evaluation metrics that can be used to verify how accurately and reliably a conditional generative model has learned the distribution of the protein of interest; this provides a practitioner with helpful information about how well their model might perform for their task a priori. For highly complex functions in highly local sequence space, we argue that focused experimental data are needed to get engineering gains. In chapter 3 we discuss how machine learning models can improve the efficiency of experimental pipelines and increase our design capabilities, with a case-study on machine learning-assisted antibody optimization.
Subject Added Entry-Topical Term  
Bioinformatics.
Subject Added Entry-Topical Term  
Applied mathematics.
Subject Added Entry-Topical Term  
Bioengineering.
Subject Added Entry-Topical Term  
Evolution & development.
Index Term-Uncontrolled  
Generative AI
Index Term-Uncontrolled  
Machine learning
Index Term-Uncontrolled  
Protein engineering
Index Term-Uncontrolled  
Millenia
Index Term-Uncontrolled  
Protein design
Added Entry-Corporate Name  
Harvard University Medical Sciences
Host Item Entry  
Dissertations Abstracts International. 86-05B.
Electronic Location and Access  
로그인을 한후 보실 수 있는 자료입니다.
Control Number  
joongbu:656493
New Books MORE
최근 3년간 통계입니다.

Detail Info.

  • Reservation
  • 캠퍼스간 도서대출
  • 서가에 없는 책 신고
  • My Folder
Material
Reg No. Call No. Location Status Lend Info
TQ0032615 T   원문자료 열람가능/출력가능 열람가능/출력가능
마이폴더 부재도서신고

* Reservations are available in the borrowing book. To make reservations, Please click the reservation button

해당 도서를 다른 이용자가 함께 대출한 도서

Related books

Related Popular Books

도서위치