서브메뉴
검색
Methods for the Design and Analysis of Disease-Oriented Multi-Sample Single-Cell Studies.
Methods for the Design and Analysis of Disease-Oriented Multi-Sample Single-Cell Studies.
- 자료유형
- 학위논문
- Control Number
- 0017161678
- International Standard Book Number
- 9798382784342
- Dewey Decimal Classification Number
- 574
- Main Entry-Personal Name
- Millard, Nghia Patrick.
- Publication, Distribution, etc. (Imprint
- [S.l.] : Harvard University., 2024
- Publication, Distribution, etc. (Imprint
- Ann Arbor : ProQuest Dissertations & Theses, 2024
- Physical Description
- 283 p.
- General Note
- Source: Dissertations Abstracts International, Volume: 85-12, Section: B.
- General Note
- Advisor: Raychaudhuri, Soumya.
- Dissertation Note
- Thesis (Ph.D.)--Harvard University, 2024.
- Summary, Etc.
- 요약Recent advances in single-cell technologies have enabled the characterization of heterogeneous cell types in human diseases by measuring various features of individual cells, such as their transcriptomic, proteomic, and epigenomic profiles in the context of their spatial location in tissue. Due to the expensive cost and the high-dimensionality, sparsity, and noisiness of single-cell data investigators who wish to use single-cell technologies face key challenges in designing single-cell studies, performing integrative analysis of cells from multiple samples, and gleaning biological understanding from these data. In this dissertation, I present the development and application of novel computational methods and analysis frameworks that help address these challenges.First, I introduce scPOST, an algorithm for simulating large-scale, multi-sample single-cell RNA-sequencing datasets. scPOST enables investigators to simulate their future single-cell studies with different parameters, such as the number of cells, number of cells per sample, and number of batches. This allows investigators to determine the optimal design parameters for their study.Next, I introduce the development and application of two algorithms, Harmony and Crescendo, which are batch correction algorithms designed to help remove the batch effects that are prominent in single-cell data. I show that these algorithms feature superior performance in removing batch effects and are fast and scalable to large single-cell datasets that contain hundreds of thousands or even millions of cells. Finally, I showcase the application of these methods to analyzing a large 82-sample cohort of rheumatoid arthritis (RA) patients containing 314,000 cells. After performing batch correction with Harmony and a prospective power analysis with scPOST, I introduce a novel framework called cell-type abundance phenotypes (CTAPs) for classifying samples based on the abundance of cell types present in the sample. I then discuss how we used the CTAP framework to characterize the diversity of synovial inflammation in RA, identify disease-relevant cell states and transcriptomic signatures for different phenotypes of RA, and predict disease response.Overall, this work features a collection of computational methods that investigators can use to design their studies and analyze their single-cell data. These approaches are broadly applicable to many single-cell technologies and different diseases and will help investigators gain a greater understanding of how cells contribute to the pathology of a disease.
- Subject Added Entry-Topical Term
- Bioinformatics.
- Subject Added Entry-Topical Term
- Immunology.
- Subject Added Entry-Topical Term
- Cellular biology.
- Index Term-Uncontrolled
- Cell
- Index Term-Uncontrolled
- Transcriptomics
- Index Term-Uncontrolled
- Single-cell technologies
- Index Term-Uncontrolled
- Rheumatoid arthritis
- Index Term-Uncontrolled
- Human diseases
- Added Entry-Corporate Name
- Harvard University Medical Sciences
- Host Item Entry
- Dissertations Abstracts International. 85-12B.
- Electronic Location and Access
- 로그인을 한후 보실 수 있는 자료입니다.
- Control Number
- joongbu:654996