Joongbu University Library

Statistical and Computational Methods for High-Dimensional Genomics Data [electronic resource]

Material Type: Dissertation

Control Number: 0016935568

International Standard Book Number: 9798380371568

Dewey Decimal Classification Number: 310

Main Entry-Personal Name: Ma, Ying.

Publication, Distribution, etc. (Imprint): [S.l.] : University of Michigan, 2023

Publication, Distribution, etc. (Imprint): Ann Arbor : ProQuest Dissertations & Theses, 2023

Physical Description: 1 online resource(282 p.)

General Note: Source: Dissertation Abstracts International, Volume: 85-03, Section: B.

General Note: Advisor: Zhou, Xiang.

Dissertation Note: Thesis (Ph.D.)--University of Michigan, 2023.

Restrictions on Access Note: This item must not be sold to any third party vendors.

Restrictions on Access Note: This item must not be added to any third party search indexes.

Summary, Etc.: Advancements in transcriptomic technologies have enabled the measurement of gene expression at single-cell resolution and provided spatial localization information on tissues. The increasing accessibility of these single-cell RNA sequencing (scRNA-seq) or spatially resolved transcriptomic (SRT) datasets provides a comprehensive cell atlas and enables the thorough characterization of the transcriptomic landscapes of tissues for a mechanistic understanding of many biological processes. At the same time, improvements in transcriptomic technologies have increased both the volume and complexity of data, introducing new computational and statistical challenges for data analysis, including differential expression (DE) analysis, gene set enrichment (GSE) analysis, cell type deconvolution analysis, and spatial domain clustering. In this dissertation, I propose three statistical and computational methods to address these challenges, capturing and dissecting cellular and tissue heterogeneity with high statistical power and accuracy while providing new insight into biological systems. In Chapter 2, I develop a method, iDEA, that performs joint DE and GSE analysis in scRNA-seq studies. By integrating DE and GSE analyses, iDEA improves the power and consistency of DE analysis and effectively controls type I errors, thus yielding high statistical power and accuracy in GSE analysis. Importantly, iDEA uses only DE summary statistics as input, enabling effective data modeling by complementing and pairing with various existing DE methods. I illustrate the benefits of iDEA with extensive simulations and three scRNA-seq datasets, where iDEA achieves up to a five-fold power gain over existing GSE methods and up to a 64% power gain over existing DE methods. In Chapter 3, I develop a method, CARD, to perform spatially informed cell type deconvolution for SRT data.
CARD builds upon a non-negative matrix factorization (NMF) model that leverages cell-type-specific gene expression from scRNA-seq data. A unique feature of CARD is its ability to accommodate the spatial correlation structure in cell-type composition across tissue locations through a conditional autoregressive (CAR) modeling assumption. This enables accurate and robust deconvolution of SRT data across technologies and in the presence of mismatched scRNA-seq references. Furthermore, modeling spatial correlation allows CARD to impute cell-type compositions and gene expression levels at new locations on the tissue, facilitating the reconstruction of a high-resolution map. Importantly, CARD is computationally efficient and scales to datasets with tens of thousands of genes measured on tens of thousands of samples. In extensive simulations and comprehensive applications to four real datasets, CARD outperforms other methods and provides novel biological insight into tissue heterogeneity. In Chapter 4, I develop a method that simultaneously characterizes the transcriptomic landscapes of multiple tissues. While SRT datasets can be generated from multiple tissue sections at high resolution, existing methods primarily focus on a single tissue section and fail to utilize information from scRNA-seq datasets for spatial domain detection. Additionally, many published methods lack the computational scalability needed for the high-resolution, large-scale SRT datasets being collected today. To fill these gaps, I develop IRIS, which leverages cell-type-specific gene expression information from scRNA-seq to detect spatial domains on multiple tissue sections. By iteratively updating spatial domain labels while considering within-slice and between-slice compositional similarities, IRIS ensures optimal clustering performance.
Through in-depth analysis of six spatial transcriptomics datasets, IRIS demonstrates significant advantages, achieving up to a 1083% improvement in clustering accuracy over existing methods. This enables the identification of transcriptomic landscapes in complex tissues, including the human prefrontal cortex, spermatogenesis, olfactory bulb, and human breast cancer.

Subject Added Entry-Topical Term: Statistics.

Subject Added Entry-Topical Term: Biostatistics.

Subject Added Entry-Topical Term: Genetics.

Index Term-Uncontrolled: Statistical methods

Index Term-Uncontrolled: Single-cell RNA-seq

Index Term-Uncontrolled: Spatial transcriptomics

Index Term-Uncontrolled: Gene expression

Index Term-Uncontrolled: Complex tissues

Added Entry-Corporate Name: University of Michigan Biostatistics

Host Item Entry: Dissertation Abstracts International. 85-03B.

Host Item Entry: Dissertation Abstracts International

Electronic Location and Access: This material is available after logging in.

Control Number: joongbu:641205

Holdings
Registration No.	Call No.	Location	Availability	Loan Information
TQ0027119	T	Full-text material	Viewable/Printable	Viewable/Printable
