서브메뉴
검색
Rethinking System Design With Awareness for Cross-Layer Aspects of Datacenter Storage.
Rethinking System Design With Awareness for Cross-Layer Aspects of Datacenter Storage.
- 자료유형
- 학위논문
- Control Number
- 0017161835
- International Standard Book Number
- 9798382808048
- Dewey Decimal Classification Number
- 004
- Main Entry-Personal Name
- Raina, Ashwini.
- Publication, Distribution, etc. (Imprint
- [S.l.] : Princeton University., 2024
- Publication, Distribution, etc. (Imprint
- Ann Arbor : ProQuest Dissertations & Theses, 2024
- Physical Description
- 104 p.
- General Note
- Source: Dissertations Abstracts International, Volume: 85-12, Section: B.
- General Note
- Advisor: Freedman, Michael J.
- Dissertation Note
- Thesis (Ph.D.)--Princeton University, 2024.
- Summary, Etc.
- 요약Storage is a critical piece of infrastructure in modern web applications. In recent years, storage technologies employed in building such systems have undergone significant evolution, bringing about novel cost-performance trade-offs. Concurrently, datacenter storage architectures have become increasingly layered. Software systems designed based on outdated assumptions of datacenter storage often result in poor cost-performance trade-offs or suffer from suboptimal performance. This dissertation proposes a new design approach for systems, one that incorporates the awareness of cross-layer aspects of datacenter storage, and validates the effectiveness of this approach through two systems.The first system is PrismDB, a novel key-value store that exploits two extreme ends of the spectrum of modern NVMe storage technologies (3D XPoint and QLC NAND) simultaneously. In recent years, emerging storage technologies have focused on divergent goals: better performance or lower cost. Correspondingly, data systems that employ these technologies are typically optimized either to be fast (but expensive) or cheap (but slow). PrismDB take a different approach: by architecting a storage engine to natively utilize two tiers of fast and low-cost storage technologies, it shows that a Pareto-efficient balance between performance and cost can be achieved.The second system is Fusion, an object store for analytics that is optimized for query pushdown on erasure-coded data. Computation pushdown is a widely adopted technique to reduce latency of highly selective queries in modern OLAP cloud database running on disaggregated storage. However, existing pushdown solutions are inefficient on erasure-coded storage since the analytics file objects get partitioned across storage nodes. Consequently, the storage system must reassemble the object across nodes before executing the query, leading to significant network latency. Fusion addresses this problem by co-designing its erasure coding and file placement topologies, taking into account popular analytics file formats (e.g., Parquet). It employs a novel stripe construction algorithm that prevents the fragmentation of computable units within an object, and minimizes storage overhead during erasure coding.Overall, this dissertation advocates for designing software systems with an awareness of cross-layer aspects in datacenter storage, and demonstrates the benefits of that approach via two systems: PrismDB and Fusion.
- Subject Added Entry-Topical Term
- Computer science.
- Subject Added Entry-Topical Term
- Architectural engineering.
- Index Term-Uncontrolled
- Datacenter storage
- Index Term-Uncontrolled
- Distributed systems
- Index Term-Uncontrolled
- Key-value store
- Index Term-Uncontrolled
- Object store
- Index Term-Uncontrolled
- Storage systems
- Index Term-Uncontrolled
- Tiered storage
- Added Entry-Corporate Name
- Princeton University Computer Science
- Host Item Entry
- Dissertations Abstracts International. 85-12B.
- Electronic Location and Access
- 로그인을 한후 보실 수 있는 자료입니다.
- Control Number
- joongbu:657454
Подробнее информация.
- Бронирование
- 캠퍼스간 도서대출
- 서가에 없는 책 신고
- моя папка