Joongbu University Library

Domain Adapted Visual Representation Learning for Machine Perception [electronic resource]

Material Type: Dissertation

Control Number: 0016934714

International Standard Book Number: 9798380394345

Dewey Decimal Classification Number: 621.3

Main Entry-Personal Name: Li, Yu-Jhe.

Publication, Distribution, etc. (Imprint): [S.l.] : Carnegie Mellon University, 2023

Publication, Distribution, etc. (Imprint): Ann Arbor : ProQuest Dissertations & Theses, 2023

Physical Description: 1 online resource (194 p.)

General Note: Source: Dissertations Abstracts International, Volume: 85-03, Section: B.

General Note: Advisor: Kitani, Kris.

Dissertation Note: Thesis (Ph.D.)--Carnegie Mellon University, 2023.

Restrictions on Access Note: This item must not be sold to any third party vendors.

Summary, Etc.: Our objective is to enhance the generalization capabilities of existing machine perception models and achieve diverse domain alignments through adept representation learning. Many established approaches for perception tasks, encompassing object classification, detection, tracking, and rendering, often confront diverse domain changes that curtail their adaptability to novel domains. We categorize these changes into three types: 1) alterations in pose and viewpoint, 2) variations in visual capture conditions, and 3) diversity in modalities. Initially, models trained on specific viewpoints may falter when faced with viewpoints outside their training range. Second, changes in visual data capture conditions, encompassing changes in illumination or image resolution, can erode the generalization of trained models. Third, employing pre-trained models across distinct modalities, such as RGB, Lidar point clouds, Radar maps, or text embeddings, can lead to performance degradation. In this thesis, we propose to perform domain alignment to handle the aforementioned domain changes.

The first segment of this thesis outlines our approach to performing domain alignment without the need for arduously training extensive models across multiple domains. We advocate for efficient handling of each type of change through visual representation learning techniques, utilizing models with minimal network parameters and judicious training data. This process, known as domain adaptation, unfolds in three stages. Initially, for pose and viewpoint variation, we propose acquiring viewpoint-invariant or pose-invariant representations, relevant to tasks like Re-ID, object tracking, and 3D face rendering. Subsequently, to mitigate the impact of changes in visual capture conditions, we harness semi-supervised and adversarial learning methods for tasks such as object detection and Re-ID. Lastly, to address cross-modal domain changes, we leverage self-training strategies to cultivate modality-agnostic representations for object detection.

The second part of this thesis extends our domain-aligning framework to manage scenarios involving more than two forms of domain changes. To concurrently handle viewpoint variation and diverse modalities, we devise models capable of learning view-invariant representations for multiple modalities within the realm of 3D human pose estimation and rendering. Moreover, to combat changes arising from changes in resolution and diverse modalities in physical devices (e.g., ADC signals and Radar's RGB images), we advocate for the acquisition of super-resolution representations using models featuring complex values. Broadly, this thesis delves into the intricacies of perception tasks affected by domain changes and provides pragmatic solutions to address these challenges in real-world contexts.

Subject Added Entry-Topical Term: Computer engineering.

Subject Added Entry-Topical Term: Computer science.

Index Term-Uncontrolled: Deep learning

Index Term-Uncontrolled: Domain adaptation

Index Term-Uncontrolled: Multi-modality learning

Index Term-Uncontrolled: Perception tasks

Index Term-Uncontrolled: Representation learning

Added Entry-Corporate Name: Carnegie Mellon University Electrical and Computer Engineering

Host Item Entry: Dissertations Abstracts International. 85-03B.

Host Item Entry: Dissertation Abstract International

Electronic Location and Access: This item can be viewed after logging in.

Control Number: joongbu:640428

Holdings
Registration No.	Call No.	Location	Availability	Loan Information
TQ0026344	T	Full-text resource	Viewable/Printable	Viewable/Printable
