Joongbu University Library

Bridging the Gap Between Humans and Machines in 3D Object Perception [electronic resource]

Material Type: Thesis/Dissertation

Control Number: 0016932475

International Standard Book Number: 9798380879408

Dewey Decimal Classification Number: 004

Main Entry-Personal Name: Collins, Jasmine.

Publication, Distribution, etc. (Imprint): [S.l.] : University of California, Berkeley, 2023

Publication, Distribution, etc. (Imprint): Ann Arbor : ProQuest Dissertations & Theses, 2023

Physical Description: 1 online resource (92 p.)

General Note: Source: Dissertations Abstracts International, Volume: 85-06, Section: B.

General Note: Advisor: Malik, Jitendra.

Dissertation Note: Thesis (Ph.D.)--University of California, Berkeley, 2023.

Restrictions on Access Note: This item must not be sold to any third party vendors.

Summary, Etc.: Humans possess a remarkable ability to extract general object representations from a single image, capturing not only shape and texture, but also 3D form. In contrast, 3D reasoning in many computer vision systems is often limited. This thesis presents three efforts aimed at bridging this gap in 3D object perception. First, we introduce a new dataset that focuses on real-world, object-centered 3D understanding. The dataset provides a diverse set of objects corresponding to real household objects, with varying geometries and physically-based rendering materials. It also includes additional annotations describing each object, making it a valuable resource for training and evaluating computer vision models. Next, we design a method for automatically inferring the articulation of 3D objects. The method enables interaction with 3D objects and can be used to generate more realistic and dynamic scenes. By understanding how different parts of an object move and interact, computer vision systems can better model and reason about complex 3D scenes in simulation. Finally, we investigate the effectiveness of contrastive learning with 3D data augmentation to generate multiple views of objects, a departure from the typical method of training on single-view images. We show that generating multiple views of objects can help computer vision systems learn better representations and improve their overall object understanding in terms of classification and shape perception. These contributions represent efforts towards bridging the gap between human and machine 3D object perception, ultimately enabling machines to understand 3D objects from single images in ways that are more aligned with human perception.

Subject Added Entry-Topical Term: Computer science.

Subject Added Entry-Topical Term: Robotics.

Index Term-Uncontrolled: Contrastive learning

Index Term-Uncontrolled: Computer vision models

Index Term-Uncontrolled: Single view images

Index Term-Uncontrolled: Human perception

Index Term-Uncontrolled: 3D understanding

Added Entry-Corporate Name: University of California, Berkeley Computer Science

Host Item Entry: Dissertations Abstracts International. 85-06B.

Host Item Entry: Dissertation Abstract International

Electronic Location and Access: This material is available after logging in.

Control Number: joongbu:642779

Holdings
Registration No.	Call No.	Location	Availability	Loan Information
TQ0028697	T	Full-text material	Viewable/Printable	Viewable/Printable
