본문

서브메뉴

Bridging the Gap Between Humans and Machines in 3D Object Perception- [electronic resource]
Sommaire Infos
Bridging the Gap Between Humans and Machines in 3D Object Perception- [electronic resource]
자료유형  
 학위논문
Control Number  
0016932475
International Standard Book Number  
9798380879408
Dewey Decimal Classification Number  
004
Main Entry-Personal Name  
Collins, Jasmine.
Publication, Distribution, etc. (Imprint  
[S.l.] : University of California, Berkeley., 2023
Publication, Distribution, etc. (Imprint  
Ann Arbor : ProQuest Dissertations & Theses, 2023
Physical Description  
1 online resource(92 p.)
General Note  
Source: Dissertations Abstracts International, Volume: 85-06, Section: B.
General Note  
Advisor: Malik, Jitendra.
Dissertation Note  
Thesis (Ph.D.)--University of California, Berkeley, 2023.
Restrictions on Access Note  
This item must not be sold to any third party vendors.
Summary, Etc.  
요약Humans possess a remarkable ability to extract general object representations from a single image, capturing not only shape and texture, but also 3D form. In contrast, 3D reasoning in many computer vision systems is often limited. This thesis present three efforts aimed towards bridging this gap in 3D object perception. First we introduce a new dataset that focuses on real-world, object-centered 3D understanding. The dataset provides a diverse set of objects corresponding to real household objects, with varying geometries and physically-based rendering materials. It also includes additional annotations describing each object, making it a valuable resource for training and evaluating computer vision models. Next, we design a method for automatically inferring the articulation of 3D objects. The method enables the interaction of 3D objects and can be used to generate more realistic and dynamic scenes. By understanding how different parts of an object move and interact, computer vision systems can better model and reason about complex 3D scenes in simulation. Finally, we investigate the effectiveness of contrastive learning with 3D data augmentation to generate multiple views of objects, a departure from the typical method of training single view images. We show that generating multiple views of objects can help computer vision systems learn better representations and improve their overall object understanding in terms of classification and shape perception. These contributions represent efforts towards bridging the gap between human and machine 3D object perception, ultimately enabling them to understand 3D objects from single images in ways that are more aligned with human perception.
Subject Added Entry-Topical Term  
Computer science.
Subject Added Entry-Topical Term  
Robotics.
Index Term-Uncontrolled  
Contrastive learning
Index Term-Uncontrolled  
Computer vision models
Index Term-Uncontrolled  
Single view images
Index Term-Uncontrolled  
Human perception
Index Term-Uncontrolled  
3D understanding
Added Entry-Corporate Name  
University of California, Berkeley Computer Science
Host Item Entry  
Dissertations Abstracts International. 85-06B.
Host Item Entry  
Dissertation Abstract International
Electronic Location and Access  
로그인을 한후 보실 수 있는 자료입니다.
Control Number  
joongbu:642779
New Books MORE
최근 3년간 통계입니다.

Info Détail de la recherche.

  • Réservation
  • 캠퍼스간 도서대출
  • 서가에 없는 책 신고
  • My Folder
Matériel
Reg No. Call No. emplacement Status Lend Info
TQ0028697 T   원문자료 열람가능/출력가능 열람가능/출력가능
마이폴더 부재도서신고

* Les réservations sont disponibles dans le livre d'emprunt. Pour faire des réservations, S'il vous plaît cliquer sur le bouton de réservation

해당 도서를 다른 이용자가 함께 대출한 도서

Related books

Related Popular Books

도서위치