서브메뉴
검색
Topological Representations for Visual Object Recognition in Unseen Indoor Environments- [electronic resource]
Topological Representations for Visual Object Recognition in Unseen Indoor Environments- [electronic resource]
- 자료유형
- 학위논문
- Control Number
- 0016934827
- International Standard Book Number
- 9798380333818
- Dewey Decimal Classification Number
- 621
- Main Entry-Personal Name
- Samani, Ekta Umesh.
- Publication, Distribution, etc. (Imprint
- [S.l.] : University of Washington., 2023
- Publication, Distribution, etc. (Imprint
- Ann Arbor : ProQuest Dissertations & Theses, 2023
- Physical Description
- 1 online resource(162 p.)
- General Note
- Source: Dissertations Abstracts International, Volume: 85-03, Section: B.
- General Note
- Advisor: Banerjee, Ashis G.
- Dissertation Note
- Thesis (Ph.D.)--University of Washington, 2023.
- Restrictions on Access Note
- This item must not be sold to any third party vendors.
- Summary, Etc.
- 요약Object recognition is an essential component of visual perception tasks that help robots build a semantic-level understanding of their environment. Although deep learning methods achieve extraordinary recognition performance in previously seen environments, they are insufficient for deployment in complex and continually-changing environments due to their sensitivity to environmental variations. To realize the goal of long-term autonomy in robots, we need perception methods that go beyond statistical correlation. Therefore, this dissertation focuses on developing robust object recognition methods using topological methods and human-like reasoning mechanisms.We begin by using topologically persistent features, which capture the objects' 2D shape information for recognition in unseen environments. In particular, we present two kinds of representations, namely, sparse persistence image (PI) and amplitude, computed by applying persistent homology to multi-directional height function-based filtrations (nested sequences of cubical complexes) representing the objects' segmentation maps. Using a benchmark dataset, we demonstrate that sparse PI features show better recognition performance in unseen environments than the features from widely-used deep learning-based feature extractors. On a new dataset, the UW Indoor Scenes (UW-IS) dataset, designed to test object recognition performance in unseen environments, the performance of sparse PI features remains relatively unchanged in an unseen test environment, unlike a state-of-the-art domain-adaptive object detection method.Next, we propose a new descriptor, TOPS, to capture the 3D shape information of point clouds generated from depth images, and an accompanying recognition framework, THOR, inspired by human reasoning. The descriptor employs a novel slicing-based approach to compute topological features from filtrations of simplicial complexes using persistent homology, and facilitates reasoning-based recognition using object unity. Apart from a benchmark dataset, we report performance on a new dataset, the UW Indoor Scenes (UW-IS) Occluded dataset, curated using commodity hardware to reflect real-world scenarios with different environmental conditions and degrees of object occlusion. THOR outperforms state-of-the-art methods on both the datasets and achieves substantially higher recognition accuracy for all the scenarios of the UW-IS Occluded dataset.Subsequently, we extend the TOPS descriptor to incorporate object color information via color embeddings and obtain the TOPS2 descriptor. The color embeddings are computed by leveraging the similarity and connectivity between colors in a color network generated using the Mapper algorithm. The accompanying THOR2 framework, trained entirely on synthetic RGB-D images of unoccluded objects, witnesses considerable performance improvements over the shape-based THOR framework on both the OCID and UW-IS Occluded datasets. THOR2 also achieves substantially higher accuracy than a state-of-the-art vision transformer adapted for RGB-D object recognition on the OCID and UW-IS Occluded dataset, regardless of the camera orientation and environmental conditions, respectively. Therefore, the approaches presented in this work, which have also been successfully implemented on a low-cost robot, lay the foundation for achieving robust object recognition in unseen environments using computational topology tools.
- Subject Added Entry-Topical Term
- Mechanical engineering.
- Subject Added Entry-Topical Term
- Robotics.
- Subject Added Entry-Topical Term
- Computer engineering.
- Index Term-Uncontrolled
- AI-enabled robotics
- Index Term-Uncontrolled
- Object recognition
- Index Term-Uncontrolled
- RGB-D perception
- Index Term-Uncontrolled
- Topological data analysis
- Added Entry-Corporate Name
- University of Washington Mechanical Engineering
- Host Item Entry
- Dissertations Abstracts International. 85-03B.
- Host Item Entry
- Dissertation Abstract International
- Electronic Location and Access
- 로그인을 한후 보실 수 있는 자료입니다.
- Control Number
- joongbu:639239