Keywords
3D object detection, assistive technology, ARKit, blind or low vision, voice assistance
This thesis proposes an integrated solution for 3D object detection, recognition, and presentation that increases accessibility for various user groups in indoor areas through a mobile application. The system has three major components: a 3D object detection module, an object tracking and update module, and a voice and AR-enhanced interface. The 3D object detection module combines pre-trained 2D object detectors with 3D bounding box estimation methods to detect the 3D poses and sizes of the objects in each camera frame. The module can easily adapt to various 2D object detectors (e.g., YOLO, SSD, Mask R-CNN) depending on the task and on the runtime and level-of-detail requirements of the 3D detection result, and it can run either on a cloud server or within the mobile application. The object tracking and update module minimizes the computational cost of long-term environment scanning by converting 2D tracking results into 3D results. The voice and AR-enhanced interface integrates ARKit and SiriKit to provide voice interaction and AR visualization, improving information delivery for different user groups. The system can be integrated with existing applications, especially assistive navigation, to increase travel safety for people who are blind or have low vision (BLV) and to improve social interaction for individuals with autism spectrum disorder (ASD). In addition, it can potentially be used for 3D reconstruction of the environment in other applications. Preliminary results for the object detection evaluation and real-time system performance are provided to validate the proposed system.
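The lifting of a 2D detection to a camera-frame 3D box, as described in the abstract, can be illustrated with a minimal pinhole back-projection sketch. This is our own hedged illustration, not the thesis's actual estimation method: the function name, the single-depth-per-object assumption, and the pinhole intrinsics parameters are all ours, and a single view cannot recover the box's depth extent.

```python
import numpy as np

def backproject_2d_box(box_2d, depth, fx, fy, cx, cy):
    """Lift an axis-aligned 2D box (pixel coords u_min, v_min, u_max, v_max)
    to a camera-frame 3D box center and size, given one estimated depth
    (meters) and pinhole intrinsics (fx, fy, cx, cy).

    Hypothetical helper for illustration only; the thesis's 3D bounding box
    estimation is more involved than this single-depth back-projection."""
    u_min, v_min, u_max, v_max = box_2d
    # Back-project the two opposite corners of the 2D box at the given depth
    # using the pinhole model: X = (u - cx) * Z / fx, Y = (v - cy) * Z / fy.
    x_min = (u_min - cx) * depth / fx
    y_min = (v_min - cy) * depth / fy
    x_max = (u_max - cx) * depth / fx
    y_max = (v_max - cy) * depth / fy
    center = np.array([(x_min + x_max) / 2.0, (y_min + y_max) / 2.0, depth])
    # Depth extent is unobservable from one view, so the size's z-component is 0.
    size = np.array([x_max - x_min, y_max - y_min, 0.0])
    return center, size

# Example: a 100x100-pixel box centered on the principal point at 2 m depth
# maps to a 0.4 m x 0.4 m footprint centered on the optical axis.
center, size = backproject_2d_box((270, 190, 370, 290), 2.0, 500.0, 500.0, 320.0, 240.0)
```

Any practical version would fuse depth from multiple frames or a depth sensor to recover the missing z-extent, which is one reason a tracking-and-update module over successive frames is useful.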
Chen, Jin, "A Real-Time 3D Object Detection, Recognition and Presentation System on a Mobile Device for Assistive Navigation" (2022). CUNY Academic Works.