IRMA-International.org: Creator of Knowledge
Information Resources Management Association
Advancing the Concepts & Practices of Information Resources Management in Modern Organizations

Phoenix Precision Algorithm for Blind People With Enhanced Voice Assistant

Phoenix Precision Algorithm for Blind People With Enhanced Voice Assistant
View Sample PDF
Author(s): Judy Flavia B. (SRM Institute of Science and Technology, India), S. Sridevi (SRM Instıtute of Science and Technology, India), V. Srivathsan (SRM Instıtute of Science and Technology, India), Aravindak Kumar R. K. (SRM Instıtute of Science snd Technology, India), Ashwin Kumar M. K. (SRM Instıtute of Science and Technology, India), S. Rubin Bose (SRM Instıtute of Science and Technology, India)and R. Regin (SRM Instıtute of Science and Technology, India)
Copyright: 2024
Pages: 18
Source title: Advanced Applications of Generative AI and Natural Language Processing Models
Source Author(s)/Editor(s): Ahmed J. Obaid (University of Kufa, Iraq), Bharat Bhushan (School of Engineering and Technology, Sharda University, India), Muthmainnah S. (Universitas Al Asyariah Mandar, Indonesia)and S. Suman Rajest (Dhaanish Ahmed College of Engineering, India)
DOI: 10.4018/979-8-3693-0502-7.ch016

Purchase

View Phoenix Precision Algorithm for Blind People With Enhanced Voice Assistant on the publisher's website for pricing and purchasing information.

Abstract

The chapter presents an innovative approach to object detection that combines the advantages of the DETR (DEtection TRansformer) and RetinaNet models and features a phoenix precision algorithm. Object tracking is a basic computer vision task for identifying and locating objects in an image. The DETR model revolutionized object detection by introducing a transformer-based architecture that eliminates the need for anchor boxes rather than maximum damping, resulting in industry-leading performance. On the other hand, RetinaNet is a popular single-stage object detection model known for its efficiency and accuracy. This chapter proposes a hybrid model that uses both DETR and RetinaNet. The transformer-based architecture of the DETR model provides an excellent understanding of the overall context and allows you to capture long-range dependencies and maintain object associations. Meanwhile, RetinaNet's pyramid array (FPN) and focus loss enable precise localization and manipulation of objects at different scales.

Related Content

Wasswa Shafik. © 2024. 25 pages.
Muthmainnah Muthmainnah, Eka Apriani, Prodhan Mahbub Ibna Seraj, Ahmed J. Obaid, Ahmad M. Al Yakin. © 2024. 17 pages.
Arkar Htet, Sui Reng Liana, Theingi Aung, Amiya Bhaumik. © 2024. 26 pages.
Shwetha Baliga, Harshith K. Murthy, Apoorv Sadhale, Dhruti Upadhyaya. © 2024. 18 pages.
Manoj Kumar Pandey, Jyoti Upadhyay. © 2024. 21 pages.
R. Angeline, S. Aarthi, Rishabh Jain, Muzamil Faisal, Abishek Venkatesan, R. Regin. © 2024. 16 pages.
Gagan Deep, Jyoti Verma. © 2024. 20 pages.
Body Bottom