AI-DRIVEN ELECTRONIC VOCAL CONVERTER | IJCT Volume 13 – Issue 3 | IJCT-V13I3P74

International Journal of Computer Techniques
ISSN 2394-2231
Volume 13, Issue 3  |  Published: May – June 2026

Author

Shaik Abdul Nabi, Shaik Mohammed Ather, Sri Kannan, Dr.K.Akila

Abstract

This paper presents an AI-driven vocal converter aimed at improving accessibility and communication for differently-abled individuals by enabling seamless multi-modal data transformation. The system integrates speech-to-text and text-to-speech processing for real-time voice interaction, image-to-text extraction using optical character recognition (OCR) for reading printed and handwritten content, and Morse code encoding and decoding for alternative communication support. Advanced machine learning and signal processing techniques are employed to ensure high accuracy, fast response time, and robustness under varying environmental conditions. The platform is designed with a user-friendly interface to support ease of adoption and practical usability in real-world scenarios.

Keywords

Speech-to-Text,Text-to-Speech, OCR, Morse Code, Assistive Technology, Accessibility, Artificial Intelligence, Multimodal System.

Conclusion

AI-driven vocal converter systems play a vital role in enhancing accessibility, communication efficiency, and digital inclusion in modern assistive technology environments. The proposed system continuously processes multi-modal inputs including speech, text, images, and Morse code signals to deliver accurate and reliable data transformation using advanced machine learning, speech processing, and optical character recognition techniques. By enabling real-time speech-to-text and text-to-speech conversion, image-based text extraction, and Morse code encoding and decoding, the platform significantly reduces communication barriers for individuals with visual, hearing, and speech impairments. Real-Time Processing: The system supports continuous and low-latency conversion of voice, image, and symbolic inputs, allowing users to interact with digital platforms efficiently and naturally. Adaptive Intelligence: Machine learning models improve recognition accuracy over time by adapting to variations in accents, noise levels, handwriting styles, and image quality. Autonomous Operation: The platform performs automatic data interpretation and conversion without manual intervention, enabling independent usage for differently-abled users. Data Integration: Multiple conversion modules are integrated into a centralized framework, providing a unified interface and seamless workflow for multimodal communication. Assistance, reducing operational and deployment costs. In conclusion, the development and implementation of AI-driven vocal converter systems contribute significantly to inclusive technology adoption by enabling safe, reliable, and scalable communication solutions. Future enhancements may include multilingual support, mobile deployment, cloud integration, and improved model optimization to further expand accessibility and real-world applicability.

References

[1] L. Rabiner and B. H. Juang, “Fundamentals of Speech Recognition,” Prentice Hall, Upper Saddle River, NJ, USA, 1993. [2] H. Zen, A. Senior, and M. Schuster, “Statistical Parametric Speech Synthesis Using Deep Neural Networks,” Proc. IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), pp. 7962–7966, 2013. [3] T. O’Malley and D. M. Bikel, “Automatic Speech Recognition: A Deep Learning Approach,” Computational Linguistics Journal, vol. 47, no. 3, pp. 659–682, 2021. [4] R. Smith, “An Overview of the Tesseract OCR Engine,” Proc. International Conference on Document Analysis and Recognition, pp. 629–633, 2007.

How to Cite This Paper

Shaik Abdul Nabi, Shaik Mohammed Ather, Sri Kannan, Dr.K.Akila (2026). AI-DRIVEN ELECTRONIC VOCAL CONVERTER. International Journal of Computer Techniques, 13(3). ISSN: 2394-2231.

© 2026 International Journal of Computer Techniques (IJCT). All rights reserved.

Submit Your Paper