Vision Aid: AI-Powered Assistive Technology for the Visually Impaired
DOI:
https://doi.org/10.21467/proceedings.7.5.3Keywords:
Assistive Technology, Artificial Intelligence, Deep LearningAbstract
Vision Aid is a tech tool powered by AI that intends to increase the self-reliance and well-being of people who have impaired sight. This mobile application offers features such as medicine identification, emotion detection, and real-time facial recognition, providing intelligent support through advanced deep learning algorithms. The system, with convolutional neural network as well as via transfer learning, rapidly recognizes a number of known faces, understands multiple emotional signals, along with reads prescription labels, in addition to translating this entire store of information into real-time spoken feedback. By fostering user autonomy, the software also addresses critical issues like social isolation, medication management, and safety, creating a comprehensive and inclusive experience. The potential for scalability exists, as the data and training models can be repurposed for different datasets. For instance, an eye model could serve as a sensor for an autonomous mobile robot for tasks such as object recognition or environmental awareness. Vision Aid, along with assistive technology continuing improvement, is certainly a breakthrough for many people with visual impairments so they can improve capability in closing the divide across the environment surrounding them.
References
[1] T. Pun, P. Roth, G. Bologna, K. Moustakas, and D. Tzovaras, "Image and video processing for visually handicapped people," EURASIP J. Image Video Process. 2007, 1–15 (2007) at https://jivp-eurasipjournals.springeropen.com/articles/10.1155/2007/25214
[2] C. Shi, C. Tan, and L. Wang, "A facial expression recognition method based on a multibranch cross-connection convolutional neural network," IEEE Access 9, 1–10 (2021) at https://ieeexplore.ieee.org/iel7/6287639/9312710/09367192.pdf
[3] K. Simonyan and A. Zisserman, "Very deep convolutional networks for large-scale image recognition", arXiv preprint arXiv:1409.1556, (2014) at https://arxiv.org/abs/1409.1556
[4] L. Li, X. Mu, S. Li, and H. Peng, "A review of face recognition technology," IEEE Access 8, 123456–123467 (2020)
[5] G. Zhao, H. Yang, and M. Yu, "Expression recognition method based on a lightweight convolutional neural network," IEEE Access 8, 1–10 (2020) at https://ieeexplore.ieee.org/iel7/6287639/8948470/08952725.pdf
[6] S. D. M. Iqbal and B. Y. Suprapto, "Real-time implementation of face recognition and emotion recognition in a humanoid robot using a convolutional neural network," IEEE Access 10, 1–10 (2022) at https://ieeexplore.ieee.org/document/9864185
[7] Y. Lecun, Y. Bengio, and G. Hinton, "Deep learning," Nature 521, 436–444 (2015) at https://www.nature.com/articles/nature14539
[8] B. Mocanu, R. Tapu, and T. Zaharia, "Deep-see face: A mobile face recognition system dedicated to visually impaired people," IEEE Access 6, 1–10 (2018) at https://ieeexplore.ieee.org/iel7/6287639/8274985/08466782.pdf
[9] L. B. Neto, F. Grijalva, V. R. M. L. Maike, L. C. Martini, D. Florencio, M. C. C. Baranauskas, A. Rocha, and S. Goldenstein, "A kinect-based wearable face recognition system to aid visually impaired users," IEEE Trans. Hum.-Mach. Syst. 47(2), 1–10 (2017) at https://ieeexplore.ieee.org/document/7571103
[10] S. Zhang, F. Jiang, and M. Li, "Facial expression recognition based on improved VGG16 convolutional neural network," in Proc. 2nd Int. Conf. Signal Process. Comput. Netw. Commun., 162–168 (ACM, 2024) at https://dl.acm.org/doi/abs/10.1016/j.patcog.2016.07.026
[11] Y. Huang, F. Chen, S. Lv, and X. Wang, "Facial expression recognition: A survey," Symmetry, vol. 11, no. 10, pp. 1189, 2019) at https://www.mdpi.com/2073-8994/11/10/1189
[12] C. Szegedy, V. Vanhoucke, S. Ioffe, J. Shlens, and Z. Wojna, "Rethinking the inception architecture for computer vision," in Proc. IEEE Conf. Comput. Vis. Pattern Recognit. (CVPR), 2818–2826 (2016) at https://ieeexplore.ieee.org/document/7780677
[13] R. Girshick, "Fast R-CNN," in Proc. IEEE Int. Conf. Comput. Vis. (ICCV), 1440–1448 (2015) at https://ieeexplore.ieee.org/document/7410526
[14] S. Goodfellow, J. Pouget-Abadie, M. Mirza, B. Xu, D. Warde-Farley, S. Ozair, A. Courville, and Y. Bengio, "Generative adversarial networks," Commun. ACM 63(11), 139–144 (2020) at https://dl.acm.org/doi/10.1145/3422622
[15] J. Redmon, S. Divvala, R. Girshick, and A. Farhadi, "You only look once: Unified, real-time object detection," in Proc. IEEE Conf. Comput. Vis. Pattern Recognit. (CVPR), 779–788 (2016) at https://ieeexplore.ieee.org/document/7780460
Downloads
Published
Issue
Section
License

This work is licensed under a Creative Commons Attribution-NonCommercial 4.0 International License.