Bio Larynx: When Your Voice Knows Before You Do

Usha Dhankar; Priyansha Sachdev; Swastik Goomber; Supriya Kumari; Shivam Bansal

doi:10.21467/proceedings.7.6.34

Authors

Usha Dhankar, HMR Institute of Technology and Management, Hamidpur
Priyansha Sachdev, HMR Institute of Technology and Management, Hamidpur
Swastik Goomber, HMR Institute of Technology and Management, Hamidpur
Supriya Kumari, HMR Institute of Technology and Management, Hamidpur
Shivam Bansal, HMR Institute of Technology and Management, Hamidpur

Keywords:

Speech Biomarkers, Neurological and Vocal Disorders, Artificial Intelligence in Healthcare

Abstract

Neurological and vocal disorders such as Parkinson’s Disease, Alzheimer’s Disease, stroke-related impairments, and vocal fold pathologies often go undetected in their early stages because diagnosis relies on subjective assessments, expensive imaging techniques, or invasive procedures. These constraints make it difficult to diagnose conditions early, monitor them effectively, and ensure accessible care, especially in settings with limited resources. To demonstrate the potential of artificial intelligence in this context, this study analyzes speech-derived biomarkers from existing datasets to explore how AI can assist in identifying and understanding neurological and vocal disorders. Built upon publicly available datasets, the system integrates a complete analytical pipeline: data preparation, exploratory analysis, feature selection, machine-learning-based classification, and statistical interpretation. It employs both statistical methods and ensemble learning techniques to identify robust acoustic and prosodic features relevant across multiple disorders. A suite of classifiers, ranging from logistic regression to gradient boosting and neural networks, is trained and validated using stratified cross-validation. Significant biomarkers are further examined using nonparametric tests and effect size estimation. The parameters analyzed include fundamental frequency, speech duration, pitch period entropy, and detrended fluctuation analysis, among others. Unsupervised clustering and dimensionality reduction techniques are also applied to explore latent subgroups within patient populations. The system outputs interactive visualizations and automatically generated reports, offering a transparent, reproducible, and scalable approach to voice-based health diagnostics.
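
The abstract mentions nonparametric tests with effect size estimation but does not name the specific test. As a minimal sketch, assuming a Mann-Whitney U test with a rank-biserial effect size (one common choice, not necessarily the authors' method), the comparison of a single biomarker between two groups could look like the following. The function name `mann_whitney_u` and the sample values are hypothetical and purely illustrative; they are not data from the study.

```python
import math


def mann_whitney_u(x, y):
    """Compare two samples with a Mann-Whitney U test.

    Returns (u_x, p_value, rank_biserial), where u_x counts the pairs in
    which an x value exceeds a y value (ties count one half), p_value is a
    two-sided normal approximation with tie and continuity corrections,
    and rank_biserial lies in [-1, 1] (positive when x tends to be larger).
    """
    n1, n2 = len(x), len(y)
    n = n1 + n2
    # Pool both groups, remembering which group each value came from.
    pooled = sorted([(v, 0) for v in x] + [(v, 1) for v in y])

    rank_sum_x = 0.0
    tie_term = 0.0
    i = 0
    while i < n:
        # Find the run of tied values starting at position i.
        j = i
        while j < n and pooled[j][0] == pooled[i][0]:
            j += 1
        avg_rank = (i + 1 + j) / 2.0  # mean of ranks i+1 .. j
        t = j - i
        tie_term += t ** 3 - t
        rank_sum_x += avg_rank * sum(1 for k in range(i, j) if pooled[k][1] == 0)
        i = j

    u_x = rank_sum_x - n1 * (n1 + 1) / 2.0
    mean_u = n1 * n2 / 2.0
    var_u = n1 * n2 / 12.0 * ((n + 1) - tie_term / (n * (n - 1)))

    if var_u == 0:
        p_value = 1.0  # all values identical: no evidence of a difference
    else:
        z = max(abs(u_x - mean_u) - 0.5, 0.0) / math.sqrt(var_u)
        p_value = math.erfc(z / math.sqrt(2.0))

    rank_biserial = 2.0 * u_x / (n1 * n2) - 1.0
    return u_x, p_value, rank_biserial


# Hypothetical pitch period entropy values (illustrative numbers only).
patients = [0.28, 0.31, 0.25, 0.33, 0.30]
controls = [0.12, 0.15, 0.19, 0.14, 0.21]
u, p, r = mann_whitney_u(patients, controls)
print(f"U={u:.1f}, p={p:.4f}, rank-biserial r={r:.2f}")
```

The normal approximation is only rough for groups this small; an exact test would be preferable in practice. The effect size is reported alongside the p-value because it indicates how strongly the groups separate, independent of sample size.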
