Machine Learning-Based Asthma Diagnosis Prediction Using Lung Function and Demographic Features
Keywords:
Asthma diagnosis, Machine learning models, FVC (Forced Vital Capacity), Lung function parameters, FEV1 (Forced Expiratory Volume in one second)Abstract
Asthma is a prevalent chronic respiratory disease, which poses significant diagnostic challenges because of its multifactorial nature. This study aims to develop a machine-learning approach for predicting asthma diagnosis using key features such as body mass index (BMI), age, lung function parameters (FEV1 and FVC), and demographic information. A dataset containing clinical and demographic records was utilized to train and evaluate models, including Random Forest, Neural Networks, and XGBoost classifiers. The performance of the following models was assessed using metrics such as precision, recall, accuracy, and F1-score, with Random Forest showing/exhibiting the highest predictive performance. In addition to traditional performance metrics, advanced visualization techniques like SHAP (Shapley Additive ex Planation’s) values were employed to interpret model predictions and assess feature importance. Results demonstrate that age, BMI, and lung function are key predictors of asthma diagnosis, with lung function parameters showing/exhibiting the strongest correlation with diagnosis outcomes. The study also explores various 3D and interactive visualizations to enhance the interpretability of the models. The proposed approach demonstrates that machine learning models when combined with clinical data, can accurately predict asthma diagnosis and potentially aid healthcare professionals in early detection and personalized treatment plans. This research highlights the potential of data-driven models in improving asthma diagnosis and contributing to better clinical decision-making.
References
S. A. S. Tsang Kevin C. H, Hilary Pinnock, Andrew M Wilson, “Application of Machine Learning to Support Self-Management of Asthma with mHealth,” 2020 42nd Annu. Int. Conf. IEEE Eng. Med. Biol. Soc. conjunction with 43rd Annu. Conf. Can. Med. Biol. Eng. Soc., 2020, doi: 10.1109/EMBC44109.2020.9175679.
S. J. C. T. Hilary Pinnock, Hannah L Parke, Maria Panagioti, Luke Daines, Gemma Pearce, Eleni Epiphaniou, Peter Bower, Aziz Sheikh, Chris J Griffiths, “Systematic meta-review of supported self-management for asthma: a healthcare perspective,” Prism. RECURSIVE groups, vol. 15, no. 1, p. 64, 2017, doi: 10.1186/s12916-017-0823-7.
E. Alharbi, A. Cherif, F. Nadeem, and T. Mirza, “Machine Learning Models for Early Prediction of Asthma Attacks Based on Bio-signals and Environmental Triggers,” Proc. IEEE/ACS Int. Conf. Comput. Syst. Appl. AICCSA, vol. 2022-December, 2022, doi: 10.1109/AICCSA56895.2022.10017305.
N. Kaffash-Charandabi, A. A. Alesheikh, and M. Sharif, “A ubiquitous asthma monitoring framework based on ambient air pollutants and individuals’ contexts,” Environ. Sci. Pollut. Res., vol. 26, no. 8, pp. 7525–7539, Jan. 2019, doi: 10.1007/S11356-019-04185-3.
S. A. S. Kevin C H Tsang, Hilary Pinnock, Andrew M Wilson, “Application of Machine Learning Algorithms for Asthma Management with mHealth: A Clinical Review,” J Asthma Allergy, vol. 15, pp. 855–873, 2022, doi: 10.2147/JAA.S285742.
A. M. Joseph Prinable, Peter Jones, David Boland, Cindy Thamrin, “Derivation of Breathing Metrics From a Photoplethysmogram at Rest: Machine Learning Methodology,” JMIR Mhealth Uhealth, vol. 8, no. 7, p. e13737, 2020, doi: 10.2196/13737.
“[PDF] Spirometry: step by step | Semantic Scholar.” Accessed: Feb. 15, 2025. [Online]. Available: https://www.semanticscholar.org/paper/Spirometry%3A-step-by-step-Moore/b51f939cd70887ba7b24225bdc26b5c796af1869
E. Bolat, H. Yildirim, S. Altin, and E. Yurtseven, “A COMPREHENSIVE COMPARISON OF MACHINE LEARNING ALGORITHMS ON DIAGNOSING ASTHMA DISEASE AND COPD,” PONTE Int. J. Sci. Res., vol. 76, no. 3, 2020, doi: 10.21506/J.PONTE.2020.3.17.
A. N. G. Sumanth Swaminathan , Klajdi Qirko, Ted Smith, Ethan Corcoran, Nicholas G. Wysham, Gaurav Bazaz, George Kappel, “A machine learning approach to triaging patients with chronic obstructive pulmonary disease,” plos 1, 2017, doi: https://doi.org/10.1371/journal.pone.0188532.
B.-J. H. Zne-Jung Lee, Ming-Ren Yang, “A Sustainable Approach to Asthma Diagnosis: Classification with Data Augmentation, Feature Selection, and Boosting Algorithm,” Diagnostics, vol. 14, no. 7, p. 723, 2024, doi: https://doi.org/10.3390/diagnostics14070723.
C. H. Lee, J. C. Y. Chen, and V. S. Tseng, “A novel data mining mechanism considering bio-signal and environmental data with applications on asthma monitoring,” Comput. Methods Programs Biomed., vol. 101, no. 1, pp. 44–61, Jan. 2011, doi: 10.1016/J.CMPB.2010.04.016.
A. Q. Ansari, N. K. Gupta, and Ekata, “Automatic diagnosis of asthma using neurofuzzy system,” Proc. - 4th Int. Conf. Comput. Intell. Commun. Networks, CICN 2012, pp. 819–823, 2012, doi: 10.1109/CICN.2012.55.
M. T. K. Mohammed Temraz, “Solving the class imbalance problem using a counterfactual method for data augmentation,” Mach. Learn. with Appl., vol. 9, no. 15, p. 100375, 2022, doi: https://doi.org/10.1016/j.mlwa.2022.100375.
F. K. Farnaz Farahanipad, Mohammad Rezaei, Mohammad Sadegh Nasr, “A Survey on GAN-Based Data Augmentation for Hand Pose Estimation Problem,” Technologies, vol. 10, no. 2, p. 43, 2022, doi: 10.3390/technologies10020043.
K. Nagarajaiah, “ASTHAMA PREDICTION APP USING DJANGO,” Int. Res. J. Mod. Eng. Technol. Sci., vol. 6, no. 6, pp. 221–225, 2024, [Online]. Available: https://www.researchgate.net/publication/382068471_ASTHAMA_PREDICTION_APP_USING_DJANGO
M. U. N Dinesh Kumar, “Digitalized Dust Alert System for Asthma Patients and their Care Takers,” IOP Conf. Ser. Mater. Sci. Eng, vol. 1057, p. 012092, 2021, doi: 10.1088/1757-899X/1057/1/012092.
M. P. Murugesan Dhasagounder, M Kaviyarasan, S Matheswari, “IOT based Assist Device for Pulmonary Diseased Patients Monitoring Framework,” Int. J. New Innov. Eng. Technol., 2023, [Online]. Available: https://www.researchgate.net/publication/371043675_IOT_based_Assist_Device_for_Pulmonary_Diseased_Patients_Monitoring_Framework
G. S. C. Furqan Rustam, Madiha Khalid, Waqar Aslam, Vaibhav Rupapara, Arif Mehmood, “A performance comparison of supervised machine learning models for Covid-19 tweets sentiment analysis,” PLoS One, vol. 16, no. 2, p. e0245909, 2021, doi: https://doi.org/10.1371/journal.pone.0245909.
T. Wang, Y. Bian, Y. Zhang, and X. Hou, “Classification of earthquakes, explosions and mining-induced earthquakes based on XGBoost algorithm,” Comput. Geosci., vol. 170, p. 105242, Jan. 2023, doi: 10.1016/J.CAGEO.2022.105242.
V. F. Varsha Nemade, “Machine Learning Techniques for Breast Cancer Prediction,” Procedia Comput. Sci., vol. 218, pp. 1314–1320, 2023, doi: https://doi.org/10.1016/j.procs.2023.01.110.
M. kumari and P. Ahlawat, “DCPM: an effective and robust approach for diabetes classification and prediction,” Int. J. Inf. Technol., vol. 13, no. 3, pp. 1079–1088, Jun. 2021, doi: 10.1007/S41870-021-00656-4.
P. Biswas and T. Samanta, “Anomaly detection using ensemble random forest in wireless sensor network,” Int. J. Inf. Technol., vol. 13, no. 5, pp. 2043–2052, Oct. 2021, doi: 10.1007/S41870-021-00717-8/METRICS.
S. P. Nadikatla Chandrasekhar, “Enhancing Heart Disease Prediction Accuracy through Machine Learning Techniques and Optimization,” Processes, vol. 11, no. 4, p. 1210, 2023, doi: https://doi.org/10.3390/pr11041210.
A. Sahu, H. Gm, M. K. Gourisaria, S. S. Rautaray, and M. Pandey, “Cardiovascular risk assessment using data mining inferencing and feature engineering techniques,” Int. J. Inf. Technol., vol. 13, no. 5, pp. 2011–2023, Oct. 2021, doi: 10.1007/S41870-021-00650-W/METRICS.
K. D. Rustam F, Mehmood A, Ahmad M, Ullah S, “Classification of Shopify App User Reviews Using Novel Multi Text Features,” IEEE Access, vol. 99, pp. 1–1, 2020, doi: 10.1109/ACCESS.2020.2972632.

Downloads
Published
How to Cite
Issue
Section
License
Copyright (c) 2025 50SEA

This work is licensed under a Creative Commons Attribution 4.0 International License.