Skin Diseases Detection and Diagnosis Support System Using Yolov12s and Late Fusion Technique

Authors

  • Sahil Muneer Institute of Mathematics and Computer Science, University of Sindh, Jamshoro, Pakistan
  • Aleena Azam Institute of Mathematics and Computer Science, University of Sindh, Jamshoro, Pakistan
  • Yasir Arfat Malkani Institute of Mathematics and Computer Science, University of Sindh, Jamshoro, Pakistan
  • Noor e Hira Institute of Mathematics and Computer Science, University of Sindh, Jamshoro, Pakistan

Keywords:

clinical decision support, YOLOv12s, Multimodal Learning, Late Fusion, skin disease detection

Abstract

Millions of people worldwide suffer from dermatological conditions, but in areas with limited resources, such as Pakistan, access to prompt diagnosis is still restricted. In order to increase diagnostic reliability, this study suggests a multimodal framework for the detection of skin diseases that integrates patient-reported symptoms with visual lesion analysis. Using a late fusion approach that combines a Logistic Regression classifier trained on structured symptom features with a YOLOv12s-based object detection model for lesion localization, the system targets 16 common skin conditions. While the symptom model encodes clinical indicators such as redness, itching, and pustules, the visual model captures discriminative lesion patterns. To handle visually ambiguous situations that are challenging for Image-only approaches, outputs from both modalities were combined at the decision level. Experiments on a multimodal dataset show that the proposed fusion framework outperforms unimodal baselines in terms of accuracy and F1-score. The robustness of the proposed method under realistic dataset conditions is demonstrated through a comparison with previous research. The findings indicate that multimodal late fusion improves the performance of skin disease screening, making it suitable for tele dermatology and initial clinical decision support applications in low-resource environments.

References

H. Chen et al., (2022) published “The global burden of skin and subcutaneous diseases from 1990 to 2019: a systematic analysis of the Global Burden of Disease Study 2019,” in the British Journal of Dermatology. The paper is available in volume 187, no 3, and pages 344–353.

Henry W. Lim, Scott A.B. Collins, “The burden of skin disease in the United States,” J Am Acad Dermatol, vol. 76, no. 5, pp. 958–972, 2017, [Online]. Available: https://pubmed.ncbi.nlm.nih.gov/28259441/

L. Tomas-Aragones and S. E. Marron, (2023) published “The Psychosocial Burden of Skin Diseases,” Actas Dermo-Sifiliográficas (English Edition), This paper is available in volume 114, no 5 and pages 403–412, [Online]. Available: https://doi.org/10.1016/j.ad.2023.02.004

Awan, M. O., Khan, M. N., Ejaz, A, “The dermatology workforce crisis in Pakistan: Challenges and opportunities,” J. Pakistan Assoc. Dermatologists, vol. 30, no. 2, pp. 222–228, 2020.

K. M. Hosny, M. A. Kassem, and M. M. Fouad, (2023) published “Deep learning in dermatology: A systematic review,” Medical Image Analysis, vol. 85, p. 102744. [Online]. Available: https://doi.org/10.1016/j.media.2023.102744

P. Tschandl, C. Rosendahl, and H. Kittler, “Human–computer collaboration for skin cancer recognition,” Nature Medicine, vol. 26, pp. 1229–1234, 2020.

Y. Liu et al., “A deep learning system for differential diagnosis of skin diseases,” Nat. Med. 2020 266, vol. 26, no. 6, pp. 900–908, May 2020, doi: 10.1038/s41591-020-0842-3.

Yunjie Tian, Qixiang Ye, David Doermann, “YOLOv12: Attention-Centric Real-Time Object Detectors,” arXiv:2502.12524, 2025, [Online]. Available: https://arxiv.org/abs/2502.12524

Farhat Afza, Muhammad Sharif, “Multiclass Skin Lesion Classification Using Hybrid Deep Features Selection and Extreme Learning Machine,” Sensors, vol. 22, no. 3, p. 799, 2022, [Online]. Available: https://www.mdpi.com/1424-8220/22/3/799

Chubin Ou, Sitong Zhou, “A deep learning based multimodal fusion model for skin lesion diagnosis using smartphone collected clinical images and metadata,” Front. Surg., vol. 9, 2022, [Online]. Available: https://www.frontiersin.org/journals/

surgery/articles/10.3389/fsurg.2022.1029991/full

Sören Richard Stahlschmidt , Benjamin Ulfenborg, Jane Synnergren, “Multimodal deep learning for biomedical data fusion: a review,” Brief. Bioinform., vol. 23, no. 2, 2022, doi: https://doi.org/10.1093/bib/bbab569.

J. Terven, D. M. Córdova-Esparza, and J. A. Romero-González, “A Comprehensive Review of YOLO Architectures in Computer Vision: From YOLOv1 to YOLOv8 and YOLO-NAS,” Mach. Learn. Knowl. Extr., vol. 5, no. 4, pp. 1680–1716, Dec. 2023, doi: 10.3390/make5040083.

Manar Elshahawy, Ahmed Elnemr, “Early Melanoma Detection Based on a Hybrid YOLOv5 and ResNet Technique,” Diagnostics, vol. 13, no. 7, p. 2804, 2023, [Online]. Available: https://www.mdpi.com/2075-4418/13/17/2804

Karen Drukker, Weijie Chen, “Evaluation metrics in medical imaging AI: fundamentals, pitfalls, misapplications, and recommendations,” Eur. J. Radiol. Artif. Intell., vol. 3, p. 100030, 2025, doi: https://doi.org/10.1016/j.ejrai.2025.100030.

“Dermatology image dataset.” Accessed: Feb. 07, 2026. [Online]. Available: https://dermnetnz.org/dermatology-image-dataset

“DermNet.” Accessed: Feb. 07, 2026. [Online]. Available: https://dermnetnz.org/

“(PDF) A Survey in Deep Learning Model for Image Annotation.” Accessed: Mar. 09, 2026. [Online]. Available: https://www.researchgate.net/publication/

_A_Survey_in_Deep_Learning_Model_for_Image_Annotation

Zhengxia Zou, Keyan Chen, “Object Detection in 20 Years: A Survey,” Proc. IEEE, 2019, [Online]. Available: https://arxiv.org/abs/1905.05055

“ONNX | Home.” Accessed: Feb. 07, 2026. [Online]. Available: https://onnx.ai/

F. Pedregosa FABIANPEDREGOSA et al., “Scikit-learn: Machine Learning in Python,” J. Mach. Learn. Res., vol. 12, no. 85, pp. 2825–2830, 2011, Accessed: Jun. 16, 2024. [Online]. Available: http://jmlr.org/papers/v12/pedregosa11a.html

R. Y. Sun, “Optimization for Deep Learning: An Overview,” J. Oper. Res. Soc. China 2020 82, vol. 8, no. 2, pp. 249–294, Jun. 2020, doi: 10.1007/s40305-020-00309-6.

“ Django: The web framework for perfectionists with deadlines.” Accessed: Feb. 07, 2026. [Online]. Available: https://jashangill3592.pythonanywhere.com/article/6

“PostgreSQL: The world’s most advanced open source database.” Accessed: Feb. 07, 2026. [Online]. Available: https://www.postgresql.org/

Zakariya Ba Alawi, “A Comparative Survey of PyTorch vs TensorFlow for Deep Learning: Usability, Performance, and Deployment Trade-offs,” arXiv:2508.04035v1, vol. 8, 2025, [Online]. Available: https://arxiv.org/html/2508.04035v1

Downloads

Published

2025-11-28

How to Cite

Sahil Muneer, Aleena Azam, Yasir Arfat Malkani, & Noor e Hira. (2025). Skin Diseases Detection and Diagnosis Support System Using Yolov12s and Late Fusion Technique. International Journal of Innovations in Science & Technology, 7(10), 55–67. Retrieved from https://journal.50sea.com/index.php/IJIST/article/view/1723