Disease Clusterization Based on Patient Age and Disease Type Using K-Means Clustering

Authors

  • Jalaluddin Mahally Hasibuan Department of Mathematics, Faculty of Science and Technology, Universitas Islam Negeri Sumatera Utara
  • Hendra Cipta Department of Mathematics, Faculty of Science and Technology, Universitas Islam Negeri Sumatera Utara
  • Rini Halila Nasution Department of Mathematics, Faculty of Science and Technology, Universitas Islam Negeri Sumatera Utara

DOI:

https://doi.org/10.55227/ijhet.v4i5.521

Keywords:

Data mining, Disease classification, Hospital data, K-Means Clustering, Patient age

Abstract

This study aims to classify disease types based on patient age using the K-Means Clustering method in order to identify disease distribution patterns at Malahayati Islamic Hospital, Medan. The data used in this research consists of medical record data of patients from October to December 2024, including variables such as age, type of disease, gender, and area of residence. The research stages include data cleaning, data transformation of age and disease attributes into numerical values, and clustering analysis using the K-Means algorithm implemented through RapidMiner software. The clustering results produced three main clusters, representing high, moderate, and low disease prevalence levels. Diseases with the highest prevalence cluster include pregnancy-related cases, pneumonia, acute respiratory infections (ISPA), chronic obstructive pulmonary disease (COPD), and gastroenteritis (GEA), which are predominantly found in adult and elderly age groups. The results indicate that patient age significantly influences disease distribution patterns. This study demonstrates that K-Means Clustering is effective in identifying age-based disease patterns and can serve as a decision-support tool for healthcare planning, resource allocation, and disease prevention strategies in hospital management.

 

Downloads

Download data is not yet available.

References

Fajri, M. B., & Purnamasari, S. D. (2022). Klasterisasi Pola Penyebaran Penyakit Pasien Berdasarkan Usia Pasien Menggunakan K-Means Clustering. Journal of Information Technology Ampera, 3(3), 317–334. https://journalcomputing.org/index.php/journal-ita/index.

Lestari, S. (2022). Penerapan Algoritma K-Means Untuk Pemetaan PenyebaranPenyakit Demam Berdarah (DBD) Pada Kabupaten/Kota Di Jawa Barat. Jurnal Pendidikan Dan Konseling, 4, 1349–1358.

Nabila, Z., Rahman Isnain, A., & Abidin, Z. (2021). Analisis Data Mining Untuk Clustering Kasus Covid-19 Di Provinsi Lampung Dengan Algoritma KMeans. Jurnal Teknologi Dan Sistem Informasi (JTSI), 2(2), 100. http://jim.teknokrat.ac.id/index.php/JTSI.

Okta Jaya Harmaja., Hadirat Halawa., (2023). Implementasi Algoritma K-Means Clustering Untuk Pengelompokkan Penyakit Pasien Pada Puskesmas Pulo Brayan. Jurnal Sains dan Teknologi, Vol. 5. No. 1

Rosida, W., & Wijaya, Y. A. (2023). Klasterisasi Penyakit HIV/AIDS di Jawa Barat Menggunakan Algoritma K-Means Clustering. Blend Sains Jurnal Teknik, 1(4), 306–315. https://doi.org/10.56211/blendsains.v1i4.235

Sari, Y. R., Sudewa, A., Lestari, D. A., & Jaya, T. I. (2020). Penerapan Algoritma K-Means Untuk Clustering Data Kemiskinan Provinsi Banten Menggunakan Rapidminer. CESS (Journal of Computer Engineering, System and Science), 5(2), 192. https://doi.org/10.24114/cess.v5i2.18519

Downloads

Published

2026-01-26

How to Cite

Jalaluddin Mahally Hasibuan, Hendra Cipta, & Rini Halila Nasution. (2026). Disease Clusterization Based on Patient Age and Disease Type Using K-Means Clustering. International Journal of Health Engineering and Technology, 4(5). https://doi.org/10.55227/ijhet.v4i5.521