Disease Clusterization Based on Patient Age and Disease Type Using K-Means Clustering
DOI:
https://doi.org/10.55227/ijhet.v4i5.521Keywords:
Data mining, Disease classification, Hospital data, K-Means Clustering, Patient ageAbstract
This study aims to classify disease types based on patient age using the K-Means Clustering method in order to identify disease distribution patterns at Malahayati Islamic Hospital, Medan. The data used in this research consists of medical record data of patients from October to December 2024, including variables such as age, type of disease, gender, and area of residence. The research stages include data cleaning, data transformation of age and disease attributes into numerical values, and clustering analysis using the K-Means algorithm implemented through RapidMiner software. The clustering results produced three main clusters, representing high, moderate, and low disease prevalence levels. Diseases with the highest prevalence cluster include pregnancy-related cases, pneumonia, acute respiratory infections (ISPA), chronic obstructive pulmonary disease (COPD), and gastroenteritis (GEA), which are predominantly found in adult and elderly age groups. The results indicate that patient age significantly influences disease distribution patterns. This study demonstrates that K-Means Clustering is effective in identifying age-based disease patterns and can serve as a decision-support tool for healthcare planning, resource allocation, and disease prevention strategies in hospital management.
Downloads
References
Fajri, M. B., & Purnamasari, S. D. (2022). Klasterisasi Pola Penyebaran Penyakit Pasien Berdasarkan Usia Pasien Menggunakan K-Means Clustering. Journal of Information Technology Ampera, 3(3), 317–334. https://journalcomputing.org/index.php/journal-ita/index.
Lestari, S. (2022). Penerapan Algoritma K-Means Untuk Pemetaan PenyebaranPenyakit Demam Berdarah (DBD) Pada Kabupaten/Kota Di Jawa Barat. Jurnal Pendidikan Dan Konseling, 4, 1349–1358.
Nabila, Z., Rahman Isnain, A., & Abidin, Z. (2021). Analisis Data Mining Untuk Clustering Kasus Covid-19 Di Provinsi Lampung Dengan Algoritma KMeans. Jurnal Teknologi Dan Sistem Informasi (JTSI), 2(2), 100. http://jim.teknokrat.ac.id/index.php/JTSI.
Okta Jaya Harmaja., Hadirat Halawa., (2023). Implementasi Algoritma K-Means Clustering Untuk Pengelompokkan Penyakit Pasien Pada Puskesmas Pulo Brayan. Jurnal Sains dan Teknologi, Vol. 5. No. 1
Rosida, W., & Wijaya, Y. A. (2023). Klasterisasi Penyakit HIV/AIDS di Jawa Barat Menggunakan Algoritma K-Means Clustering. Blend Sains Jurnal Teknik, 1(4), 306–315. https://doi.org/10.56211/blendsains.v1i4.235
Sari, Y. R., Sudewa, A., Lestari, D. A., & Jaya, T. I. (2020). Penerapan Algoritma K-Means Untuk Clustering Data Kemiskinan Provinsi Banten Menggunakan Rapidminer. CESS (Journal of Computer Engineering, System and Science), 5(2), 192. https://doi.org/10.24114/cess.v5i2.18519
Downloads
Published
How to Cite
Issue
Section
License
Copyright (c) 2026 Jalaluddin Mahally Hasibuan, Hendra Cipta , Rini Halila Nasution

This work is licensed under a Creative Commons Attribution-NonCommercial 4.0 International License.
























