Analisis Radius Pada Algoritma Birch Berdampak Terhadap Distribusi dan Kualitas Cluster
Keywords:
Birch, Cluster, Radius ParametersAbstract
The BIRCH method is efficient in handling large data. However, the determination of the Radius (R) parameter, which is useful for determining the maximum radius of the cluster, must be considered. R values that are too small produce many small clusters (overclustering), while values that are too large produce clusters with high heterogeneity (underclustering). The R parameter affects the distribution results and the quality of the resulting clusters can be a major problem. The lack of clear guidance in determining the optimal R value can lead to the formation of inappropriate clusters or loss of important information in the data. This study aims to provide the impact of the R value on the distribution and quality of clusters in the BIRCH. Testing several R values on employee datasets that include non-linear distribution data. Analysis is carried out to identify the relationship between the R value and the resulting data distribution pattern. The eval__uation results show that R values that are too small tend to produce over-clustering, while values that are too large cause under-clustering. The best cluster quality is achieved at a balanced R value, which is adjusted to the distribution and density of the employee dataset. Thus, it is important to choose the right R value to improve the performance of the BIRCH and ensure a representative cluster distribution. This finding provides practical guidance for adjusting the R parameter in clustering applications that implement the BIRCH.References
H. Sunandar, “Machine Learning Pengenalan Anura Berdasarkan Corak dan Warna,” 2023. doi: 10.54367.
J. Yang, Z. Sun, and Y. Chen, “Fault detection using the clustering-kNN rule for gas sensor arrays,” Sensors (Switzerland), vol. 16, no. 12, Dec. 2016, doi: 10.3390/s16122069.
F. E. Ozturk, N. Demirel, and M. Bilgisi, “Comparison of the Methods to Determine Optimal Number of Cluster,” 2023. [Online]. Available: www.dergipark.gov.tr/veri
F. Ghifari, H. Rachmat, D. Sukma, and E. Atmaja, “DESIGN OF AUTOMATION INSPECTION SYSTEM USING CLUSTER IDENTIFICATION METHOD BASED ON LEATHER SHOES COLOUR AT VENAMON CORPORATION,” 2015.
J. Lei, “An extended BIRCH-based clustering algorithm for large time-series datasets,” Aug. 2016.
T. Zhang, R. Ramakrishnan, and M. Livny, “BIRCH: An Efficient Data Clustering Method for Very Large Databases,” 1996.
D. Kathiravan, “Sentence-Similarity Based Document Clustering Using Birch Algorithm,” International Journal of Innovative Research in Computer and Communication Engineering (An ISO, vol. 3297, no. 5, 2007, doi: 10.15680/ijircce.2015.0305055.
B. Yu and J. Xiong, “A Novel WSN Traffic Anomaly Detection Scheme Based on BIRCH,” Dianzi Yu Xinxi Xuebao/Journal of Electronics and Information Technology, vol. 44, no. 1, pp. 305-313, Jan. 2022, doi: 10.11999/JEIT201004.
A. Alzu’Bi and M. Barham, “Automatic BIRCH thresholding with features transformation for hierarchical breast cancer clustering,” International Journal of Electrical and Computer Engineering, vol. 12, no. 2, pp. 1498-1507, Apr. 2022, doi: 10.11591/ijece.v12i2.pp1498-1507.
A. R. Rizalde, H. A. Mubarak, G. Ramadhan, and Mohd. A. Fatan, “Comparison of K-Means, BIRCH and Hierarchical Clustering Algorithms in Clustering OCD Symptom Data,” Public Research Journal of Engineering, Data Technology and Computer Science, vol. 1, no. 2, pp. 102-108, Feb. 2024, doi: 10.57152/predatecs.v1i2.1106.
M. C. Nwadiugwu, “Gene-Based Clustering Algorithms: Comparison Between Denclue, Fuzzy-C, and BIRCH,” 2020, SAGE Publications Inc. doi: 10.1177/1177932220909851.
X. Xia, “Clustering Analysis of Interactive Learning Activities Based on Improved BIRCH Algorithm.”
Z. Yan, G. Yang, R. He, H. Yang, H. Ci, and R. Wang, “Ship Trajectory Clustering Based on Trajectory Resampling and Enhanced BIRCH Algorithm,” J Mar Sci Eng, vol. 11, no. 2, Feb. 2023, doi: 10.3390/jmse11020407.
Downloads
Published
How to Cite
Issue
Section
License
Copyright (c) 2024 KAKIFIKOM (Kumpulan Artikel Karya Ilmiah Fakultas Ilmu Komputer)

This work is licensed under a Creative Commons Attribution-NonCommercial 4.0 International License.