Implementasi Metode Regresi dengan Analisis Komponen Utama pada Kasus Diare Balita di Jawa Tengah

Moch Anjas Aprihartha, Mohamad Rijal Arfani, Indah Putianik

Abstract


Multicollinearity is a condition that occurs in a regression model when there is a high correlation between independent variables. To address the problem of multicollinearity, the principal component regression technique is used. Through this approach, the causes of health problems can be identified more accurately. Central Java Province ranks 11th out of 34 provinces with a proportion of diarrhea cases in toddlers of 11.1%. The relatively high proportion of diarrhea in toddlers emphasizes the urgency of modeling its risk factors. Therefore, the purpose of this study was to apply the principal component regression method to cases of diarrhea in toddlers in Central Java. The regression model produced an R^2 value of 96.2%, which is interpreted as a very good model in identifying the relationship between the causes of diarrhea in toddlers and the number of toddlers suffering from diarrhea in districts/cities in Central Java. The factors that have a significant influence include the use of proper sanitation, malnourished toddlers, active integrated health posts, and fully immunized toddlers.

Keywords: Analysis, Diarrhea, Multicollinearity, Principal component analysis, Regression, Toddlers.

Multikolinearitas merupakan kondisi yang terjadi pada model regresi ketika adanya korelasi tinggi antara variabel independen. Untuk mengatasi masalah multikolinearitas maka digunakan teknik regresi komponen utama. Melalui pendekatan tersebut, faktor-faktor risiko kesehatan dapat diidentifikasi secara lebih akurat. Provinsi Jawa Tengah menduduki urutan ke 11 dari 34 provinsi dengan proporsi kasus diare pada balita ialah 11,1%. Angka proporsi diare balita yang cukup tinggi mempertegas urgensi pemodelan faktor risikonya, sehingga tujuan penelitian ini adalah menerapkan metode regresi komponen utama pada kasus diare balita di Jawa Tengah. Model regresi menghasilkan nilai  yang dapat diartikan sebagai model sangat baik dalam menggambarkan hubungan penyebab diare pada balita terhadap jumlah balita menderita diare di kabupaten/ kota di Jawa Tengah. Adapun faktor-faktor yang berpengaruh signifikan diantaranya pengguna sanitasi layak, balita yang kekurangan gizi, posyandu aktif, dan balita yang diimunisasi dasar lengkap.


Keywords


Analisis komponen utama, Balita, Diare, Multikolinearitas, Regresi

Full Text:

PDF

References


Aflah, F. R., Risnawati, R., & Hamdani, M. F. (2025). Penerapan regresi linier berganda dalam menilai hubungan antar variabel dalam penelitian kuantitatif. Innovative: Journal Of Social Science Research, 5(3), 4195-4211.

Anjas, M., Sukarsa, I. K. G., & Kencana, I. P. E. N. (2019). Penerapan metode Geographically Weighted Regression (GWR) pada kasus penyakit pneumonia di provinsi Jawa Timur. E-Jurnal Matematika, 8(1), 27-34.

Aprihartha, M. A. (2025). Pemilihan model regresi linear berganda terbaik untuk menentukan faktor-faktor penyebab kasus balita gizi buruk di Jawa Tengah. Jurnal EurekaMatika, 13(1), 35-46.

Ari, M. (2014). Identifikasi perilaku calon mahasiswa baru dalam memilih program studi menggunakan analisis faktor. Paradigma, 16(1), 34-44.

Azhar, K., Dwi Sisca, K., & Hapsari, D. (2015). Diare balita di Provinsi DKI Jakarta ditinjau dari aspek air minum, sanitasi dan PHBS (Analisis Data Riskesdas 2013). Jurnal Ekologi Kesehatan, 14(1), 29-40.

Biau, D. J., Jolles, B. M., & Porcher, R. (2010). P value and the theory of hypothesis testing: an explanation for new researchers. Clinical Orthopaedics and Related Research®, 468(3), 885-892.

Charitaki, G., Soulis, S. G., & Tyropoli, R. (2021). Academic self-regulation in autism spectrum disorder: A principal components analysis. International Journal of Disability, Development and Education, 68(1), 26-45.

Choi, J., & Yun, J. I. (2025). Optimization of water chemistry to mitigate corrosion products in nuclear power plants using big data and multiple linear regression in machine learning. Progress in Nuclear Energy, 183, 105685.

Chen, K., Liu, K., Zhou, Y., Li, Y., Wu, G., Gao, G., ... & Djerdir, A. (2025). State of health prognosis for polymer electrolyte membrane fuel cell based on principal component analysis and Gaussian process regression. International Journal of Hydrogen Energy, 98, 933-943.

Dalić, I., & Terzić, S. (2021). Violation of the assumption of homoscedasticity and detection of heteroscedasticity. Decision Making: Applications in Management and Engineering, 4(1), 1-18.

Hanusz, Z., & TarasiŃska, J. (2014). Simulation study on improved Shapiro–Wilk tests for normality. Communications in Statistics-Simulation and Computation, 43(9), 2093-2105.

Jannah, M., & Susanti, R. (2023). Faktor yang memengaruhi kejadian diare pada balita di Indonesia dengan menggunakan regresi data panel. Prosiding Seminar Nasional Matematika dan Statistika, 3(1), 168-182.

Khoirunnisa, A., Nyoman, I. G., & Jaya, M. (2019). Modelling and mapping the average of incidence rate diarrhea among toddlers in Bandung city in 2013-2018 using Geographically Weighted Regression. International Journal of Innovative Science, Engineering & Technology, 6(11), 115-19.

Kristjanpoller, W., & Minutolo, M. C. (2018). A hybrid volatility forecasting framework integrating GARCH, artificial neural network, technical analysis and principal components analysis. Expert Systems with Applications, 109, 1-11.

Li, B., Lu, Y., Sun, X., Chen, X., Gong, W., & Miao, F. (2024). Radial artery pulse wave age-related assessment for diabetic patients based on multiple linear regression time domain analysis method. Extreme Mechanics Letters, 70, 102185.

Lu, X., Teh, S. Y., Tay, C. J., Kassim, N. F. A., Fam, P. S., & Soewono, E. (2025). Application of multiple linear regression model and long short-term memory with compartmental model to forecast dengue cases in Selangor, Malaysia based on climate variables. Infectious Disease Modelling, 10(1), 240-256.

Marcus, G. L., Wattimanela, H. J., & Lesnussa, Y. A. (2012). Analisis regresi komponen utama untuk mengatasi masalah multikolinieritas dalam analisis regresi linier berganda. BAREKENG: Jurnal Ilmu Matematika Dan Terapan, 6(1), 31-40.

Pendi, P. (2021). Analisis regresi dengan metode komponen utama dalam mengatasi masalah multikolinearitas. Bimaster: Buletin Ilmiah Matematika, Statistika dan Terapannya, 10(1).

Putri, F. C., Suciptawati, N. L. P., & Susilawati, M. (2023). Implementasi metode Geographically Weighted Regression (GWR) pada kasus diare balita di Provinsi Jawa Timur. E-Jurnal Matematika, 12(2), 92.

Sholihah, S. M. A., Aditiya, N. Y., Evani, E. S., & Maghfiroh, S. (2023). Konsep uji asumsi klasik pada regresi linier berganda. Jurnal Riset Akuntansi Soedirman, 2(2), 102-110.

Sitanggang, H. D., Ramadhanti, I. T., & Halim, R. D. (2022). Faktor risiko kejadian diare pada anak balita (12-59 bulan) di Puskesmas Kota Jambi. Riset Informasi Kesehatan, 11(1), 54-63.

Widyaningrum, A. R., Susanti, Y., & Slamet, I. (2021). Pemodelan penyakit diare balita di Jawa Timur menggunakan regresi robust. In SINASIS (Seminar Nasional Sains) (Vol. 2, No. 1).

Vu, D. H., Muttaqi, K. M., & Agalgaonkar, A. P. (2015). A variance inflation factor and backward elimination based robust regression model for forecasting monthly electricity demand using climatic variables. Applied Energy, 140, 385-394.




DOI: https://doi.org/10.17509/jem.v13i2.90359

Refbacks

  • There are currently no refbacks.


Copyright (c) 2025 Mathematics Study Program, Universitas Pendidikan Indonesia

Creative Commons License
This work is licensed under a Creative Commons Attribution-ShareAlike 4.0 International License.