EXPLORATORY STATISTICS USING R PROGRAMMING FOR DIABETES MELLITUS DATASET

Authors

  • Yih Jeng Cheong, Department of Digital Health and Informatics, International Medical University Malaysia (IMU), Bukit Jalil, 57000 Kuala Lumpur, Malaysia
  • Khairani Abd. Majid, Department of Digital Health and Informatics, International Medical University Malaysia (IMU), Bukit Jalil, 57000 Kuala Lumpur, Malaysia
  • Mohd Syazwan Mohamad Anuar, Department of Mathematics, Center for Defence Foundation Studies, National Defence University of Malaysia, Sg. Besi Camp, 57000 Kuala Lumpur, Malaysia

Keywords:

Exploratory Data Analysis, Diabetes Mellitus, R Programming, Boxplot, Correlation

Abstract

Exploratory data analysis is an essential step in any research project. It helps researchers organize the dataset, understand the variables, identify relationships between variables, choose an appropriate model, and find patterns in the data. In this study we chose a healthcare dataset concerning diabetes mellitus. We explored the dataset and drew a few conclusions. We identified the relationships between the variables in the dataset and diabetes mellitus, and presented the correlations between the variables and the disease graphically. We developed R code to assist the analysis and to produce the graphical representations. While conducting the exploratory statistics, we discovered flaws in the data collection and proposed a few steps to advance the research toward developing a predictive model.
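The kind of workflow the abstract describes (summarising variables, checking data quality, drawing boxplots, and computing correlations) might look roughly like the following minimal sketch in base R. The data frame `diabetes`, its column names (`glucose`, `bmi`, `age`, `outcome`), and every value in it are hypothetical placeholders chosen for illustration; they are not the authors' dataset or code.

```r
# Hypothetical toy data: column names and values are illustrative only
# and are NOT taken from the dataset analysed in the article.
diabetes <- data.frame(
  glucose = c(85, 90, 110, 150, 165, 99, 140, 180),
  bmi     = c(22.1, 24.3, 27.8, 31.2, 33.5, 23.9, 29.4, 35.0),
  age     = c(25, 31, 45, 52, 60, 29, 48, 63),
  outcome = c(0, 0, 0, 1, 1, 0, 1, 1)  # assumed coding: 1 = diabetic, 0 = not
)

# Inspect the structure and basic summary statistics of each variable
str(diabetes)
summary(diabetes)

# Count missing values per column, a simple data-quality check
missing_counts <- colSums(is.na(diabetes))
print(missing_counts)

# Boxplot of one variable split by the outcome variable
boxplot(glucose ~ outcome, data = diabetes,
        xlab = "Outcome (assumed: 0 = no diabetes, 1 = diabetes)",
        ylab = "Glucose",
        main = "Glucose by outcome (toy data)")

# Pearson correlation matrix across all numeric columns
cor_matrix <- cor(diabetes)
print(round(cor_matrix, 2))
```

With a real dataset, the data frame would normally be loaded with `read.csv()` instead of being typed in, and the missing-value check is one place where flaws in data collection tend to surface.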

Published

30-05-2025

How to Cite

Cheong, Y. J., Abd. Majid, K., & Mohamad Anuar, M. S. (2025). EXPLORATORY STATISTICS USING R PROGRAMMING FOR DIABETES MELLITUS DATASET. Zulfaqar Journal of Defence Science, Engineering & Technology, 8(1). Retrieved from https://zulfaqarjdset.upnm.edu.my/index.php/zjdset/article/view/146
