Prediksi penyakit hati menggunakan model support vector machine dan logistic regression dengan kombinasi PCA dan SMOTE

Happy, Julina (2022) Prediksi penyakit hati menggunakan model support vector machine dan logistic regression dengan kombinasi PCA dan SMOTE. Bachelor thesis, Universitas Pelita Harapan.

[img]
Preview
Text (Title)
Title.pdf
Available under License Creative Commons Attribution Non-commercial Share Alike.

Download (331kB) | Preview
[img] Text (Abstract)
Abstract.pdf
Restricted to Registered users only
Available under License Creative Commons Attribution Non-commercial Share Alike.

Download (399kB)
[img] Text (ToC)
ToC.pdf
Restricted to Registered users only
Available under License Creative Commons Attribution Non-commercial Share Alike.

Download (689kB)
[img] Text (Chapter1)
Chapter1.pdf
Restricted to Registered users only
Available under License Creative Commons Attribution Non-commercial Share Alike.

Download (851kB)
[img] Text (Chapter2)
Chapter2.pdf
Restricted to Registered users only
Available under License Creative Commons Attribution Non-commercial Share Alike.

Download (3MB)
[img] Text (Chapter3)
Chapter3.pdf
Restricted to Registered users only
Available under License Creative Commons Attribution Non-commercial Share Alike.

Download (1MB)
[img] Text (Chapter4)
Chapter4.pdf
Restricted to Registered users only
Available under License Creative Commons Attribution Non-commercial Share Alike.

Download (2MB)
[img] Text (Chapter5)
Chapter5.pdf
Restricted to Registered users only
Available under License Creative Commons Attribution Non-commercial Share Alike.

Download (292kB)
[img] Text (Bibliography)
Bibliography.pdf
Restricted to Registered users only
Available under License Creative Commons Attribution Non-commercial Share Alike.

Download (750kB)
[img] Text (Appendices)
Appendices.pdf
Restricted to Repository staff only
Available under License Creative Commons Attribution Non-commercial Share Alike.

Download (18MB)

Abstract

Hati merupakan salah satu organ yang penting pada tubuh manusia, fungsi utama hati adalah untuk menyaring darah dalam tubuh, memetabolisme racun dan obat dalam tubuh, ketika hati tidak menjalankan fungsinya dengan baik, maka kinerja tubuh akan terganggu. Oleh karena itu, diperlukan diagnosis penyakit hati yang lebih awal untuk memberikan harapan kesembuhan yang lebih tinggi bagi penderitanya. Penelitian ini dilakukan dengan menggunakan model supervised machine learning SVM dan LR, dimana dalam tahapan preprocessing digunakan metode PCA dan SMOTE untuk melihat pengaruh yang diberikan terhadap model. Dengan menggunakan Confusion Matrix untuk mengevaluasi hasil prediksi, hasil akurasi yang didapatkan sebelum metode PCA dan SMOTE adalah LR dengan 70% dan SVM dengan 88%, dan hasil akurasi yang didapatkan setelah metode PCA dan SMOTE adalah LR dengan 64% SVM dengan 87%. Dari hasil tersebut, algoritma SVM mempunyai hasil prediksi yang lebih baik dibandingkan dengan LR pada dataset ini dan terjadi penurunan hasil akurasi setelah penggunaan PCA dan SMOTE pada kedua algoritma. Jika dilihat dari pengurangan variabel independen menjadi lima, tidak terdapat perubahan yang signifikan dalam hasil prediksi, juga pada hasil Precision, Recall, F1-Score, dan Confusion Matrix hasil prediksi telah menuju ke arah yang lebih baik, dan melalui waktu eksekusi yang didapatkan, PCA dan SMOTE mempunyai pengaruh yang lebih terlihat pada algoritma LR. / Liver is one of the most important parts of the body, liver is used for detoxification of medicine and toxin, filtering the blood, and many more. When liver is not functioning properly then the whole function of the body is going to be affected, resulting in someone to be in a dangerous situation and even death. An early diagnosis of the liver disease will let the patient to have an earlier treatment, therefore resulting in a higher chance of survival. This final paper is using supervised machine learning SVM and LR for the modeling, where in the preprocessing stage will be using the PCA and SMOTE method to see the influenced of PCA and SMOTE in the machine learning model. Using Confusion Matrix as the performance evaluation, the prediction result of the model before PCA and SMOTE is LR with 70% of accuracy and SVM with 88% of accuracy, and after implementing PCA and SMOTE the accuracy result is LR with 64% of accuracy and SVM with 87% of accuracy. From the result, SVM gave a better prediction and the both algorithm result has gone down while using the PCA and SMOTE method. But, seeing that the independent variable has been reduced to five, the change of the prediction result isn’t that significant compared to the original dataset prediction result, and the prediction result have improved to a better prediction seeing from Precision, Recall, and F1-Score. From the aspect of execution time, the PCA and SMOTE method gave a better influence towards LR model than SVM.

Item Type: Thesis (Bachelor)
Creators:
CreatorsNIMEmail
Happy, JulinaNIM03082180002julina2630@gmail.com
Contributors:
ContributionContributorsNIDN/NIDKEmail
Thesis advisorBarus, Okky PutraNIDN0127068803okky.barus@uph.edu
Uncontrolled Keywords: klasifikasi; machine learning; support vector machine; logistic regression; PCA; SMOTE; confusion matrix
Subjects: Q Science > QA Mathematics > QA75 Electronic computers. Computer science
Divisions: University Subject > Current > Faculty/School - UPH Medan > School of Information Science and Technology > Informatics
Current > Faculty/School - UPH Medan > School of Information Science and Technology > Informatics
Depositing User: Users 24139 not found.
Date Deposited: 16 Aug 2022 03:32
Last Modified: 25 Aug 2022 03:32
URI: http://repository.uph.edu/id/eprint/49579

Actions (login required)

View Item View Item