Image-to-text Menggunakan Tesseract OCR = Image-to-text Using Tesseract OCR

Wyeth, Kelvin (2023) Image-to-text Menggunakan Tesseract OCR = Image-to-text Using Tesseract OCR. Bachelor thesis, Universitas Pelita Harapan.

[img]
Preview
Text (Title)
Title.pdf
Available under License Creative Commons Attribution Non-commercial Share Alike.

Download (96kB) | Preview
[img]
Preview
Text (Abstract)
Abstract.pdf
Available under License Creative Commons Attribution Non-commercial Share Alike.

Download (322kB) | Preview
[img]
Preview
Text (ToC)
ToC.pdf
Available under License Creative Commons Attribution Non-commercial Share Alike.

Download (595kB) | Preview
[img]
Preview
Text (Chapter1)
Chapter1.pdf
Available under License Creative Commons Attribution Non-commercial Share Alike.

Download (914kB) | Preview
[img] Text (Chapter2)
Chapter2.pdf
Restricted to Registered users only
Available under License Creative Commons Attribution Non-commercial Share Alike.

Download (2MB)
[img] Text (Chapter3)
Chapter3.pdf
Restricted to Registered users only
Available under License Creative Commons Attribution Non-commercial Share Alike.

Download (1MB)
[img] Text (Chapter4)
Chapter4.pdf
Restricted to Registered users only
Available under License Creative Commons Attribution Non-commercial Share Alike.

Download (9MB)
[img] Text (Chapter5)
Chapter5.pdf
Restricted to Registered users only
Available under License Creative Commons Attribution Non-commercial Share Alike.

Download (345kB)
[img]
Preview
Text (Bibliography)
Bibliography.pdf
Available under License Creative Commons Attribution Non-commercial Share Alike.

Download (288kB) | Preview
[img] Text (Appendices)
Appendices.pdf
Restricted to Repository staff only
Available under License Creative Commons Attribution Non-commercial Share Alike.

Download (1MB)

Abstract

Sinopsis laporan tugas akhir ini menampilkan penelitian yang melibatkan penggunaan bahasa pemrograman Python, menggunakan library Tesseract dengan data yang di-trained, opsi mode LSTM engine dan mode page segmentation dengan Orientation and Script Detection (OSD), dan library OpenCV untuk pemrosesan gambar. Tujuannya adalah untuk melihat keakuratan Tesseract dalam Optical Character Recognition (OCR) dan pengaruh metode pre-processing dan kualitas gambar terhadap hasil OCR. / This thesis report synopsis features research involving the usage of the Python programming language, alongside the Tesseract library, which utilizes trained data, the LSTM engine mode and the page segmentation mode with Orientation and Script detection (OSD), and the OpenCV library for the image pre-processing phase. The objective is to observe the accuracy of Tesseract in Optical Character Recognition (OCR) and the impact of the pre-processing methods and image quality on the OCR results.

Item Type: Thesis (Bachelor)
Creators:
CreatorsNIMEmail
Wyeth, KelvinNIM01082190015kelvin.w500@gmail.com
Contributors:
ContributionContributorsNIDN/NIDKEmail
Thesis advisorLukas, SamuelNIDN0331076001UNSPECIFIED
Thesis advisorMurwantara, I MadeNIDN0302057305UNSPECIFIED
Thesis advisorMitra, Aditya R.NIDN0305096901UNSPECIFIED
Uncontrolled Keywords: Python; Jupyter Notebook; Tesseract; Optical Character Recognition (OCR); Deep Learning; Long Short-Term Memory (LSTM); Tesseract; OpenCV; Levenshtein; tessdata;
Subjects: Q Science > QA Mathematics > QA75 Electronic computers. Computer science
Divisions: University Subject > Current > Faculty/School - UPH Karawaci > School of Information Science and Technology > Informatics
Current > Faculty/School - UPH Karawaci > School of Information Science and Technology > Informatics
Depositing User: Kelvin Wyeth
Date Deposited: 04 Jul 2023 04:38
Last Modified: 05 Jul 2023 09:21
URI: http://repository.uph.edu/id/eprint/56403

Actions (login required)

View Item View Item