Image-to-text Menggunakan Tesseract OCR = Image-to-text Using Tesseract OCR

Wyeth, Kelvin (2023) Image-to-text Menggunakan Tesseract OCR = Image-to-text Using Tesseract OCR. Bachelor thesis, Universitas Pelita Harapan.

Files (all available under License Creative Commons Attribution Non-commercial Share Alike):
Title.pdf (96kB)
Abstract.pdf (322kB)
ToC.pdf (595kB)
Chapter1.pdf (914kB)
Chapter2.pdf (2MB), restricted to registered users only
Chapter3.pdf (1MB), restricted to registered users only
Chapter4.pdf (9MB), restricted to registered users only
Chapter5.pdf (345kB), restricted to registered users only
Bibliography.pdf (288kB)
Appendices.pdf (1MB), restricted to repository staff only

Abstract

This thesis report presents research carried out in the Python programming language using the Tesseract library with trained data, the LSTM engine mode, and a page segmentation mode with Orientation and Script Detection (OSD), together with the OpenCV library for image pre-processing. The objective is to observe the accuracy of Tesseract in Optical Character Recognition (OCR) and the impact of pre-processing methods and image quality on the OCR results.
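The setup described in the abstract can be sketched roughly as follows. This is a minimal illustration, not the thesis code: the helper names (build_tesseract_config, ocr_image, levenshtein, character_accuracy) are hypothetical, the choice of `--oem 1` (LSTM engine only) and `--psm 1` (automatic page segmentation with OSD) is an assumption about which Tesseract modes were used, grayscale plus Otsu thresholding stands in for whatever pre-processing the thesis applied, and the Levenshtein-based accuracy score is one plausible reading of the "Levenshtein" keyword. The OCR function assumes the pytesseract and opencv-python packages and a Tesseract installation with the relevant tessdata files.

```python
def build_tesseract_config(oem: int = 1, psm: int = 1) -> str:
    """Build a Tesseract config string.

    --oem 1 selects the LSTM engine only; --psm 1 selects automatic page
    segmentation with OSD. These defaults are assumptions, not values
    confirmed by the thesis.
    """
    return f"--oem {oem} --psm {psm}"


def ocr_image(path: str, lang: str = "eng") -> str:
    """Pre-process an image with OpenCV, then run Tesseract on it."""
    # Imported lazily so the scoring helpers below work without these packages.
    import cv2
    import pytesseract

    image = cv2.imread(path)
    if image is None:
        raise FileNotFoundError(path)
    # Assumed pre-processing: grayscale conversion and Otsu binarization.
    gray = cv2.cvtColor(image, cv2.COLOR_BGR2GRAY)
    _, binary = cv2.threshold(gray, 0, 255, cv2.THRESH_BINARY + cv2.THRESH_OTSU)
    return pytesseract.image_to_string(
        binary, lang=lang, config=build_tesseract_config()
    )


def levenshtein(a: str, b: str) -> int:
    """Edit distance (insertions, deletions, substitutions) between a and b."""
    previous = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        current = [i]
        for j, cb in enumerate(b, start=1):
            cost = 0 if ca == cb else 1
            current.append(
                min(previous[j] + 1, current[j - 1] + 1, previous[j - 1] + cost)
            )
        previous = current
    return previous[-1]


def character_accuracy(ocr_text: str, ground_truth: str) -> float:
    """Score in [0, 1]: 1 minus edit distance over ground-truth length."""
    if not ground_truth:
        return 1.0 if not ocr_text else 0.0
    distance = levenshtein(ocr_text, ground_truth)
    return max(0.0, 1.0 - distance / len(ground_truth))
```

With helpers like these, the effect of a pre-processing step could be compared by calling character_accuracy on the OCR output of the raw and the pre-processed image against the same ground-truth text.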
Item Type: Thesis (Bachelor)
Creators: Wyeth, Kelvin (NIM: NIM01082190015; Email: kelvin.w500@gmail.com; ORCID: UNSPECIFIED)
Contributors:
Thesis advisor: Lukas, Samuel (NIDN/NIDK: NIDN0331076001; Email: UNSPECIFIED)
Thesis advisor: Murwantara, I Made (NIDN/NIDK: NIDN0302057305; Email: UNSPECIFIED)
Thesis advisor: Mitra, Aditya R. (NIDN/NIDK: NIDN0305096901; Email: UNSPECIFIED)
Uncontrolled Keywords: Python; Jupyter Notebook; Tesseract; Optical Character Recognition (OCR); Deep Learning; Long Short-Term Memory (LSTM); OpenCV; Levenshtein; tessdata
Subjects: Q Science > QA Mathematics > QA75 Electronic computers. Computer science
Divisions: University Subject > Current > Faculty/School - UPH Karawaci > School of Information Science and Technology > Informatics
Current > Faculty/School - UPH Karawaci > School of Information Science and Technology > Informatics
Depositing User: Kelvin Wyeth
Date Deposited: 04 Jul 2023 04:38
Last Modified: 05 Jul 2023 09:21
URI: http://repository.uph.edu/id/eprint/56403
