Image-to-text Menggunakan Tesseract OCR = Image-to-text Using Tesseract OCR

Wyeth, Kelvin (2023) Image-to-text Menggunakan Tesseract OCR = Image-to-text Using Tesseract OCR. Bachelor thesis, Universitas Pelita Harapan.

Text (Title)
Title.pdf
Available under License Creative Commons Attribution Non-commercial Share Alike.
Download (96kB)

Text (Abstract)
Abstract.pdf
Available under License Creative Commons Attribution Non-commercial Share Alike.
Download (322kB)

Text (ToC)
ToC.pdf
Available under License Creative Commons Attribution Non-commercial Share Alike.
Download (595kB)

Text (Chapter1)
Chapter1.pdf
Available under License Creative Commons Attribution Non-commercial Share Alike.
Download (914kB)

Text (Chapter2)
Chapter2.pdf
Restricted to Registered users only
Available under License Creative Commons Attribution Non-commercial Share Alike.
Download (2MB)

Text (Chapter3)
Chapter3.pdf
Restricted to Registered users only
Available under License Creative Commons Attribution Non-commercial Share Alike.
Download (1MB)

Text (Chapter4)
Chapter4.pdf
Restricted to Registered users only
Available under License Creative Commons Attribution Non-commercial Share Alike.
Download (9MB)

Text (Chapter5)
Chapter5.pdf
Restricted to Registered users only
Available under License Creative Commons Attribution Non-commercial Share Alike.
Download (345kB)

Text (Bibliography)
Bibliography.pdf
Available under License Creative Commons Attribution Non-commercial Share Alike.
Download (288kB)

Text (Appendices)
Appendices.pdf
Restricted to Repository staff only
Available under License Creative Commons Attribution Non-commercial Share Alike.
Download (1MB)

Abstract

This thesis presents research carried out in the Python programming language using the Tesseract library with trained data, the LSTM engine mode, and a page segmentation mode with Orientation and Script Detection (OSD), together with the OpenCV library for image pre-processing. The objective is to assess the accuracy of Tesseract in Optical Character Recognition (OCR) and to examine how pre-processing methods and image quality affect the OCR results.
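The pipeline described in the abstract can be illustrated with a minimal sketch. It is not the thesis's own code, which is not shown on this record page. Several details are assumptions: the function names (`run_ocr`, `levenshtein`, `char_accuracy`), the use of the `pytesseract` wrapper, Otsu binarization as the pre-processing step, and the specific Tesseract options `--oem 1` (LSTM engine only) and `--psm 1` (automatic page segmentation with OSD). The accuracy measure is a character-level score derived from the Levenshtein distance, chosen because Levenshtein appears in the keywords; the exact metric used in the thesis may differ.

```python
# Assumed Tesseract options: OEM 1 = LSTM engine only,
# PSM 1 = automatic page segmentation with OSD.
TESSERACT_CONFIG = "--oem 1 --psm 1"


def run_ocr(image_path: str, lang: str = "eng") -> str:
    """Pre-process an image with OpenCV and run Tesseract on it.

    Hypothetical helper: requires the opencv-python and pytesseract
    packages plus a local Tesseract installation with tessdata.
    """
    import cv2
    import pytesseract

    image = cv2.imread(image_path)
    if image is None:
        raise FileNotFoundError(image_path)
    # Assumed pre-processing: grayscale conversion, then Otsu binarization.
    gray = cv2.cvtColor(image, cv2.COLOR_BGR2GRAY)
    _, binary = cv2.threshold(gray, 0, 255, cv2.THRESH_BINARY + cv2.THRESH_OTSU)
    return pytesseract.image_to_string(binary, lang=lang, config=TESSERACT_CONFIG)


def levenshtein(a: str, b: str) -> int:
    """Return the edit distance (insertions, deletions, substitutions)."""
    if len(a) < len(b):
        a, b = b, a
    previous = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        current = [i]
        for j, cb in enumerate(b, start=1):
            cost = 0 if ca == cb else 1
            current.append(min(previous[j] + 1,          # deletion
                               current[j - 1] + 1,       # insertion
                               previous[j - 1] + cost))  # substitution
        previous = current
    return previous[-1]


def char_accuracy(reference: str, hypothesis: str) -> float:
    """Character accuracy in [0, 1]: 1 - distance / longer length."""
    longest = max(len(reference), len(hypothesis))
    if longest == 0:
        return 1.0
    return 1.0 - levenshtein(reference, hypothesis) / longest
```

Comparing `char_accuracy(ground_truth, run_ocr(path))` across differently pre-processed versions of the same image is one way to quantify the effect of pre-processing and image quality that the abstract refers to.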

Item Type:

Thesis (Bachelor)

Creators:

Creators	NIM	Email
Wyeth, Kelvin	NIM01082190015	kelvin.w500@gmail.com

Contributors:

Contribution	Contributors	NIDN/NIDK	Email
Thesis advisor	Lukas, Samuel	NIDN0331076001	UNSPECIFIED
Thesis advisor	Murwantara, I Made	NIDN0302057305	UNSPECIFIED
Thesis advisor	Mitra, Aditya R.	NIDN0305096901	UNSPECIFIED

Uncontrolled Keywords:

Python; Jupyter Notebook; Tesseract; Optical Character Recognition (OCR); Deep Learning; Long Short-Term Memory (LSTM); OpenCV; Levenshtein; tessdata

Subjects:

Q Science > QA Mathematics > QA75 Electronic computers. Computer science

Divisions:

University Subject > Current > Faculty/School - UPH Karawaci > School of Information Science and Technology > Informatics
Current > Faculty/School - UPH Karawaci > School of Information Science and Technology > Informatics

Depositing User:

Kelvin Wyeth

Date Deposited:

04 Jul 2023 04:38

Last Modified:

05 Jul 2023 09:21

URI:

http://repository.uph.edu/id/eprint/56403