kha-white
/

manga-ocr-base

vision-encoder-decoder

image-text-to-text

Inference Endpoints

Model card Files Files and versions Community

manga-ocr-base / README.md

kha-white's picture

Fix dead link (#1)

aa6573b over 2 years ago

|

history blame contribute delete

705 Bytes

	---
	language: ja
	tags:
	- image-to-text
	license: apache-2.0
	datasets:
	- manga109s
	---

	# Manga OCR

	Optical character recognition for Japanese text, with the main focus being Japanese manga.

	It uses [Vision Encoder Decoder](https://huggingface.co/docs/transformers/model_doc/vision-encoder-decoder) framework.

	Manga OCR can be used as a general purpose printed Japanese OCR, but its main goal was to provide a high quality
	text recognition, robust against various scenarios specific to manga:
	- both vertical and horizontal text
	- text with furigana
	- text overlaid on images
	- wide variety of fonts and font styles
	- low quality images

	Code is available [here](https://github.com/kha-white/manga_ocr).