Tesseract Open Source OCR Engine (main repository)
-
Updated
May 16, 2022 - C++
{{ message }}
Tesseract Open Source OCR Engine (main repository)
Pure Javascript OCR for more than 100 Languages
ShareX is a free and open source program that lets you capture or record any area of your screen and share it with a single press of a key. It also allows uploading images, text or other types of files to many supported destinations you can choose from.
Ready-to-use OCR with 80+ supported languages and all popular writing scripts including Latin, Chinese, Arabic, Devanagari, Cyrillic and etc.
Experience, Learn and Code the latest breakthrough innovations with Microsoft AI
OCRmyPDF adds an OCR text layer to scanned PDF files, allowing them to be searched
The Swift machine learning library.
Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities
A supercharged version of paperless: scan, index and archive all your physical documents
yolo3+ocr
Fast and simple OCR library written in Swift
TNN: developed by Tencent Youtu Lab and Guangying Lab, a uniform deep learning inference framework for mobile、desktop and server. TNN is distinguished by several outstanding features, including its cross-platform capability, high performance, model compression and code pruning. Based on ncnn and Rapidnet, TNN further strengthens the support and performance optimization for mobile devices, and also draws on the advantages of good extensibility and high performance from existed open source efforts. TNN has been deployed in multiple Apps from Tencent, such as Mobile QQ, Weishi, Pitu, etc. Contributions are welcome to work in collaborative with us and make TNN a better framework.
Transforms PDF, Documents and Images into Enriched Structured Data
text detection mainly based on ctpn model in tensorflow, id card detect, connectionist text proposal network
A Unified Toolkit for Deep Learning Based Document Image Analysis
A tensorflow implementation of EAST text detector
Text recognition (optical character recognition) with deep learning methods.
AdelaiDet is an open source toolbox for multiple instance-level detection and recognition tasks.
[python3.6] 运用tf实现自然场景文字检测,keras/pytorch实现ctpn+crnn+ctc实现不定长场景文字OCR识别
Hi all,
We are asking for help from our community, hoping to provide Chinese documentation to all users. It's not difficult and here is the chance to be one of the contributors of MMOCR, an open-source rising star!
Here is a list of documentation needed to be translated:
建议精简版抛弃exe,直接打包成压缩文件。再带一个批处理,利用已经安装好的Java环境直接命令行运行。
[验证码识别-训练] This project is based on CNN/ResNet/DenseNet+GRU/LSTM+CTC/CrossEntropy to realize verification code identification. This project is only for training the model.
Add a description, image, and links to the ocr topic page so that developers can more easily learn about it.
To associate your repository with the ocr topic, visit your repo's landing page and select "manage topics."
1、在使用PPOCRLabel进行图像标注的时候,图像上有很多文字有识别的需求,但有些位置的文字没有识别的需求,那么在进行标注的时候是否还需要对没有识别需求位置的文字进行标注呢?
2、在使用PPOCRLabel进行自动标注后,有些位置的文字被标注上了,但是还额外框住了其他不是文字的图案(软件的文字识别能够识别到正确的文字,但是这种情况出现的次数很多),还是否需要对标注框进行调整,来去除掉非文字的图案?不去除的话是否对最终文字识别网络模型造成影响呢?