This extension adds a toolbar button to your browser to allow the user to select a region in the currently active window. The extension captures the selected area and tries to recognize text inside this region using the internal powerful OCR engine. This extension uses the “tesseract.js” library that supports more than 100 languages, automatic text orientation, and script detection.
This extension loads the JS library on the page and removes it when you are done with the OCR. This way there is no long-term resource usage.
Note: On the first run, the extension might take a few minutes to fetch the training data from the internet. Since this resource is cached, all subsequent calls are going to be fast.
Note: Optical character recognition (OCR) is slow in nature, so this extension displays a progress bar for each detection module.
Note: This extension does the OCR process offline. There is no server-side interaction. It only fetches the language training database once.
Note: This tool can be used to extract the text content out of images or extract the content of a web page when user-section is forbidden.
发表评论