Skip to content → Skip to footer

OCR Functions and Settings in Jieshuo Screen Reader

Published 10 December 2024 by Kareen Kiwan

Last updated on 1 September 2025

Jieshuo Screen Reader utilizes Optical Character Recognition (OCR) to offer text recognition features. These include recognizing text in the currently focused item, the entire screen, or a virtual screen designed to improve accessibility for otherwise inaccessible app screens. Jieshuo supports several OCR engines, including custom ones, with a dedicated settings section.

Table of Contents

Recognize Text of the Current Focused Element and Recognize Text on the Current Screen
Recognize Subtitles Function
Virtual Screen The virtual screen feature aims to enhance the accessibility of inaccessible app screens. The results vary across apps but generally enable better focus on items and their activation. It also uses OCR to recognize text, including the names of controls. OCR-Related Settings
Notes:
Audio Demonstration

Recognize Text of the Current Focused Element and Recognize Text on the Current Screen

The key difference between these two functions lies in their scope. The current focus recognition captures a screenshot of the currently focused element and sends it to the OCR engine for recognition. In contrast, the entire screen recognition captures a screenshot of the entire screen and attempts to recognize all the text displayed. Both functions are available in the main menu, can be assigned to gestures, or activated through the voice asistant.

However, for current focus recognition, there are additional ways to perform the function:

Using the OCR and Translation Navigation Type
After adding OCR and translation as a navigation type and switching to it, the gesture to move to the next item or increase the value translates the currently focused text, while the gesture to move to the previous item or decrease the value performs OCR on the focused item.
From the Recognition Menu
The recognition menu, part of the functions menu but also accessible from the main menu, includes a “Text Recognition” function. Initially intended to recognize and translate the currently focused text, it now acts as a focus text recognition tool only.

Recognize Subtitles Function

This function performs continuous OCR recognition on a specific portion of the screen, as defined by the user. It is designed primarily for recognizing video subtitles.

Virtual Screen

The virtual screen feature aims to enhance the accessibility of inaccessible app screens. The results vary across apps but generally enable better focus on items and their activation. It also uses OCR to recognize text, including the names of controls.

OCR-Related Settings

To access OCR settings, navigate to “Screen reader settings” > “Advanced settings”, then activate “OCR Settings”. Available options are:

Use a dark background for the virtual screen: This option targets partially sighted users and determines whether a dark background is used when displaying results in both the “Virtual screen” and “Recognize text on the current screen” functions.
Automatically read the recognized text when using the virtual screen or the current screen recognition: Unlike the current focus recognition, which is always read automatically, it is possible to choose whether the automatic announcement should occur for the entire screen and virtual screen recognition. Note that for these two functions, the results are displayed in a new window, whereas with the current focus recognition, the text is only read aloud, and you must select recognition results from the recognition menu to read them again.
Automatically use OCR: Automatic OCR tries to perform text recognition whenever an item that lacks a label is focused.
Auto image description: When enabled, this option also acts on unlabeled focused items, but with one important difference—the method used to recognize the item. While the automatic OCR uses the selected OCR engine to recognize any text in the item, the auto image description uses the Blue Heart AI model to try to describe the item.
OCR engine: Here, the user selects the engine to be used for all OCR-related functions. It includes:
- Vivo: The default online engine that replaced Tencent Cloud, the previously used engine. This engine lacks support for most languages that were available with the former engine. It might also not deliver good results for international users.
- Custom Baidu OCR data: If this engine is selected, required data such as the API key must be provided by the user.
- Custom OCR extensions: If the user wants to use a custom extension and has the necessary data and capabilities to create it, they can select this option.
- Offline OCR: This offline OCR engine is for users who don’t want to rely on the internet for text recognition. Results are delivered faster than when using the online engine, with more supported languages. The engine used, along with the supported languages, differs between the Jieshuo regular + version and Jieshuo Max.
  The languages included in the + version are: English, mixed Chinese and English, Japanese, Korean, Russian, German, French, Italian, Portuguese, and Spanish.
  The Max version, which uses a larger engine, supports: Chinese, English, Japanese, Korean, Kannada, Tamil, Telugu, Vietnamese, Latin, Afrikaans, Bosnian, Czech, Welsh, Danish, Māori, German, Malay, Dutch, Occitan, Norwegian, Polish, Portuguese, Romanian, Slovak, Slovenian, Albanian, Swedish, Swahili, Tagalog, Turkish, Uzbek, Cyrillic, Belarusian, Russian, Serbian, Bulgarian, Ukrainian, Mongolian, Adyghe, Kabardian, Avaric, Dargwa, Ingush, Chechen, Lezghian, Arabic, Persian, Uyghur, Urdu, Devanagari, Hindi, Marathi, Nepali, Bhojpuri, Maithili, Angika, Marshallese, Newari, Konkani, Sanskrit, and Haryanvi.
  For more information about Jieshuo Max, please check this article.
OCR language: If the offline engine is selected, the recognition language is specified here.
Offline OCR accuracy (Jieshuo Max only): Specifies how precise the recognition should be. It ranges from 1 to 5. Selecting smaller values gives faster recognition speed, while selecting higher values gives more accurate results but with slightly decreased speed.

Filter keywords while recognizing subtitles: Here, users can replace misidentified common errors with the correct text, so it is used instead. Note that I have no information on whether this works only for the subtitle recognition function or for all OCR-related functions.

Subtitle recognition default landscape/portrait mode: These options specify the part of the screen that should be recognized when using the subtitle recognition, based on whether the screen orientation is landscape or portrait. There is also an option to always ask, so the position is determined by the user every time the function is used.

Baidu OCR: Data related to the Baidu OCR engine is added here for users who selected this engine to be able to use it.

Edit custom OCR extensions: Here, custom OCR engine support extensions are created.

Notes:

When performing the current screen recognition or virtual screen, results are provided in a new window with the ability to click items. For example, if a recognized text belongs to a certain item on the screen, tapping on the resulting text clicks the item. The results also include an edit option that opens Granular Editing mode, allowing users to edit the recognized text to copy or export it as a text file.
All OCR-related functions and settings are paid. Users must be subscribed to the premium version to access them.
The inquire by voice functions can be also helpful in recognizing text by using prompts to read the text included in the image. Additionally, other image and video related description functions can detect text sometimes.
The recognition menu includes a CAPTCHA recognition function that users may try if they encounter the older, less-used text insertion CAPTCHAs. It also includes an OCR function called “exam recognition”; however, it is unclear how this function works exactly or what it tries to detect compared to the text recognition functions discussed above. It is important to note, though, that these options might not work after the removal of the Tencent Cloud OCR engine.

Audio Demonstration

Share this:

X (Twitter) Facebook WhatsApp Telegram

Kareen Kiwan

Since her introduction to Android in late 2012, Kareen Kiwan has been a fan of the operating system, devoting some of her time to clear misconceptions about Android among blind people. She wrote articles about its accessibility and features on the Blindtec.net Arabic website, of which she was a member of its team. Kareen's experience was gained through her following of the Android-related communities and fueled by her love for technology and her desire to test new innovations. She enjoys writing Android-related articles and believes in the role of proper communication with both the blind screen reader Android users and app developers in building a more accessible and inclusive Android. Kareen is a member of the Blind Android Users podcast team and Accessible Android editorial staff.

Published in Tutorials

Tagged in

Comments

Leave a Reply Cancel reply

Untitled
New app added to Accessible Android apps directory Wispr Flow: AI Voice-to-Text accessible https://accessibleandroid.com/app/wispr-flow-ai-voice-to-text/ #Android #AI
Untitled
Huawei FreeBuds Pro 5 Review: Living With the Upgrade https://accessibleandroid.com/huawei-freebuds-pro-5-review-living-with-the-upgrade/
Untitled
Roads Audio: Voice Threads https://accessibleandroid.com/app/roads-audio-voice-threads/
Untitled
Infinix Zero 40: A Review from a Visually Impaired User’s Perspective https://accessibleandroid.com/infinix-zero-40-a-review-from-a-visually-impaired-users-perspective/