[Convert Images to Speech for Free] How to Use the Feature to Read Text from Images and Read it Aloud [Image-to-Speech]
Jan. 26, 2026
Hello, thank you for always using Ondoku.
In this article, we will introduce how to use the feature that "reads text aloud when you upload an image."
- Transcribing from images is difficult
- Want to turn images into audio immediately
- Want to use it for free
It is recommended for people like this.
Now, let's introduce in detail and explain how to use Ondoku's "feature to read text from images and read it aloud."
【How to use the Image to Text-to-Speech feature】
- Access the top page of Ondoku
- Click the "Image" tab located above the text box
- Click Image Upload and select an image
※For smartphones, it is also possible to start the camera and take a photo directly here. - After selecting the image, adjust the language, speaker, speed, and pitch.
- Click the Read Aloud button
Then, the image is analyzed in just a few seconds.
The audio can be read aloud.
After loading the image, the recognized text is displayed in the text box.
If there are recognition errors and you want to correct the content to be read aloud, you can edit it here.
Next, we will explain in more detail with images and videos.
1. Access the Ondoku top page
First, access the Ondoku top page.
2. Click the "Image" tab
Please check the Ondoku top page.
You can see that there are "Text" and "Image" tabs above the text box.
This can be done from any screen, whether it's a PC, smartphone, or tablet, so please check.

Click the "Image" tab.
Then, the text box switches to the button for uploading an image.
3. Click on Image Upload and select an image

For PC
When you click the Image Upload button, a separate window opens, allowing you to select an image from your computer.
Select the desired image and click Open.
For smartphones and tablets

When you click the Image Upload button, you can choose "where to reference the image from" (image folder, camera, etc.).
In the case of the camera, it is also possible to take a photo on the spot and upload it.
4. After selecting the image, adjust the reading speed and pitch.

After selecting the image you want to read aloud, check the settings.
Set the reading speed, pitch, and which voice to use for reading.
You can listen to samples of Japanese voices on this page, so please take a look.
5. Click the Read Aloud button

Once everything is ready, click the Read Aloud button.
Ondoku recognizes the text from the image and starts reading it aloud as audio.
Loading starts at the moment you click, and then the audio is automatically played.
During playback, the recognized text is displayed in the text box.
AI is used for recognizing text from photos.
If there is an error in text recognition, it can be corrected in the text box.
Image recognition range
Recognition from image to text recognizes "all the text shown in the image."
Therefore, when selecting an image, please choose one that has been cropped to only the parts you want to read aloud.
In addition, handwritten characters can also be read aloud, but the accuracy will be somewhat lower.
There are some characters that are difficult to recognize.
For example, the part written as "handwritten" (手書き) in this image was recognized as "hand-scooped" (手すき).
This is probably because my handwriting is messy.
Another example is the display of the ¥ symbol on receipts.
This may be mistaken for the kanji character "半" (half).
How many images can be used for free or for a fee?
The image reading feature that can convert images to speech can, of course, be used for free.
However, please note that the number of times it can be used varies depending on the plan.
- Non-member: 1 image/day
- Free member: 3 images/day
- Basic plan: 300 images/month
- Value plan: 1,000 images/month
- Premium plan: 2,500 images/month
【Points to note】
In the image reading feature,
both the number of images + the number of characters available for reading are consumed to read aloud.
Please note that reading aloud cannot be performed with the number of images alone.
Let's actually try it
Does it seem difficult? That's not the case at all.
Seeing is believing, so please actually try using it.
This feature can be used from both a PC and a smartphone.
Once you use the feature that can read aloud text from images, you will be surprised at how convenient it is!
- Reading necessary documents in daily life
- Turning textbooks or printouts into audio from images using Ondoku
By using it this way, ease of recognition and ease of understanding will improve significantly.
It is also possible to easily check documents that you had put off because they were a hassle to read by turning them into audio from images with Ondoku.
Please do try out the image reading feature.
We look forward to seeing you on the Ondoku site.
■ AI voice synthesis software "Ondoku"
"Ondoku" is an online text-to-speech tool that can be used with no initial costs.
- Supports approximately 50 languages, including Japanese, English, Chinese, Korean, Spanish, French, and German
- Available from both PC and smartphone
- Suitable for business, education, entertainment, etc.
- No installation required, can be used immediately from your browser
- Supports reading from images
To use it, simply enter text or upload a file on the site. A natural-sounding audio file will be generated within seconds. You can use voice synthesis up to 5,000 characters for free, so please give it a try.
Email: ondoku3.com@gmail.com
"Ondoku" is a Text-to-Speech service that anyone can use for free without installation. If you register for free, you can get up to 5000 characters for free each month. Register now for free
