How to Read Aloud Vertical Text Images: Convert to Text with Mojiokoshi-san, Then Read with Ondoku
May 16, 2026

"I want Ondoku to read aloud images of vertical books and documents, but it isn't working very well..."
It would be very convenient if you could listen to vertical books and documents while commuting or between household chores.
However, when you upload vertical images to an AI text-to-speech site, they may not be converted into audio correctly.
In such cases, we recommend converting them into text with an AI transcription service before reading them aloud!
By using Ondoku's sister service "Mojiokoshi-san", you can convert photos of vertical books and documents into text with high accuracy.
After that, you just need to paste the text into Ondoku!
In this article, we will explain how to read vertical images aloud by combining Mojiokoshi-san and Ondoku.
Since the procedure is completed entirely within your browser, there is no need to install any apps or purchase paid software.
What you will learn in this article:
- Reasons why Ondoku cannot read vertical images
- How to convert vertical images into text with Mojiokoshi-san
- How to paste text into Ondoku and read it aloud
- How to utilize audio converted from vertical books and documents
Can vertical images be read aloud as they are?

The short answer is no: even if you upload vertical images directly to Ondoku, they cannot be read aloud correctly.
This is because the image reading function of the AI text-to-speech site "Ondoku" does not support vertical text recognition.
But don't worry!
By using Ondoku's sister service "Mojiokoshi-san", you can convert vertical images into audio.
"Mojiokoshi-san" is an AI site that can convert vertical text from newspapers, magazines, books, etc., into text with high accuracy.
By converting vertical images into text with Mojiokoshi-san and then pasting that text into Ondoku, you can have vertical images read aloud in clear, easy-to-follow audio.
Ondoku does not support vertical image reading
The AI text-to-speech site Ondoku has an "image reading function" that reads characters in photos and images and converts them into audio.
However, this function can only read horizontal text, and vertical images are not supported.
What happens if you upload a vertical image?
If you convert vertical books, newspapers, or documents into images and upload them to Ondoku, the characters may come out in the wrong order or get mixed with characters from other lines.
If the text is read aloud in that state, the audio will not make sense.
This is because correctly interpreting vertical layouts and line breaks requires character recognition designed specifically for vertical text.
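To illustrate why the reading order matters, here is a toy model in Python. It is only a sketch of the general idea, not a description of how Ondoku or Mojiokoshi-san work internally. The function names are hypothetical. The text is laid out in top-to-bottom columns ordered from right to left, and then scanned in two different ways.

```python
def layout_vertical(text: str, rows: int) -> list[list[str]]:
    """Lay text out as vertical writing: top-to-bottom columns, ordered right to left."""
    columns = [text[i:i + rows] for i in range(0, len(text), rows)]
    # Pad the last column so every column has the same height.
    columns = [col.ljust(rows, " ") for col in columns]
    # The first column sits at the far right of the page.
    columns.reverse()
    # grid[r][c] is the character at row r, column c as seen on the page.
    return [[col[r] for col in columns] for r in range(rows)]


def read_horizontally(grid: list[list[str]]) -> str:
    """Scan the page row by row, left to right (horizontal reading order)."""
    return "".join("".join(row) for row in grid)


def read_vertically(grid: list[list[str]]) -> str:
    """Scan column by column from the rightmost column, top to bottom."""
    n_cols = len(grid[0])
    text = "".join(
        grid[r][c] for c in range(n_cols - 1, -1, -1) for r in range(len(grid))
    )
    return text.rstrip()  # drop padding added to the last column


grid = layout_vertical("ABCDEFGHI", rows=3)
print(read_vertically(grid))    # ABCDEFGHI
print(read_horizontally(grid))  # GDAHEBIFC
```

The horizontal scan picks up one character from each column in turn, which is the kind of scrambling described above.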
Horizontal images can be converted to audio with Ondoku alone
Conversely, for horizontal paper materials or text in photos, you can convert them to audio just by uploading the image to Ondoku.
The procedure for reading horizontal images aloud is covered in detail in a separate article.
Now, let's explain the steps to read vertical images using Mojiokoshi-san and Ondoku in order.
Convert vertical images to text with Mojiokoshi-san
When converting vertical images to text, "Mojiokoshi-san" is recommended!
Mojiokoshi-san is an AI transcription site that can convert images, audio, and videos into text for free.
Its strength is its support for vertical text such as newspapers, magazines, and books: it converts images into text with high accuracy while distinguishing layouts and headings.
How to read vertical images with Mojiokoshi-san
To convert a vertical image into text, first open the Mojiokoshi-san top page in your browser.

Click the "Select File" button or drag and drop the image to select the vertical image file.

When transcribing from an iPhone (iOS) or Android smartphone, you can also take a photo on the spot and transcribe it.

Once the file is selected or the photo is taken, the image is displayed on the screen.

Confirm that the "Language" matches the language of the image (Mojiokoshi-san's vertical transcription function supports Japanese, Korean, Chinese, and Mongolian).

Here is the important point.
When you set it to a language that supports vertical transcription such as Japanese, the choice for "Horizontal/Vertical" will be displayed, so select "Vertical."

Preparation for transcribing the vertical image is now complete!
Click the "Upload" button to upload the vertical image.

Once the upload is finished, the transcription process begins automatically.
The text conversion is completed quickly with the latest AI, and the text read from the image is displayed on the screen.

Check the content, and if there are no problems, click "Download Text" to save the file.

Paste text into Ondoku and read aloud
Once you have converted it into text with Mojiokoshi-san, convert it into audio with Ondoku!
"Ondoku" is an AI text-to-speech site that reads text aloud as soon as you paste it in.
By registering as a free member, you can convert up to 5,000 characters into audio each month.
You can download the created audio as an MP3 and save it to your computer or smartphone.
The generated audio can be used commercially, so it is also suitable for video narration or podcasts.
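If the transcribed text is long, it can help to check its length against the 5,000-character free allowance before pasting. The following is a minimal Python sketch that splits text into chunks at sentence boundaries. The function name and the constant are hypothetical, and it assumes one character counts as one unit; how Ondoku actually counts characters is not confirmed here.

```python
import re

# Free allowance mentioned in this article; the counting rule is an assumption.
FREE_LIMIT = 5000


def split_for_reading(text: str, limit: int = FREE_LIMIT) -> list[str]:
    """Split text into chunks of at most `limit` characters, preferring sentence ends."""
    # Split after Japanese or Western sentence-ending punctuation, keeping the punctuation.
    sentences = [s for s in re.split(r"(?<=[。！？.!?])", text) if s]
    chunks: list[str] = []
    current = ""
    for sentence in sentences:
        # A single sentence longer than the limit is cut into fixed-size slices.
        while len(sentence) > limit:
            if current:
                chunks.append(current)
                current = ""
            chunks.append(sentence[:limit])
            sentence = sentence[limit:]
        # Start a new chunk when adding this sentence would exceed the limit.
        if len(current) + len(sentence) > limit:
            chunks.append(current)
            current = ""
        current += sentence
    if current:
        chunks.append(current)
    return chunks


sample = "あいう。" * 3
for chunk in split_for_reading(sample, limit=8):
    print(len(chunk), chunk)
```

Each chunk can then be pasted into Ondoku separately. Joining the chunks back together reproduces the original text unchanged.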
How to read aloud text with Ondoku
To read aloud text transcribed from an image, first open the Ondoku top page in your browser.
Paste the text into the text box.

Select the language, such as Japanese or English.

Select the type of voice (speaker).

In Ondoku, you can choose from a wide variety of voices, including women, men, and children.
Japanese voice samples are available in a separate article, so please take a look.
Preparation for reading the text is now complete.

Click the "Read Aloud" button to generate the audio.

The text-to-speech process is completed instantly with the latest AI.
The screen changes, and the audio is played.

Listen to the audio, and if there are no problems, click the "Download" button to save the MP3 file.
Now you have successfully read aloud a vertical image using Mojiokoshi-san and Ondoku.
It's a very simple method, so why not start by converting vertical images into text with "Mojiokoshi-san" and then reading them aloud with "Ondoku"?
Convenient when you want to convert vertical documents or books into audio

Combining Mojiokoshi-san and Ondoku is perfect for listening to vertical books and documents via audio.
For example, you can use it in situations like these.
Checking vertical documents with your ears
Some contracts, reports, and internal materials are still produced in vertical format.
If you transcribe such documents with Mojiokoshi-san and read them aloud with Ondoku, you can check the content by ear without having to follow along with your eyes.
It's also very convenient when re-reading manuscripts you have written yourself.
Typos and unnatural phrasing that are often overlooked during silent reading can be easily found by listening to them in audio form.
Listening to paper books and newspapers with audio
Sometimes you want to read a vertical book but don't have the time to sit down and open it.
In such cases, using Mojiokoshi-san and Ondoku is recommended!
If you take photos of vertical books and read them aloud, you can create original audiobooks.
You can enjoy the content while commuting, in the bath, or during household chores.
Switching to listening when your eyes are tired is also a recommended way to read.
Can also be used to convert vertical PDFs to audio
Not only paper documents and books, but you may also want to convert vertical PDF materials into audio.
By converting the PDF to an image and then letting Mojiokoshi-san read it, you can convert it into an MP3 file using the same procedure as this article.
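The PDF-to-image step can be done with any tool you like. As one possible approach, here is a short Python sketch that uses the third-party PyMuPDF library (installed with `pip install pymupdf`). The function names, the PNG output format, and the 200 dpi setting are assumptions chosen for illustration; the article does not specify which image formats or resolutions work best with Mojiokoshi-san.

```python
from pathlib import Path


def page_image_path(pdf_path: str, page_index: int, out_dir: str = ".") -> Path:
    """Build an output name such as book_p001.png for a zero-based page index."""
    stem = Path(pdf_path).stem
    return Path(out_dir) / f"{stem}_p{page_index + 1:03d}.png"


def pdf_to_images(pdf_path: str, out_dir: str = ".", dpi: int = 200) -> list[Path]:
    """Render every page of a PDF to a PNG file and return the saved paths."""
    import fitz  # PyMuPDF, a third-party library (not part of the standard library)

    Path(out_dir).mkdir(parents=True, exist_ok=True)
    saved: list[Path] = []
    doc = fitz.open(pdf_path)
    try:
        for index, page in enumerate(doc):
            # The dpi argument requires a reasonably recent PyMuPDF release.
            pixmap = page.get_pixmap(dpi=dpi)
            target = page_image_path(pdf_path, index, out_dir)
            pixmap.save(str(target))
            saved.append(target)
    finally:
        doc.close()
    return saved


# Example usage (assumes a file named vertical.pdf exists):
# for path in pdf_to_images("vertical.pdf", out_dir="pages"):
#     print(path)
```

Each saved page image can then be uploaded to Mojiokoshi-san with "Vertical" selected, following the steps earlier in this article.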
Reference: Horizontal images and PDFs can be converted to audio with Ondoku alone
Unlike vertical cases, horizontal images and PDF files can be converted to audio with Ondoku alone, so it is recommended to choose the method according to your purpose.
The methods for directly reading aloud horizontal image files and PDF files are each explained in separate articles.
For vertical image reading, go from Mojiokoshi-san to Ondoku
To read vertical images aloud, it is recommended to convert them to text with Mojiokoshi-san first and then paste them into Ondoku.
Finally, let's summarize the flow once more.
- Upload vertical images to "Mojiokoshi-san" to convert them into text
- Copy the text and paste it into "Ondoku"
- Select language and voice to read aloud → Download as MP3
Everything from text conversion to audio generation can be done within your browser, and no paid software or apps are necessary.
Since vertical materials such as paper documents, books, and newspaper clippings can be easily converted to audio, why not try using "Mojiokoshi-san" and "Ondoku" for free?
■ AI voice synthesis software "Ondoku"
"Ondoku" is an online text-to-speech tool that can be used with no initial costs.
- Supports approximately 50 languages, including Japanese, English, Chinese, Korean, Spanish, French, and German
- Available from both PC and smartphone
- Suitable for business, education, entertainment, etc.
- No installation required, can be used immediately from your browser
- Supports reading from images
To use it, simply enter text or upload a file on the site. A natural-sounding audio file will be generated within seconds. You can use voice synthesis up to 5,000 characters for free, so please give it a try.

