Vovsoft Logo
Vovsoft Facebook Page Vovsoft Telegram Channel Vovsoft Youtube Channel Vovsoft Twitter Account
Menu
Home » Feature Requests » Speech to Text Converter

Speech to Text Converter Feature Requests



Here are the features requested by users.

πŸ‘‰ We will try to implement the features with the most votes first.
πŸ‘‰ A valid license key must be entered for voting!
πŸ‘‰ Each license key can be used for a maximum of 1 vote per feature!
πŸ‘‰ A universal license key can be used to vote for all programs, even the freeware tools!

Speech to Text Converter Icon Speech to Text Converter Windows

Ability send the recognized text to clipboard automatically

Ability send the recognized text to file automatically

Ability to change font settings

Ability to change timeout value

Ability to convert multiple files at once

Ability to convert very large files

Ability to work like a barcode scanner (the user presses a button to β€œStart Listening”, and whatever they say gets converted to text and placed wherever their cursor is)

Add support for "Google Speech to Text" (60 minutes per month free) https://cloud.google.com/free

Fix freeze problem during uploading and waiting for server response

Fix long "microphone recording" bug. A 192 kbps 12-minute footage has already caught on and keeps stopping with an error.

Make it multilingual

Replace language codes with actual language names

Support more engines:
- Windows local speech input (Windows-key plus "h" for human (voice) input.)
- Web speech API
- Speechmatics
- Vocapia
- Apptek
- Phillips SpeechLive


× Pay What You Want

A valid license key must be entered for voting!
Done πŸ‘ Ability to convert video files such as MP4

Done πŸ‘ Ability to export the transcribed text as SRT or VTT to use in subtitling

Developer Note: Published as new software: Speech to Subtitle Converter

Done πŸ‘ Add Russian language

Done πŸ‘ Auto-detect language

Done πŸ‘ Support more engines:
- Open AI Whisper (can be installed locally for on-prem use)
- Deepgram (very fast and low cost), great for live audio