A downloadable voice synthesiser for Windows

Download NowName your own price

Vox | [vɑks] is a voice synthesizer I made using Unreal Engine 5 and MetaSounds. It is not a realistic text-to-speech synthesizer, but rather a playful tool.

How to use

In the Speech mode, you have to generate each phone with the press of a key and by changing the pitch with the mouse. Note that everything can also be controlled without a mouse - read more about the accessibility features below.
I expect people to have fun struggling with the controls and hearing and seeing how the system reacts to their inputs. It's very hard to control the vowels and the pitch at the same time, but when it's done right, it almost sounds like some kind of weird broken language.

In the Drone Choir mode, you have four instances to play with and you won't have to manually control the pitch. Play with the controls to get some moody drone music.

More videos about Vox | [vɑks]

Controls

Ideally, you need a keyboard and a mouse, but it's possible to manage without a mouse. Without a keyboard, you won't be able to control the 12 vowels in the Speech mode, but the Drone Choir mode will work fine with the mouse only.

Here are the alternative keyboard controls:

  • Shift: toggle the visibility of the visual interface

Speech mode

  • Q/W/E/R/A/S/D/F/Z/X/C/V (hold): play one of the 12 vowels
  • 1/2/3: change the voice archetype between child, female, and male.
  • Y/O (hold): decrease/increase pitch a lot
  • U/I (hold): decrease/increase pitch a little
  • H/L (hold): decrease/increase the cord ripple frequency a lot (if the setting has been enabled in the Options screen)
  • J/K (hold) decrease/increase the cord ripple frequency a little (if the setting has been enabled in the Options screen)

Drone Choir mode

For the whole choir

  • 1: start all the voices
  • 2: stop all the voices
  • 3: change the scale
  • 4: change the preset (what voice archetypes the choir consists of)
  • 7: increase the time interpolation for when vowels are changing
  • 8: decrease the time interpolation for when vowels are changing
  • 9: increase the time interpolation for when notes are changing
  • 0: decrease the time interpolation for when notes are changing

For each voice

  • Q/W/E/R: start/stop a voice
  • A/S/D/F: change the vowel
  • Z/X/C/V: change the note
  • Y/U/I/O: change the voice archetype
  • H/J/K/L: change the octave the voice is singing

Accessibility

If you are interested in the project and need some critical accessibility features that are currently missing, please reach out in the comments and I will see what I can do.

Hearing

There are a few audio accessibility options available from the Options screen:

Options screen

Mono

Enabling this will make all the sound sources monophonic, which will be useful if you cannot hear well from one ear. You can only hear the difference in the Drone Choir mode.

To enable the mono option, press the down arrow key 5 times from the options screen and press Enter. It's disabled by default.

Hyperacusis Equaliser

Hyperacusis is the increased sensitivity to sound.

If you suffer from hyperacusis and are particularly sensitive to a specific frequency range, this equaliser can be useful to you.

Vision

Unfortunately, there is no screen reader support. It is still an experimental feature in Unreal Engine. However, the main menu has UI sounds that should help to know where you are. Each button you select with the arrow keys will have the voice synthesiser pronounce the vowels contained in that button's text.

For example, to access the Speech Mode:

  • launch the program
  • press any key twice to skip the intro. You should hear "ee" as in Speech.
  • press the Enter key to access the Speech Mode.

To access the Drone Choir Mode:

  • launch the program.
  • press any key twice to skip the intro. You should hear "ee" as in Speech.
  • press the down arrow key. You should hear "oh ahyuh" as in Drone Choir.
  • Press the Enter key to access the Drone Choir Mode.

To access the Options screen:

  • launch the program.
  • press any key twice to skip the intro. You should hear "ee" as in Speech.
  • press the down arrow key twice. You should hear "oh ahuh" as in Options.
  • Press the Enter key to access the Options screen.

To leave the Speech Mode, the Drone Choir Mode, or the Options screen, simply press the Escape key; it will bring you back to the main menu. To quit the game from the main menu, press the Escape key once; you will hear the vowel in "Quit". Press the Escape key again to confirm.

Controls binding

Unfortunately, the controls are not configurable. I will try to do better on my next projects.

Platforms

The program is only available for Windows for now, but I may be able to make a build for both Linux and macOS. Please tell me in the comments if you are interested. If there is enough demand, I will look into it.

Thanks

Miina, Quentin, Clément, Florent, @AutSciPerson, Aaron & Dan, and the Redhill audio team, thank you all for the great feedback and advice :)

Download

Download NowName your own price

Click download now to get access to the following files:

vox.zip 108 MB
Version 5

Development log

Leave a comment

Log in with itch.io to leave a comment.