Aria (firmware)

Installation

1.Save the firmware file to an SD card with a name like “Aria.bin”.

2.Load it from the Code > Flash menu.

3.The firmware will be copied as “firmware.bin”, and the system will reboot.

4.The original firmware file remains on the SD card under its own name.

This process works for EMY, Mea, Kaiwa, Sino, and Vax, allowing easy switching between firmwares.

Main Menu

Vowel
Use the main switch to change the desired mode :

•Trigger Mode – When triggered by the gate, vowels play for their entire frame duration.

•Gate Mode – Vowels play only while the gate is high and stop when low.

•Continuous Mode – Vowels loop seamlessly, with transitions affected by frame and merge duration.

Parameters

• P1 – Sample Rate

• P2 – Master Pitch

• P3 – Frame Duration

•P4 – Merge Duration (smooth transitions between vowels)

• P5 – Noise Level

• Sound – Choose Vowel to play

Code

•Code Display – Shows the current firmware version.

•Calibration – Potentiometer calibration tool.

•Flasher – Firmware loader to select and install different firmware versions.

Demo

View this post on Instagram

Nexus Vox

Features

Converts any ASCII text into speech automatically
Real-time control of speed, pitch, voice, reverb, volume

CV control

Voices (11)
Speed
Formant
Reverb
Expression
Pitch (in Phoneme and MIDI mode)

These controls can be applied in real time while speaking!

Inline commands

The text string can contain commands to adjust the voice on the fly

$1o $2s I am Vader and I speak slowly

Phonetic to speech

The speech can also be generated by stringing phonemes and prosody to control the pitch, speed, and volume.

k ax m p yy uw dx er 
Computer

//h aw -/d>/eh r +<\yy uw s p iy k t uw m iy dh ae t -w ey .+/
How dare you speak to me that way!

/ increase pitch. + inscrease speed > increase volume
 decrease pitch. - decrease speed < decrease volume

Phonemes mode

Allows for the easy selection of fundamental phoneme groups, such as “Vowels” and “Consonants,” and provides straightforward controls for pitch and speed adjustments.

While providing fundamental control over pitch and speed, it’s a straightforward function ideal for those just starting their journey in speech synthesis and sound exploration.

This requires just a gate signal to get started…

Latency

From the Gate signal to the start of the speech, the latency is relatively tiny: 30 ms to 80 ms.

MIDI

Full MIDI control over the voice characteristics.
Phonetic, Phoneme or Text to speech from MIDI notes.

20K words for NEXUS MIDI alphabet

1 file(s) 10.62 MB

Login is required to access this page

Login

Nexus Alphabet MIDI effect rack for Ableton Live

1 file(s) 12.33 KB

Login is required to access this page

Login

Nexus Phonemes MIDI effect rack for Ableton Live

1 file(s) 12.33 KB

Login is required to access this page

Login

Firmware

Nexus vox firmware (for Emy & Terminal)

2 file(s) 81.43 KB

Login is required to access this page
Login

Nexus MIDI for Kraftor

1 file(s) 74.50 KB

Login is required to access this page

Login

Text files

Phonetics 0.32 KB 33 downloads

Check this file showing how to use the phonetics commands Nexus Phoneme Symbols PhonemeExamplePhonemeExample Adas...

Login is required to access this page

Login

Commands 0.28 KB 28 downloads

Text with embedded commands Nexus commands Use the $ symbol to add a command in your...

Login is required to access this page

Login

Stand alone

You can easily connect Nexus to your preferred microcontroller (here the Raspberry Pi Pico) using a simple serial interface.

Click the image below to learn more

Media

View this post on Instagram

A post shared by Jean-Luc Deladrière (@polaxis)

S.A.M (firmware)

Read texts from the micro SD card in plain English.
The sound produced by S.A.M is a robotic, monotone voice with a slightly metallic quality.

The user can adjust the Pitch, Speed, Mouth, and Throat.

Note that the code is blocking, and the potentiometer positions are read during the pauses between utterances.
(only the Rate P1 can be used in real-time)

SAM can read .txt file
You can try this file as an example.

SAM test file

1 file(s) 0.10 KB

Login is required to access this page

Login

Installation

Save it on the SD card with a name like “SAM.bin,” and you can load it from the code>flash menu.
The same applies to the EMY, Mea, Kaiwa, Sino & Vax firmware so that you can switch from one to another within the main menu.

This function copies the firmware to “firmware.bin” and reboots the system.
It will delete the firmware.bin but keep the original firmware on the card with its own name.

Demo

View this post on Instagram

A post shared by Jean-Luc Deladrière (@polaxis)

Sino Vox

It generates speech from plain English or Chinese by reading files on the microSD card.

Timing

Real-time
The various voice parameters are applied just before triggering the speech and are ineffective while speaking.
They are applied to the next utterance. The best is to fiddle a bit with the knobs for the desired effects.
While playing with the interface, the display shows the parameters changes.

Firmware

Sino Firmware + USB MIDI

5 file(s) 68.54 KB

Login is required to access this page

Login

Sino Firmware + TRS MIDI

3 file(s) 68.54 KB

Login is required to access this page

Login

SD card

Here are some text files to test the various voice options (including text in Chinese Pinyin)

Sino SD card

1 file(s) 24.45 KB

Download

MIDI

You can use a midi sequencer to store letters as MIDI notes.

(C#6 is letter a) The speech starts when Sinovox receives the “.” character (MIDI note A#1). The space character is the MIDI note G#0

Pitch bend can be used to set the pitch from 0-10.

The MIDI clock is present on the Busy jack.

You can use a custom MIDI track in Ableton live to ease the process of “typing” text.

Ableton MIDI rack

Ableton SinoVox Rack

1 file(s) 10.95 KB

Download

Media

Voir cette publication sur Instagram

Une publication partagée par Jean-Luc Deladrière (@polaxis)

Jean-Luc Deladrière · Sino Vox

Robovox

Robovox’s design is inspired by the Patent “System for and method of synthesizing singing in real-time” filed on 03.05.90 by Florian Schneider, Gert Joachim Ott, and Gert Jalass.

It describes a musical instrument that can produce a speech and sing in real-time via MIDI controls.
A magical speech synthesizer chip lives at the heart of the instrument: the Votrax SC-02 invented by Richard T. Gagnon.
The Votrax SC-02 chip can generate speech by stringing phonemes together.
The Votrax SC-02 contains registers that allow software control of the speech rate, pitch, pitch movement rate, amplitude, articulation rate, vocal tract filter, and of course, phoneme selection.
Most of these functions are implemented in the MIDI control software.

In Robovox, we also implemented an undocumented function of the chip: the internal audio carrier can be replaced by an external one to produce vocoder-like effects.
The result is simply excellent. You can also use this feature as an effect to apply to the carrier! Check this demo where I play with the pitch of the external carrier with a Genki ring (@ 0:48) (sorry for the lousy audio -video sync)

Specifications

Hardware

Add-on board to fit on Kraftor, Emy or Emy Terminal via the MikroBUS port
Host the vintage SC-02 Votrax speech chip
Audio carrier input for vocoder like effect

Midi controls

MIDI note ON note OFF mapped to the SC02’s phonemes
Velocity
Pitch (coarse control by clocking the chip)
CC2 Note Pitch (fine control using the chip inflection/pitch register)
Mod Wheel mapped to filter
CC 64 sustain ON/OFF to choose between the internal and external carrier

Emy / Terminal EMY Firmware

The code runs on Terminal or Emy. It comes in 2 versions, depending on how you want to sequence it. You can drive it either as a USB MIDI instrument or via a classic MIDI serial TRS connector. The code also allows CV controls so it can be used in a Eurorack setting

Robovox MIDI Serial

3 file(s) 44.86 KB

Download

Robovox MIDI USB

3 file(s) 45.93 KB

Download

Sequencing the chip

VST-Plugin

The chip is quite easy to sequence with the help of a plugin that works almost like the original Atari software used by Kraftwerk.

Robovox PEC VST plugin

2 file(s) 5.26 MB

Download

Ableton MIDI Effect Rack

You can also sequence the chip by sending phonemes from the piano roll (here: UH R A N E U M )

I made a MIDI effect rack for Ableton to simplify the introduction of text

Ableton Robovox Rack

1 file(s) 11.32 KB

Download

Media

Here are a few Kraftwerk covers made by Andrew Shaw. These are astonishing variants of the original ones: respectful to the original songs but with their specificities or identities.

Andrew is using the Robovox precisely as Kraftwerk used to do: first by finding the correct phonemes to be uttered, then launching them in perfect timing and duration, and finally tuning the pitch to have the chip sing in tune!
( all parameters sent live to the Robovox from within the DAW)

Jean-Luc Deladrière · Robovox

Terminal

The casing allows adding any of the Vox modules inside just by removing four screws.
As with Emy, Terminal can be used in a CV-controlled environment or act as a MIDI instrument.

You can install the various firmware on the SD card before powering the device.

A pair of shunts allows the user to connect Terminal either as a MIDI USB unit or as a host for a USB keyboard for text entry.

On request, a serial TRS jack MIDI Type B connector can also be added.
(note: not all firmware are supporting MIDI yet. MIDI support is under development)

Please check this page for more information: https://www.polaxis.be/

Features :

100 % Emy compatible, including adding additional voices
USB port for power or MIDI or USB connection
Optional TRS jack MIDI Type B connector
USB host connector for computer keyboard
Shunts (must be removed when connecting a keyboard, otherwise, they must be placed vertically)
Line in & out stereo jack 1/4″ (line in only used for the Robovox module)
Trimpot to adjust the audio level input (only used for the Robovox module)

Terminal in action

View this post on Instagram

A post shared by Jean-Luc Deladrière (@polaxis)

VAX vox

What is Dectalk

Dectalk was a speech synthesizer and text-to-speech technology developed by Digital Equipment Corporation in 1984, mainly based on the work of Dennis Klatt at MIT.

The Dectalk Express what connected to the serial port and would speak what was being “printed.”

The synthesizer can process text and produce speech with nine different voices. The Dectalk engine includes a parser that gives users fine control over the synthesized speech’s quality, pitch, and intonation. Dectalk can also be programmed to play phonemes and sing with quite a realistic expression.

[hxae<300,10>piy<300,10> brr<600,12>th<100>dey<600,10> tuw<600,15> yu<1200,14>_<120>]
[hxae<300,10>piy<300,10> brr<600,12>th<100>dey<600,10> tuw<600,17> yu<1200,15>_<120>]
[hxae<300,10>piy<300,10> brr<600,22>th<100>dey<600,19>xdih<600,15>rdeh<600,14>ktao<600,12>k_<120>_<120>]
[hxae<300,20>piy<300,20> brr<600,19>th<100>dey<600,15> tuw<600,17> yu<1200,15>]

The command syntax for musical coding sequences is:

[phoneme<duration, pitch number

Timing

Latency
There is a latency of 200 ms between the trigger and the start of the speech. This latency is very consistent, allowing the speech to stay in the tempo even if not precisely on the beat. The firmware uses the gate going down to stop the speech, preparing the chip for the next utterance, so when sequencing some stutter-like speech in a loop, they still fire up in sync with the tempo.

Real-time
The various voice parameters are applied just before triggering the speech and are ineffective while speaking. They are applied to the next utterance. The best is to fiddle a bit with the knobs for the desired effects.

Firmware

Vax Firmware 82.59 KB 12233 downloads

Download

DECtalk user guide 1.08 MB 8316 downloads

A very useful guide to help you using the DECtalk command and building your own very...

Download

Text files

Apollo 11 complete transcript 709.93 KB 8322 downloads

Credits here I modified the script so the various callsigns use differents DECtalk...

Download

Long text 73.41 KB 9595 downloads

These text files with a .spk are to be read by the SD reader mode in VAX. The text...

Download

Singing demo 0.61 KB 5832 downloads

...

Download

Designing Voices 0.73 KB 5053 downloads

Load this file to learn how to design your own voices by tweaking the various voice...

Download

Spell mode demo 2.71 KB 7880 downloads

2 files to show how to use the spell mode : one with a list of three letter acronym...

Download

Isaac Asimov's "Three Laws of Robotics" - pseudo singing mode demo 0.00 KB 42 downloads

Isaac Asimov's "Three Laws of Robotics" First 3 lines in normal text to speech mode,...

Download

Oblique Strategies 3.58 KB 6117 downloads

Brian Eno's Oblique Strategies Original 1975 version (113 lines) ...

Download

Songs

Songs 3.89 KB 8037 downloads

A few songs to play with VAX ... Daisy Imperial March The Star-Spangled Banner USSR...

Download

Media

Kaiwa Alternative Voice : F4 (Kaiwaii cute female)

Kaiwa Alternative Voice : M6 (male)

Kaiwa Alternative Voice : F1 (slow female)

Kaiwa Assembled

Prepare your text file in Romaji and have Kaiwa read it out loud for you!

Just in case you wonder how Romaji look like: here is the Japanese lyrics of Kraftwerk’s Radioactivity :

cherunobuiri
harisuba-gu
serafi-rudo
hiroshi-ma
fukushi-ma
nihon,demo,houshanou
kyoumo,itsumademo
fukushima,houshanou
kuuki,mizu,subete
nihon,demo,houshanou
i-ma--sugu,ya-mero-

note: the hyphen is used to prolong the vowels (see the Kaiwa-vox’s manual for more on this)

Workflow

Group your lines into a song and save it as a text file on the SD card :

On Kaiwa, manually pick a song and use the Sound pot or its CV entry to pick the line in that song.

A gate signal will trigger the sound while the various pot and CV entries allow tweaking the voice characteristics before processing the speech (not in real-time, but they are applied on the next utterance).

Firmware

Kaiwa Firmware 28.03 KB 637 downloads

Firmware load : Option 1 Saving the "firmware.bin" on the SD card and restart Option...

Login is required to access this page

Login

SD card

(if you need to get started with a new sd card)

Kaiwa SD card 1.36 KB 9706 downloads

Image of Kaiwa's default SD card. It contains some files to get started Unzip and...

Download

Quickstart Manual

Kaiwa quickstart manual 658.63 KB 6937 downloads

...

Download

USB TTS mode

This little demo to shows how to use the USB TTS mode on Kaiwa. This function is designed to help you debug your Japanese romaji lines before storing them into a text file for the SD card. (see the user manual for more details).

The speech is triggered by the gate signal, so this mode can also be used to edit and play live text.

Media

Mea vox

Firmware

Mea Firmware + MIDI USB 62.44 KB 418 downloads

Firmware load : Option 1 Saving the "firmware.bin" on the SD card and restart Option...

Login is required to access this page

Login

Mea Firmware + MIDI TRS 62.44 KB 16 downloads

Please note that when using MIDI from the TRS jack on Terminal, the front switch...

Login is required to access this page

Login

SD Card

(if you need to get started with a new SD card)

Mea SD card 1.28 KB 7078 downloads

Some examples to get you started ... ...

Download

Extras

Citroen / Renault 25 vocabulary for MEA vox

6 file(s) 163.27 KB

Login is required to access this page

Login

610 words for MEA vox

2 file(s) 163.27 KB

Login is required to access this page

Login

MIDI

The firmware has a MIDI function :

– MIDI note assigned to 39 phonemes.
– Velocity and Pitch bend supported.
– Can hold the note for singing effect.

Ableton MEA vox MIDI effect rack 40 12.07 KB 2557 downloads

Midi notes are named to match Mea's 40 phonemes. (See MIDI mode in the manual) ...

Login is required to access this page

Login

Use with a microcontroller

The hardware is compatible with the mikroBUS^TM standard, allowing rapid prototyping and easy integration into existing projects.

The onboard class-d amplifier can drive a small 8 ohms speaker.

This vox board was designed to be used with EMY and its appropriate firmware. (in this case, the audio signal is forwarded via the “PWM” pin – the jack is not used with Emy)

It can also be used in robots, toys, or any other retrocomputing speaking devices you may think of.

Thanks to the popular mikroBUS™, this board can be instantly connected to multiple different platforms like the Arduino, the STMT32 of the Clicker2.
The onboard amplifier is quite powerful and is just waiting for the speaker to be plugged in!

Breadboard Connection

Arduino Connect

Code

The code is available here

Text to Speech

The chip can string phonemes to produce quite realistic speech (although with a French accent). It can also sustain notes for a nice singing effect. Here is how the Arduino code looks like :

dire("/bonj+w_r");
delay(400);
dire("&l&kt€Ronik miwzik");
delay(400);
dire("i$ b+i-n ka+I+n -ma/$=+_ine");
delay(400);
dire("+?je pE ossi murmur*?");
delay(400);
dire("je suis un arduino qui parle"); // à corriger ;-)
delay(400);

You will find more information about the various phonemes and the prosodic markers in the following manual.

Hardware User Manual

Media

Voir cette publication sur Instagram

Une publication partagée par Jean-Luc Deladrière (@polaxis)

Jean-Luc Deladrière · MEA

Aria (firmware)

Installation

Main Menu

Code

Demo

Nexus Vox

Features

CV control

Inline commands

Phonetic to speech

Phonemes mode

Latency

MIDI

Firmware Nexus vox firmware (for Emy & Terminal) 2 file(s) 81.43 KB Login is required to access this page Login

Text files

Phonetics 0.32 KB 33 downloads

Commands 0.28 KB 28 downloads

Stand alone

Media

S.A.M (firmware)

Installation

Demo

Sino Vox

Timing

Firmware

SD card

MIDI

Ableton MIDI rack

Media

Robovox

Specifications

Hardware

Midi controls

Emy / Terminal EMY Firmware

Sequencing the chip

VST-Plugin

Ableton MIDI Effect Rack

Media

Terminal

Terminal in action

VAX vox

What is Dectalk

Timing

Firmware

Vax Firmware 82.59 KB 12233 downloads

DECtalk user guide 1.08 MB 8316 downloads

Text files

Apollo 11 complete transcript 709.93 KB 8322 downloads

Long text 73.41 KB 9595 downloads

Singing demo 0.61 KB 5832 downloads

Designing Voices 0.73 KB 5053 downloads

Spell mode demo 2.71 KB 7880 downloads

Isaac Asimov's "Three Laws of Robotics" - pseudo singing mode demo 0.00 KB 42 downloads

Oblique Strategies 3.58 KB 6117 downloads

Songs

Songs 3.89 KB 8037 downloads

Media

Kaiwa Alternative Voice : F4 (Kaiwaii cute female)

Kaiwa Alternative Voice : M6 (male)

Kaiwa Alternative Voice : F1 (slow female)

Kaiwa Assembled

Workflow

Firmware

Kaiwa Firmware 28.03 KB 637 downloads

SD card

Kaiwa SD card 1.36 KB 9706 downloads

Quickstart Manual

Kaiwa quickstart manual 658.63 KB 6937 downloads

USB TTS mode

Media

Mea vox

Firmware

Mea Firmware + MIDI USB 62.44 KB 418 downloads

Mea Firmware + MIDI TRS 62.44 KB 16 downloads

SD Card

Mea SD card 1.28 KB 7078 downloads

Extras

MIDI

Ableton MEA vox MIDI effect rack 40 12.07 KB 2557 downloads

Use with a microcontroller

Firmware

Nexus vox firmware (for Emy & Terminal)

2 file(s) 81.43 KB

Login is required to access this page
Login