Voices update

❌ We have now deprecated all our VocaliD Voices. If you encounter an error, please try one of our other voices from 7 providers, featuring DeepZen, CereProc, and Resemble. ai.

👍

Spoiler Alert: We have some surprise new voices coming in the following weeks, so keep an eye out!

IBM Voices💡

IBM Voices: In line with IBM deprecation of the voices listed here, we have also updated our offering to only present you with V3 and Expressive voices.

Expressive neural voices offer natural-sounding speech that is exceptionally clear and crisp. Their pronunciation and inflections are natural and conversational, and the resulting speech offers extremely smooth transitions between words. The voices determine sentiment from context and automatically use the proper intonation to suit the text. You can try duchess and reynold to test out the expressive voices.

Stay tuned with our updates and let’s build the future of audio together!🎧

I wanna hear an example

A demo is below with reynold

import audiostack


"""
Hello! Welcome to the audiostack python SDK. 
"""

# make sure you change this to be your api key, or export it as APIKEY="<key>"
audiostack.api_key =  "APIKEY"
print("In Content you can create scripts and manage your porduction assets.")
script = audiostack.Content.Script.create(scriptText= 
"Welcome to AudioStack, the world's most powerful AI audio creation infrastructure. The unlimited possibilities of generative AI at your fingertips. In one single API.", projectName="testingthings")
print(script)

print("In Speech you can access almost a thousand AI voice models or your own, cloned voice.")
voices = audiostack.Speech.TTS.list()
tts = audiostack.Speech.TTS.create(scriptItem=script, voice="reynold")


print("In Production you can dynamically mix it with a sound design of your choice and master it so it sounds great.")
mix = audiostack.Production.Mix.create(speechItem=tts, soundTemplate="cityechoes", masteringPreset = "balanced")
print(mix)

print("In Delivery, we produce a great sounding file and deliver it where you need it.")
encoder = audiostack.Delivery.Encoder.encode_mix(productionItem=mix, preset="mp3")
encoder.download(fileName="MyFirstAudioStackTrack")

We've got a lot more coming.

We are delighted to announce new features.

  • You can now handle Roman Numerals in our Voice Intelligence Layer. What does this mean? Well most text to speech providers struggle with stuff like "Charles IV" this is particularly hard problem. Here's an example!
text = """Johanna VI. war eine große Königin. Benedikt XVI. starb letzes Jahr.
/Die Kinder Marias II. standen weiter hinten in der Thronfolge.
  Die Herrschaft Karls V. dauerte mehrere Jahrzehnte. Ich sah Charles III. /
    zum ersten Mal. Er zeichnete Anna I. auf ihrem Sterbebett. Vor Edward XX gab / 
      es keine Feinde. Ich kämpfte nie gegen Pedro VI."""

script = audiostack.Content.Script.create(scriptText=text)

tts = audiostack.Speech.TTS.create(scriptItem=script, voice="vicki", useDictionary= True, useTextNormalizer= True)
print(tts)

item = audiostack.Speech.TTS.get(tts.speechId)
        
item.download(fileName=item.speechId)
  • We’ve now integrated our voice cloning capabilities into Audiostack! You can invite yourself to try it out from the AudioStack Console

Welcome to Audiostack

by Peadar Coyle

Welcome to the developer hub and documentation for Audiostack!