We made a major update to our mastering engine, updating libraries and improving performance ๐ You should have less error rates and a better customer experience.
Our celebrated voice intelligence layer is now UP TO 5X faster ๐ค
We've added to the voice intelligence layer an enhanced load balancer so now our platform works with longer form text. ๐ฏ
This solves a major issue that many customers reported which was "fizzing out" of content. This should make your customers and users even happier ๐ถ
Error messages
We've made many improvements to error messages so it should be clearer where some speech requests fail, and how to fix the error. This enhances the developer and user experience.
We added better filtering to our sound templates, thanks to Pascal who pointed out this bug and we fixed it๐ท
We made some security improvements to enhance customer trust!
We fixed a bug in correct MIME type of uploaded audio in our Audio Engine ๐
Voice Intelligence Layer updates
We're continuously working on our Voice Intelligence Layer.
We fixed some 11labs errors and artefacts when script texts are long ๐ Thanks to our customers for pointing this out ๐ฏ
Allows to process long scripts as each section is fragmented in several sections when necessary. This means longer scripts will be processed faster no timeouts ๐ฏ
Ballsy Rock
Chopper Horns
Chopper Strings
Clap Together
Cool Industries
Dark And Unsettling
Design Grove
Epic Brass Trap
Epic Pace
Ethereal Dream
Fascinating Technology
Friendly Electro Pop
Friendly Fantasy
Garage Banger
Indie Disco
Laid Back Funk
Piano Artist
Power Sports
Pulsing Out Score
Rainbow Rock
Relaxed Vacation
RelaXmas
Rockabilly Mischief
Stepping Up
Sunny Skies
Surfing Dog
Trailer Beast
Vintage Swing
Welcoming Piano
Wild Beast
We fixed some UX problems (thanks to our customers for pointing this out) in the Media endpoints.
You can now do a lot more with media files, namely, place the name of the media file directly in the script using the name="" attribute.
You can also use the id="" attribute to create a placeholder, that can be overwritten in the mastering call:
import audiostack
import os
audiostack.api_key = os.environ["AUDIO_STACK_DEV_KEY"]
response = audiostack.Content.Media.create(filePath="default.wav")
print(response)
mediaId = response.mediaId
script = """
<as:section name="intro" soundsegment="intro">
hello world <as:media name="default.wav" id="file1"/>
</as:section>
"""
script = audiostack.Content.Script.create(scriptText=script)
print(script)
speech = audiostack.Speech.TTS.create(scriptItem=script, voice="sara")
print(speech)
mastering = audiostack.Production.Mix.create(
speechItem=speech,
# mediaFiles={
# "file1" : <any media id of an uploaded file">
# }
)
print(mastering)
Intern showcase
Our intern William built something cool ๐
https://github.com/aflorithmic/news_article_summarizer have a look here at this news article summarizer - leveraging Beautiful Soup ๐, OpenAI ๐ฏ and our AudioStack API, it takes a URL and produces a beautiful audio summary. Have a play!
Exciting news! We've been hard at work collaborating with our providers to bring you more hyper-realistic English voices ๐ฅณ We're thrilled to introduce 35 new voices from ElevenLabs, including 14 American accents ๐บ๐ธ, 19 British accents ๐ฌ๐ง, and 2 Australian accents ๐ฆ๐บ. Additionally, we've added 41 voices from WellSaid Labs, featuring 28 American English ๐บ๐ธ, 6 British English ๐ฌ๐ง, 2 English Mexican ๐ฒ๐ฝ , 1 English South African ๐ฟ๐ฆ, and 4 English Australian voices ๐ฆ๐บ. Head over to our Library (library.audiostack.ai) to explore and try them all out!
Suggestions to try:
Finch, 11L, US
Elora, 11L GB
Conversational_greg WSL AU
Narration_Issa WSL ZA
Narration_lorenzo WSL MX:
**Stay tuned as thereโs more coming in the next month!
We've enhanced our msnr voices with punctuation symbols and respective lengths, as well as enabling break tags. This should add naturalness and enhanced pronunciation, as well as improved speech flow. Some examples:
Full break is 600 milliseconds.
Comma is 300 milliseconds,
Colon is 300 milliseconds:
Semicolon is 400 milliseconds;
Question mark is 400 milliseconds?
Exclamation mark is 600 milliseconds!
Break tag with three seconds
Break tag with thousand millisecond
Break tag with thousand milliseconds using single quote
Break tag with thousand milliseconds using single quote and no space
ย Voice improvements
Special characters are supported again in the API: & and %. These are both normalised for English and German voices!