Mastering engine

We made a major update to our mastering engine, updating libraries and improving performance ๐Ÿš€ You should have less error rates and a better customer experience.

Voice Intelligence Layer

  • Voice Intelligence Layer Is the link to the voice intelligence layer check it out!
  • Our celebrated voice intelligence layer is now UP TO 5X faster ๐Ÿšค
  • We've added to the voice intelligence layer an enhanced load balancer so now our platform works with longer form text. ๐Ÿ’ฏ
  • This solves a major issue that many customers reported which was "fizzing out" of content. This should make your customers and users even happier ๐Ÿถ

Error messages

  • We've made many improvements to error messages so it should be clearer where some speech requests fail, and how to fix the error. This enhances the developer and user experience.

Share your SonicSell creation by QR code ๐ŸŽ‰

Our customers asked us for the ability to share your creations in sonic sell by QR code. And we answered them with this feature!

Try it now! So you can share your beautiful creations with your friends! ๐Ÿ’ฏ

Some bug fixes and usability enhancements

  • We added better filtering to our sound templates, thanks to Pascal who pointed out this bug and we fixed it๐Ÿ‘ท
  • We made some security improvements to enhance customer trust!
  • We fixed a bug in correct MIME type of uploaded audio in our Audio Engine ๐Ÿ›

Voice Intelligence Layer updates

We're continuously working on our Voice Intelligence Layer.

  • We fixed some 11labs errors and artefacts when script texts are long ๐Ÿ› Thanks to our customers for pointing this out ๐Ÿ’ฏ
  • Allows to process long scripts as each section is fragmented in several sections when necessary. This means longer scripts will be processed faster no timeouts ๐Ÿ’ฏ

SSML Harmonisation

A commonly asked question by customers is "how do I make sure the same SSML works across providers".

So today we're VERY excited to launch our first (of many) features ๐ŸŽ‰ ๐ŸŽ‚

You can run this on ANY provider and it'll either work or it'll fail. So your code will work with ANY provider and ANY voice

  •    body = {
            "projectName": "__TEST",
            "scriptText": '<as:section> hello <as:break time="4s"/> worlds </as:section>',
        }
    

We showed this in beta to our customers and we got this quote

"This will save us so much time 10% of our bugs are due to things like this" - Software Developer at an Advertising company

How do I get started?

Simply to get stared just update

<break time="150ms"/>

to <as:break time="150ms"/>

Other reading

SSML Tags

If you want an update on SSML wikipedia is great.

ย Better previews with sharing

We've enabled better previews when you share, enabling meta data when you share your audio creations from SonicSell on socials.

Particularly the title, description and image metatags as you can see below:

Enabling you to have better sharing of your beautiful audio ๐ŸŽง

New Sound Templates

  • Added 30 NEW sound templates
Ballsy Rock
Chopper Horns
Chopper Strings
Clap Together
Cool Industries
Dark And Unsettling
Design Grove
Epic Brass Trap
Epic Pace
Ethereal Dream
Fascinating Technology
Friendly Electro Pop
Friendly Fantasy
Garage Banger
Indie Disco
Laid Back Funk
Piano Artist
Power Sports
Pulsing Out Score
Rainbow Rock
Relaxed Vacation
RelaXmas
Rockabilly Mischief
Stepping Up
Sunny Skies
Surfing Dog
Trailer Beast
Vintage Swing
Welcoming Piano
Wild Beast

Have fun sampling these :)

Examples

We've fleshed out our examples.

Video Voice Over use case for a video voice over use case

And we added to our github a news summarizer

We added a beautiful example using our new Sound Template functionality

  • Autotagger is enabled.

How to make beautiful Audio in seconds

ย Media improvements

We fixed some UX problems (thanks to our customers for pointing this out) in the Media endpoints.

You can now do a lot more with media files, namely, place the name of the media file directly in the script using the name="" attribute.
You can also use the id="" attribute to create a placeholder, that can be overwritten in the mastering call:

import audiostack
import os

audiostack.api_key = os.environ["AUDIO_STACK_DEV_KEY"]

response = audiostack.Content.Media.create(filePath="default.wav")
print(response)
mediaId = response.mediaId


script = """
<as:section name="intro" soundsegment="intro">
  hello world <as:media name="default.wav" id="file1"/>
</as:section>
"""

script = audiostack.Content.Script.create(scriptText=script)
print(script)
speech = audiostack.Speech.TTS.create(scriptItem=script, voice="sara")
print(speech)

mastering = audiostack.Production.Mix.create(
    speechItem=speech, 
    # mediaFiles={
    #     "file1" : <any media id of an uploaded file">
    # }
)
print(mastering)

Intern showcase

Our intern William built something cool ๐Ÿ†’

https://github.com/aflorithmic/news_article_summarizer have a look here at this news article summarizer - leveraging Beautiful Soup ๐Ÿ˜, OpenAI ๐Ÿ’ฏ and our AudioStack API, it takes a URL and produces a beautiful audio summary. Have a play!

Multilingual voices in our Frontend

We updated our library

You can see all of the multilingual voices - plus what languages they support.

Bug fixes

As we build out the AudioStack we're constantly looking for improvements.

  • We fixed the trailing silence issue.

New voices WellSaid labs, ElevenLabs

Exciting news! We've been hard at work collaborating with our providers to bring you more hyper-realistic English voices ๐Ÿฅณ We're thrilled to introduce 35 new voices from ElevenLabs, including 14 American accents ๐Ÿ‡บ๐Ÿ‡ธ, 19 British accents ๐Ÿ‡ฌ๐Ÿ‡ง, and 2 Australian accents ๐Ÿ‡ฆ๐Ÿ‡บ. Additionally, we've added 41 voices from WellSaid Labs, featuring 28 American English ๐Ÿ‡บ๐Ÿ‡ธ, 6 British English ๐Ÿ‡ฌ๐Ÿ‡ง, 2 English Mexican ๐Ÿ‡ฒ๐Ÿ‡ฝ , 1 English South African ๐Ÿ‡ฟ๐Ÿ‡ฆ, and 4 English Australian voices ๐Ÿ‡ฆ๐Ÿ‡บ. Head over to our Library (library.audiostack.ai) to explore and try them all out!

Suggestions to try:

Finch, 11L, US

Elora, 11L GB

Conversational_greg WSL AU

Narration_Issa WSL ZA

Narration_lorenzo WSL MX:

**Stay tuned as thereโ€™s more coming in the next month!

**

https://library.audiostack.ai/

ย Voice improvements

We've enhanced our msnr voices with punctuation symbols and respective lengths, as well as enabling break tags. This should add naturalness and enhanced pronunciation, as well as improved speech flow. Some examples:

Full break is 600 milliseconds.
Comma is 300 milliseconds,
Colon is 300 milliseconds:
Semicolon is 400 milliseconds;
Question mark is 400 milliseconds?
Exclamation mark is 600 milliseconds!
Break tag with three seconds
Break tag with thousand millisecond
Break tag with thousand milliseconds using single quote
Break tag with thousand milliseconds using single quote and no space

ย Voice improvements

Special characters are supported again in the API: & and %. These are both normalised for English and German voices!