/TTS/Reduce

We've added a new endpoint to the AudioStack API, which allows users to condense their speech files to fit within a fixed duration, while retaining the speech's natural sound.

Common use cases include:

  • πŸ•˜ Creating audio adverts that need to fit within a fixed time slot (e.g. 15 or 30 seconds)
  • πŸ’¬ Generating audio "tags" and terms and conditions text for the end of your audio file

Check out a code example and more info in our user guide.

Mastering engine

We made a major update to our mastering engine, updating libraries and improving performance πŸš€ You should have less error rates and a better customer experience.

Voice Intelligence Layer

  • Voice Intelligence Layer Is the link to the voice intelligence layer check it out!
  • Our celebrated voice intelligence layer is now UP TO 5X faster 🚀
  • We've added to the voice intelligence layer an enhanced load balancer so now our platform works with longer form text. πŸ’―
  • This solves a major issue that many customers reported which was "fizzing out" of content. This should make your customers and users even happier 🐢

Error messages

  • We've made many improvements to error messages so it should be clearer where some speech requests fail, and how to fix the error. This enhances the developer and user experience.

Share your SonicSell creation by QR code πŸŽ‰

Our customers asked us for the ability to share your creations in sonic sell by QR code. And we answered them with this feature!

Try it now! So you can share your beautiful creations with your friends! πŸ’―

Some bug fixes and usability enhancements

  • We added better filtering to our sound templates, thanks to Pascal who pointed out this bug and we fixed itπŸ‘·
  • We made some security improvements to enhance customer trust!
  • We fixed a bug in correct MIME type of uploaded audio in our Audio Engine πŸ›

Voice Intelligence Layer updates

We're continuously working on our Voice Intelligence Layer.

  • We fixed some 11labs errors and artefacts when script texts are long πŸ› Thanks to our customers for pointing this out πŸ’―
  • Allows to process long scripts as each section is fragmented in several sections when necessary. This means longer scripts will be processed faster no timeouts πŸ’―

SSML Harmonisation

A commonly asked question by customers is "how do I make sure the same SSML works across providers".

So today we're VERY excited to launch our first (of many) features πŸŽ‰ πŸŽ‚

You can run this on ANY provider and it'll either work or it'll fail. So your code will work with ANY provider and ANY voice

  •    body = {
            "projectName": "__TEST",
            "scriptText": '<as:section> hello <as:break time="4s"/> worlds </as:section>',
        }
    

We showed this in beta to our customers and we got this quote

"This will save us so much time 10% of our bugs are due to things like this" - Software Developer at an Advertising company

How do I get started?

Simply to get stared just update

<break time="150ms"/>

to <as:break time="150ms"/>

Other reading

SSML Tags

If you want an update on SSML wikipedia is great.

Β Better previews with sharing

We've enabled better previews when you share, enabling meta data when you share your audio creations from SonicSell on socials.

Particularly the title, description and image metatags as you can see below:

Enabling you to have better sharing of your beautiful audio 🎧

New Sound Templates

  • Added 30 NEW sound templates
Ballsy Rock
Chopper Horns
Chopper Strings
Clap Together
Cool Industries
Dark And Unsettling
Design Grove
Epic Brass Trap
Epic Pace
Ethereal Dream
Fascinating Technology
Friendly Electro Pop
Friendly Fantasy
Garage Banger
Indie Disco
Laid Back Funk
Piano Artist
Power Sports
Pulsing Out Score
Rainbow Rock
Relaxed Vacation
RelaXmas
Rockabilly Mischief
Stepping Up
Sunny Skies
Surfing Dog
Trailer Beast
Vintage Swing
Welcoming Piano
Wild Beast

Have fun sampling these :)

Examples

We've fleshed out our examples.

Video Voice Over use case for a video voice over use case

And we added to our github a news summarizer

We added a beautiful example using our new Sound Template functionality

  • Autotagger is enabled.

How to make beautiful Audio in seconds

Β Media improvements

We fixed some UX problems (thanks to our customers for pointing this out) in the Media endpoints.

You can now do a lot more with media files, namely, place the name of the media file directly in the script using the name="" attribute.
You can also use the id="" attribute to create a placeholder, that can be overwritten in the mastering call:

import audiostack
import os

audiostack.api_key = os.environ["AUDIO_STACK_DEV_KEY"]

response = audiostack.Content.Media.create(filePath="default.wav")
print(response)
mediaId = response.mediaId


script = """
<as:section name="intro" soundsegment="intro">
  hello world <as:media name="default.wav" id="file1"/>
</as:section>
"""

script = audiostack.Content.Script.create(scriptText=script)
print(script)
speech = audiostack.Speech.TTS.create(scriptItem=script, voice="sara")
print(speech)

mastering = audiostack.Production.Mix.create(
    speechItem=speech, 
    # mediaFiles={
    #     "file1" : <any media id of an uploaded file">
    # }
)
print(mastering)

Intern showcase

Our intern William built something cool πŸ†’

https://github.com/aflorithmic/news_article_summarizer have a look here at this news article summarizer - leveraging Beautiful Soup 😍, OpenAI πŸ’― and our AudioStack API, it takes a URL and produces a beautiful audio summary. Have a play!

Multilingual voices in our Frontend

We updated our library

You can see all of the multilingual voices - plus what languages they support.

Bug fixes

As we build out the AudioStack we're constantly looking for improvements.

  • We fixed the trailing silence issue.