These docs are for v1.0. Click to read the latest docs for v2.0.

Bug fixes and code quality enhancements

  • Bug fix for joining speech files back together when there was an error, customer facing bug
  • We improved the code quality in one of our core services improving your customer experience
  • New SDK version https://pypi.org/project/audiostack/

 Platform update

  • New features in Platform. Save audio assets locally, so that your session is saved, improving the customer experience.

New mastering preset 🎉

  • We added a "voice only" preset (mastering engine feature) 💯
  • By using this preset, mastering will only applied to the generated voice and not the sound design beneath it. This is perfect for cases where you've already applied mastering to the sound template (or received a finished advert that you want to add an audio tag to).

Code Example:

mix = audiostack.Production.Mix.create(
    speechItem=tts,
    soundTemplate="pulsing_guitar",
    masteringPreset="voiceonly",
)

Docs

Terms of use and Legal

Sound templates

Speech / Predict

  • We've added a new endpoint which draws on 4 million data points to predict the duration of speech from a given script and list of voice names 🤓
  • This will allow users to determine whether a voice is suitable, or whether their script is the correct length for their project, without needing to generate the audio itself, saving time and credits! ⏰
  • Find out more and check out a code example in our guide, in the API reference or in the video below 👇

📢 We've added documentation and code examples on how to generate an audio "tag" and add it to the end of an existing audio advert. Find out more here.

Sound Templates

Sound template terms of use have been added for free and paying customers - we can now refer to the terms of use if people have questions about how they are allowed to use our templates.

You can refer here - https://audiostack.ai/terms-of-use

 Security updates

📘

Security updates

This doesn't mean that our old API was insecure, we're committed to updating and reducing any risks of compromise to our customers. We regularly review security updates, and we regularly patch any found errors.

We did a range of fixes to cover some security issues. These include

We are continuously working on these improvements and also decommissioned some older parts of our API as well.

Voice Playground (Platform) - Beta 🚧

  • New workflow which is the replacement for the simple frontend we previously had in the example frontends
  • A few small UI improvements and new sound library will come next week
  • See it at http://platform.audiostack.ai

New Voice Library

  • Updated the design
  • Fixed provider names
  • Is completely reusable and can be accessed at: http://library.audiostack.ai in the platform as a standalone workflow with the ability to preview, and in the voice select modal in the voice playground and sonic sell

SSML Prosody

We added the following functionality to have better prosody across providers. , ,

import audiostack


audiostack.api_key = "APIKEY"

text = """ The following part should be <as:prosody pitch="x-high" rate="x-fast" volume="x-loud">higher, faster and louder</as:prosody>,   
           """ 

script = audiostack.Content.Script.create(scriptText=text)
print(script.message, script.scriptId)
tts = audiostack.Speech.TTS.create(scriptItem=script, voice="henry", voiceIntelligence=True)
print(tts)
item = audiostack.Speech.TTS.get(tts.speechId)
item.download()           

We are changing the way we're providing sound templates to make it simpler for end users. Going forward, we're making it easier to understand what rights you have over the content you're working with, as we move towards offering a range of exclusive PRO-free sound templates (public rights organization free), which means that users can use the content accessed much more flexibly and with fewer restrictions.

More content is coming in the next few weeks, but while we move forward with this some of our old demo content, which had much more restrictive usage conditions, will be unavailable. If there's a sound template you really like and would like to use something similar for a broadcast ad or in a commercial setting, let us know and we will make sure we include a sound template that meets your needs.

Find out more about the sound templates we have available here:

https://docs.audiostack.ai/docs/use-sound-templates

/TTS/Reduce

We've added a new endpoint to the AudioStack API, which allows users to condense their speech files to fit within a fixed duration, while retaining the speech's natural sound.

Common use cases include:

  • 🕘 Creating audio adverts that need to fit within a fixed time slot (e.g. 15 or 30 seconds)
  • 💬 Generating audio "tags" and terms and conditions text for the end of your audio file

Check out a code example and more info in our user guide.