We have added audio enhancement endpoints, added our Text Tagger to the SDK, and made some security improvements and API improvements for our Voice Intelligence Layer.

New Audio enhancement endpoints

We are continuously working on enriching our audio capabilities, so we've added the ability to evaluate, enhance and denoise audio files.

:headphones: Evaluate audio: Evaluate an audio file.

:headphones: Enhance audio: Evaluate and enhance an audio file.

:headphones: Denoise audio: Denoise an audio file to remove background noise.

Improvements

:100: A quality-of-life improvement: the Voice Intelligence Layer is now enabled if you use one of our built-in tags, such as <as:break>.

:100: We also improved the file transfer routes, so you no longer need to add file names.

:bug: We added some security improvements to the voice preview for our customers.

Recommendations using our Text Tagger

We've added recommendations for tags, moods and tones using the Text Tagger.

See how to use them in our docs: https://docs.audiostack.ai/reference/postrecommendmoods

import audiostack

# Assumes your AudioStack API key has already been configured for the SDK.

text = "AudioStack's technology seamlessly integrates into your product or workflow and cuts your audio production cycles to seconds while making your budgets go further."
number_of_results = 2
tags = ["happy", "fun", "valuable", "fast"]
category = "my category"

# Recommend the best matches from your own list of tags.
tag_item = audiostack.Content.RecommendTag.create(
    text=text, number_of_results=number_of_results, tags=tags, category=category
)
print(tag_item)

# Recommend moods from AudioStack's curated list.
mood_item = audiostack.Content.RecommendMood.create(
    text=text, number_of_results=number_of_results
)
print(mood_item)

# Recommend tones from AudioStack's curated list.
tone_item = audiostack.Content.RecommendTone.create(
    text=text, number_of_results=number_of_results
)
print(tone_item)

Another exciting addition to the AudioStack API: you can now evaluate an audio file to receive a quality evaluation, and then enhance or denoise it as needed.

We added three new endpoints as part of the Production Suite (i.e. /production/suite/), which can help you to check the quality of your recording, for example before using it for voice cloning.

With the /production/suite/enhance endpoint, you can let AudioStack do the work and both evaluate and enhance your file as needed.

Find out more in the docs:

https://docs.audiostack.ai/reference/evaluateaudio
https://docs.audiostack.ai/reference/enhanceaudio
https://docs.audiostack.ai/reference/denoiseaudio
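As a rough illustration of how the three Production Suite endpoints relate, here is a minimal sketch that prepares (but does not send) a request for each action. Only the /production/suite/enhance path is named in this changelog. The base URL, the evaluate and denoise paths, the `x-api-key` header and the `fileId` payload field are all assumptions made for this example, so check the reference pages above for the real request format.

```python
import json
from urllib.request import Request

# Assumption: base URL of the AudioStack API. Check the docs linked above.
BASE_URL = "https://v2.api.audio"

# Only /production/suite/enhance is named in the changelog. The other two
# paths are guesses that follow the same pattern.
SUITE_PATHS = {
    "evaluate": "/production/suite/evaluate",
    "enhance": "/production/suite/enhance",
    "denoise": "/production/suite/denoise",
}


def build_suite_request(action: str, file_id: str, api_key: str) -> Request:
    """Prepare (but do not send) a POST request for a Production Suite action."""
    if action not in SUITE_PATHS:
        raise ValueError(f"unknown action: {action!r}")
    # Hypothetical payload: the real request schema is in the API reference.
    body = json.dumps({"fileId": file_id}).encode("utf-8")
    return Request(
        url=BASE_URL + SUITE_PATHS[action],
        data=body,
        headers={"x-api-key": api_key, "Content-Type": "application/json"},
        method="POST",
    )


req = build_suite_request("enhance", "my-file-id", "YOUR_API_KEY")
print(req.get_method(), req.full_url)
```

Building the request separately from sending it makes it easy to inspect the URL and body before pointing the code at a real account.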

Voice Cloner File Size Update :loudspeaker:

:100: We have shipped some improvements to the maximum file size you can upload to our file manager to clone your voice: now you can upload recordings of any file size! :confetti-ball:

These changes make it easier to upload larger recordings for voice cloning, which also means more natural-sounding voice clones with higher-quality audio output. They also optimise the processing workflow for improved efficiency. :rainbow:

Recommendation Endpoint

We are excited to introduce new enhancements to our recommendation endpoint, offering users an even more tailored and dynamic experience. :confetti-ball: This update focuses on providing personalized suggestions based on tone, mood, or user-specified tags.

See how to use them on our docs here, here and here

Here’s a breakdown of the key changes:

:triangular-ruler: Recommend Tags

:one: Users can now take control of their content recommendations by providing a personalized list of tags. This empowers them to fine-tune suggestions based on specific interests or preferences.

:straight-ruler: Recommend Tones

:two: Introducing the ability to recommend the best tone from AudioStack’s curated list. Tones play a crucial role in determining the most suitable voice for your text, enhancing the overall audio experience.

:mortar-board: Recommend Moods

:three: Users can now discover the perfect mood for their content with recommendations from AudioStack’s carefully curated list. Selecting the right mood is essential for finding the optimal sound design that complements the text.

🦄 These updates aim to elevate user experience by offering greater flexibility and customisation in content recommendations. Stay tuned for more exciting features and enhancements in future releases!🦄

Voice Cloning update!

We updated the voice cloner and integrated our AES (audio evaluation service), so you get a better recorded voice :100:

  • Speech page with button and voices carousel
  • New API key modal copy based on customer feedback
  • New recording script

Fixes and enhancements

:bug: We fixed a bug by restricting the length of the organisation name.

:notebook-with-decorative-cover: Updated docs to give clearer examples of using Files as sound templates with Python SDK: https://docs.audiostack.ai/docs/custom-sound-design-templates

:100: Improvements to the UX for the AES (audio evaluation service).

  • Added the terms and conditions to our sign up page

:ship: We've shipped some improvements to the length prediction model: it now filters out break tags and SSML tags at inference, and handles negative predictions as well as empty text. This results in a more accurate estimate of the duration of your produced audio!
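To make the three cases concrete, here is a minimal sketch of the pre- and post-processing around a duration estimate. It is not the actual prediction model: a fixed, hypothetical words-per-second rate stands in for it, and the function names and the `offset` parameter are invented for illustration.

```python
import re

# Matches any XML-style tag, e.g. <as:break time="1s"/> or <prosody rate="slow">.
TAG_PATTERN = re.compile(r"<[^>]+>")

# Hypothetical speaking rate used as a stand-in for the real prediction model.
WORDS_PER_SECOND = 2.5


def strip_tags(text: str) -> str:
    """Remove break/SSML tags and collapse the whitespace left behind."""
    return " ".join(TAG_PATTERN.sub(" ", text).split())


def estimate_duration(text: str, offset: float = 0.0) -> float:
    """Estimate spoken duration in seconds, never returning a negative value."""
    spoken = strip_tags(text)
    if not spoken:
        return 0.0  # empty text (or tags only) has no spoken duration
    raw = len(spoken.split()) / WORDS_PER_SECOND + offset
    return max(raw, 0.0)  # clamp negative predictions to zero


print(strip_tags('Hello <as:break time="1s"/> world'))
print(estimate_duration("one two three four five"))
```

The `offset` argument exists only so the clamping branch can be exercised in this sketch, for example with `estimate_duration("hi", offset=-5.0)`.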

:moneybag: We shipped improvements to the billing system, enhancing our customer experience :100:

  • We fixed some bugs that caused inaccurate billing and improved invoice processing.
  • Improved the sign-up experience with a faster checkout flow.
  • Added yearly and quarterly payment programs.
  • We fixed some errors where credits were accounted for incorrectly.
  • We fixed some mismatches between the platform and the billing system, and made our extra credits system more robust.

:speedboat: We made significant improvements to our authentication flow. In tests it was up to 10x faster, so you should notice speed improvements throughout the platform, and in the API in particular.

📘

Recording Booth is in Beta 🎉

You can get access to our new Recording Booth to record your voice.

If you want to test out new AudioStack features as a beta tester, please reach out to support[at]audiostack[dot]ai

  • :bug: [SonicSell] Language changes implemented - produce the RIGHT language for your voice. Customers reported that the wrong voices were sometimes being returned.
  • 🏷️ Improved tagging and categories for your files in the Content area.
  • 📄 Instant voice cloning guide added to the docs
  • Improved the "view more" control on SonicSell ads to make it easier to use 🤓
  • Integrated the new File Management service with mastering: this means you can now upload media files in the platform and use them in your sound templates and mastering 📂

🚀 We are excited to announce a significant update to our API, which now supports Speech Synthesis Markup Language (SSML) across all our providers. This enhancement empowers users with greater control and customization over the synthesized audio output.

At AudioStack, as a provider-agnostic service, we recognise the importance of granular control over your content regardless of provider, and we have now harmonised the most used and important tags to help you tailor your content to the maximum. :arrow-right: This means that you can use the same tag format across multiple providers. If the change in question isn't supported by a particular provider, your code will still run :clap:.

:100: You can now leverage various speech elements such as: pitch, rate, language, along with the ability to insert pauses and control pronunciation, enhancing the naturalness and expressiveness of the generated synthetic voices.
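For readers new to SSML, here is a minimal sketch that builds a small snippet controlling pitch, rate and a pause, using only the Python standard library. The element names (`speak`, `prosody`, `break`) follow the W3C SSML convention. Whether AudioStack's harmonised tags use exactly these names and attributes is an assumption, so treat the updated documentation as the authority on the supported syntax.

```python
import xml.etree.ElementTree as ET


def build_ssml(text: str, pitch: str = "medium", rate: str = "medium",
               pause: str = "500ms") -> str:
    """Wrap text in a prosody element and append a pause, as an SSML string."""
    speak = ET.Element("speak")
    # W3C-style prosody attributes; the harmonised AudioStack syntax may differ.
    prosody = ET.SubElement(speak, "prosody", {"pitch": pitch, "rate": rate})
    prosody.text = text
    ET.SubElement(speak, "break", {"time": pause})
    return ET.tostring(speak, encoding="unicode")


ssml = build_ssml("Welcome to AudioStack.", pitch="high", rate="slow")
print(ssml)
```

Building the markup with an XML library rather than string concatenation keeps it well-formed and escapes characters such as `&` in the text automatically.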

:cookie: Brownie point: For our users that have cloned their voice with AudioStack, we now offer the capability to localize their content in any EFIGS (English, French, Italian, German, Spanish) language, plus Mexican Spanish. Please refer to our docs here to find out how! (*Note that the syntax used is different)

🦜 This addition aligns with our commitment to providing cutting-edge tools for a seamless and immersive audio experience, unlocking new possibilities for developers and users alike. Explore the updated documentation here to incorporate SSML and elevate your audio applications to the next level!

📘

About Deprecation Warnings

To ensure we keep providing the best customer experience and performance possible, AudioStack will occasionally deprecate a feature or service from the API or the platform. We’ll always provide advance notice and let you know of any actions needed. We communicate this both here in the changelog and directly to customers we know are affected, before replacing the feature and removing it from the API.

We're in the process of upgrading our infrastructure around media files. We recently added the ability to upload and manage files in the AudioStack platform. We also upgraded our media files management to a new set of endpoints, which can be found here: https://docs.audiostack.ai/reference/postfile.

The old method for uploading media files, using /media/, is now being deprecated: https://docs.audiostack.ai/reference/postmedia

All customers using /media/ endpoints should switch over to /file/ within the next week to avoid potential interruptions. If you have any concerns or need support with this migration, please contact our support team ASAP.
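If you are unsure whether your integration still calls the deprecated routes, a quick scan of your own code can help. The sketch below is a hypothetical helper, not part of the AudioStack SDK. It only flags lines that mention a /media route and does not rewrite them, since the /file/ endpoints may expect different request parameters. The sample snippet it scans is made up for the example.

```python
import re
from typing import List, Tuple

# Matches "/media" as a whole path segment, so "/mediafiles" is not flagged.
DEPRECATED_ROUTE = re.compile(r"/media(?=/|\b)")


def find_deprecated_calls(source: str) -> List[Tuple[int, str]]:
    """Return (line_number, line) pairs that mention the /media route."""
    hits = []
    for number, line in enumerate(source.splitlines(), start=1):
        if DEPRECATED_ROUTE.search(line):
            hits.append((number, line.strip()))
    return hits


# Made-up snippet standing in for your own integration code.
sample = '''upload_url = base_url + "/media/"
new_url = base_url + "/file/"
'''

for number, line in find_deprecated_calls(sample):
    print(f"line {number}: {line}")
```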