In today's release, we gave Workshop a new visual design to make it easier to use and more consistent with the rest of the AudioStack platform. We also added some exciting new features, including:
Upload your own speech file to use as speech in your Workshop asset;
Generate speech-to-speech to use as speech in your Workshop asset;
Invite someone to record their voice from Workshop, using your asset's script; and,
Automatically apply the correct mastering for Voice files added to your asset.
To find out more about how to use these features (available in the Speech tab under Advanced Controls), please see the user guide for Workshop here.
Platform UX Improvements
We released a number of UX improvements in v1.25.0, including:
After closing the Voice Library modal, the filters will now stay in place, so if you accidentally close the window you will be able to pick up where you left off.
Added an Info button to make it clearer which licensing terms apply to our sound templates in the Sound Library.
Power users can now copy media files from the Files area as tags and paste them directly into their Workshop session - this means it's now possible to use voice and sound effect recordings alongside your text-to-speech!
Improved the UX of buttons on audio player cards, which direct you to different Workflows and allow you to save and upload your asset.
Added support for smaller screen sizes and updated copy on the Workflows page to make it easier for non-admin users to understand what our Workflows are.
Now, users will be directed to the Projects area to find created assets that are more than a day old in SonicSell or Workshop.
Platform Bug Fixes
Fixed a bug where the audio previews in Workshop were playing twice at the same time, leading to distorted audio.
We're excited to announce the latest Workflow coming to the AudioStack Platform: PowerImport. The wait is over: it’s time to bring your outdated ads into the future with AudioStack.
Starting today, you can:
Instantly Import and Recreate your Existing Ads: Log in to your AudioStack account and upload your legacy ads to make them fully editable with a single click.
Bulk Import: Import up to three ads at once.
Choose from 4 different languages: Upload ads in English, German, French or Spanish.
Customise and Personalise: Let AI select the perfect sound design, update the script, and select a new voice to bring your ads up to date, or pick them out yourself to resonate with new audiences and markets.
PowerImport works best with single-voice ads that have music in the background. However, we're constantly improving our AI, so watch this space 👀
In the latest release of the AudioStack platform, we updated the navigation to make it easier for you to find what you're looking for. This included a refresh of our Stacks, so now all of your workflows should be exactly where you expect them to be.
Looking for your files or for sessions you've already started working on? These can now be found right at the top of the navigation, in the Files and Projects areas, respectively.
From the Projects area, it's also now possible to create a new session in one of the workflows you have access to.
Workshop:
You can now record your voice (or record a voice input for speech-to-speech generation) from your script in Workshop.
Simply enter your script in the script box, and then click "Record Speech" to send your script directly over to the Recording Booth.
SonicSell:
Updated SonicSell ad player to enable a better user experience.
Fixed a bug with language selection.
Other Platform Improvements
We made a UX improvement to the speech-to-speech flow from Files, making it possible to go directly to your STS file after the process is complete.
We also released an improvement to the user invite page, to make it easier for admins to add users to multiple workflows.
As of today, 22 voices from provider DeepZen will be deprecated ⚰️. If you are using these voices and you'd like to be offered an alternative, please contact us at support [at] audiostack [dot] ai.
From the end of this week, the old version of SonicSell (v.1.5.0) will no longer be available.
Don’t worry - you’ll still be able to do everything you need (plus more new features!) in the new version (v.1.6.0). If you have any issues getting started, please let Support know.
Some users might have had beta access to the new version and may now need their admin to invite them to use SonicSell - admins can do this by going to the Organisations page in the Account menu.
Please download any assets you’d like to save from the old version of SonicSell ASAP - these won't automatically transfer across, but all assets created in the new version will be saved in your Projects area going forward.
Why are we doing this?
At AudioStack, we try to avoid breaking changes wherever possible. We first released the new version of SonicSell and announced this deprecation back in June to give users time to move over to the new version.
In the old version of SonicSell, all of your sessions were stored locally in your browser. This meant that if you cleared the cache or changed device, your history wouldn't be available. In the new version, we introduced a change which means that all ads created in SonicSell will automatically save to your Projects area, making it easier to manage your work. Unfortunately, this change means that sessions from the old version of the app are not compatible with the new version. However, if you have an asset you'd like to have recreated in your Projects area, we can help you with this.
Any questions? Contact us at support [at] audiostack [dot] ai.
In the latest AudioStack Platform release, we've made several improvements to enhance users' experiences.
Workshop:
Fixed a bug with default project names and added validation to project names.
Improved the UX of length selection for audio assets, making it easier to understand how long your asset will be. Use the "match script" button to create an asset based on your script's length.
Projects (formerly known as Library):
Renamed Library to Projects based on user feedback
Split up Audioforms that contain multiple versions - so users can choose which version of a session they want to edit
Fixed a bug with sorting behaviour
Files (formerly known as Resources/Content):
Renamed Resources/Content to Files and moved this page to the Essentials area to make it easier to find
🛠️ We are thrilled to introduce AutoFix, a powerful new feature designed to enhance the quality of Text-to-Speech (TTS) output by automatically detecting and fixing hallucinations and artifacts generated by TTS models across all 15 providers in AudioStack.
Why use AutoFix?
As we continue to onboard more niche TTS providers, some models may generate unwanted audio hallucinations or artifacts due to rapid development and technical limitations.
Traditionally, users have had to manually review and regenerate these problematic assets, which is both time-consuming and cumbersome. AutoFix streamlines this process by automatically detecting and regenerating faulty assets, saving you time and ensuring high-quality audio outputs.
Examples of Issues AutoFix Resolves:
AutoFix eliminates various types of hallucinations and artifacts, such as 👻 ghostly sounds, 🧟 distorted voices, and 🚌 background noise. Listen to some examples here
How Does AutoFix Work?
Quality Scoring: AutoFix evaluates TTS assets for speech quality and background noise using an internal scoring system.
Automatic Regeneration: If an asset is flagged for hallucinations or artifacts, AutoFix automatically regenerates it, ensuring cleaner, higher-quality output.
Consistent Results: This automated process reduces manual quality assurance and improves the reliability of your TTS assets.
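The detect-and-regenerate flow described above can be pictured with a small sketch. This is not AudioStack's implementation: the internal scoring system is not public, so `generate`, `score`, the `threshold` value, and the `max_attempts` cap are all hypothetical stand-ins chosen purely for illustration.

```python
from typing import Callable, Tuple, TypeVar

Asset = TypeVar("Asset")


def regenerate_until_clean(
    generate: Callable[[], Asset],
    score: Callable[[Asset], float],
    threshold: float = 0.8,   # hypothetical quality cut-off
    max_attempts: int = 3,    # hypothetical cap on regenerations
) -> Tuple[Asset, float, int]:
    """Generate an asset, score it, and regenerate while it is flagged.

    Returns the best-scoring asset seen, its score, and the number of
    generation attempts that were made.
    """
    if max_attempts < 1:
        raise ValueError("max_attempts must be at least 1")

    best_asset = generate()
    best_score = score(best_asset)
    attempts = 1

    # An asset counts as "flagged" when its score is below the threshold.
    while best_score < threshold and attempts < max_attempts:
        candidate = generate()
        candidate_score = score(candidate)
        attempts += 1
        # Keep whichever attempt scored highest so far.
        if candidate_score > best_score:
            best_asset, best_score = candidate, candidate_score

    return best_asset, best_score, attempts
```

Capping the number of attempts keeps the loop bounded even when every attempt is flagged, in which case the sketch returns the best-scoring attempt rather than nothing.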
Pricing
AutoFix is available at a cost of 💰5 production credits per minute of audio. You can easily activate AutoFix by setting useAutofix = True in your API call.
While AutoFix significantly reduces artifacts in generated assets, we strongly encourage users to manually review the quality of all assets before publishing.
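As a rough illustration of the pricing and the `useAutofix` flag described above, the sketch below estimates the AutoFix cost for a clip and builds a request body with the flag switched on. Only the rate of 5 production credits per minute and the `useAutofix` name come from these notes; the pro-rata (unrounded) billing, the helper names, and the `scriptId` and `voice` fields are assumptions for illustration, so check the API reference for the real request shape.

```python
AUTOFIX_CREDITS_PER_MINUTE = 5  # rate stated in the release notes


def estimate_autofix_credits(duration_seconds: float) -> float:
    """Estimate AutoFix cost, assuming simple pro-rata billing.

    The actual billing granularity (for example, rounding up to whole
    minutes) is not stated in the notes, so this is only an estimate.
    """
    if duration_seconds < 0:
        raise ValueError("duration_seconds must be non-negative")
    return AUTOFIX_CREDITS_PER_MINUTE * duration_seconds / 60.0


def build_tts_request(script_id: str, voice: str, use_autofix: bool = False) -> dict:
    """Build a hypothetical TTS request body.

    "scriptId" and "voice" are placeholder field names; only
    "useAutofix" is taken from the release notes.
    """
    return {
        "scriptId": script_id,
        "voice": voice,
        "useAutofix": use_autofix,
    }


if __name__ == "__main__":
    payload = build_tts_request("my-script-id", "my-voice", use_autofix=True)
    print(payload)
    print(estimate_autofix_credits(90), "credits for a 90-second asset")
```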
🗣️STS (Speech to Speech) Voices Update
We have just added more STS voices to our library, bringing the total number of voices that support STS to an astonishing 287 🤯.
This makes 🦜 AudioStack's STS library the biggest and most diverse. STS is delivering incredible value to creative customers by producing lifelike speech with unparalleled naturalness!
📘
One thing to note:
The accent of the source speaker will be transferred to the resulting STS asset, so it is recommended to choose voices that are closer to your accent for the speech transfer (e.g. if your accent is American, choose a voice whose accent is tagged as American).
🎙️ More Azure Multilingual Voices!
We’re excited to announce that our Text-to-Speech (TTS) library just got an upgrade! 🌍🎙️
We’ve added 22 new multilingual voices that speak 50+ languages 🥳 to the library, making it more diverse and flexible for all your projects! 🗣️✨ Whether you need voices for different languages, accents, or styles, we’ve got you covered.
Try them out and let us know what you think! 😄 Here are their aliases:
In the last release of the AudioStack Platform, we added lots of new functionality.
SonicSell:
You can now specify the accent you'd like a voice to have. Simply select the language of your script and choose an appropriate accent from the dropdown.
We added the audioform ID (used to identify your asset) to all ad cards, to make it easier to work out if you're editing the right version.
Recording Booth:
Made it easier to record without a script, and added a "Save As" button to make it easier to find your recorded files.
Platform:
Report an Issue with the click of a button. Our team will be notified so we can more easily troubleshoot issues.
Clarified the acceptable file names in our file upload modal, to improve the UX of file upload.
Speech Playground:
Added an option to customise the asset name so that when you share an asset, the recipient can tell what it is.
Developers:
We renamed the "AudioStack for Developers" page to "API Key", based on feedback, to make it easier to find your API key.
In this release, we have added small new features to several workflows as well as bug fixes and UX improvements across the platform.
Platform Updates
Workshop: You can now undo and redo changes.
Workshop: Improved UX of the asset length. This will either be autodetected, in which case the estimated length will be shown, or it can be selected by the user, in which case a character limit is displayed.
Workshop: Pronunciation tips have been added to help you to add expression to your TTS using punctuation.
Workshop and SonicSell: The customise button has been changed to Advanced Controls and repositioned to make its function clearer.
Workshop and SonicSell: Increased the amount of default fade out time on audio ads based on customer feedback.
Voice Library: We added tags to voice cards to make it clearer which voices can be used for Speech-to-speech.
Recording Booth: Made it possible to easily view your saved file when recording is completed.
Platform: We added a button to easily report when something has gone wrong in your session. You can use this to report bugs directly to the development team.
Bug Fixes 🐛
Resources/Content: Fixed overflow of the private sound table in the Sound Library modal.