AudioStack Python SDK 2.0.0 released:

  • Removed support for end-of-life Python versions 😀
  • Fixed bug - Added the missing required argument for passing a project to content.list_modules 🐛
  • Fixed bug - Added missing x-assume-org header to File.download_url
  • Added linting, type hints and code formatting to CI/CD to make the code easier to read for our users 💯
  • Added Poetry package management to align with our best engineering practices - WE LOVE POETRY! 🤓
  • You can install this with pip install -U audiostack 🚀
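
After upgrading, a quick check can confirm that your environment picked up the new major version. The sketch below is only an illustration: the helper names parse_version and is_at_least are ours, not part of the SDK, and the comparison is deliberately simple (a pre-release such as 2.0.0rc1 is treated as 2.0.0).

```python
from importlib import metadata
from itertools import takewhile


def parse_version(text):
    """Turn a version string such as '2.0.0' into a tuple of ints, e.g. (2, 0, 0)."""
    numbers = []
    for piece in text.split("."):
        # Keep only the leading digits, so '0rc1' counts as 0.
        leading = "".join(takewhile(str.isdigit, piece))
        if not leading:
            break
        numbers.append(int(leading))
    return tuple(numbers)


def is_at_least(installed, minimum):
    """Return True if `installed` is the same as or newer than `minimum`."""
    a, b = parse_version(installed), parse_version(minimum)
    width = max(len(a), len(b))
    # Pad with zeros so that '2.0' and '2.0.0' compare as equal.
    a += (0,) * (width - len(a))
    b += (0,) * (width - len(b))
    return a >= b


try:
    installed = metadata.version("audiostack")
except metadata.PackageNotFoundError:
    print("audiostack is not installed")
else:
    status = "OK" if is_at_least(installed, "2.0.0") else "older than 2.0.0"
    print(f"audiostack {installed}: {status}")
```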

Docs Docs Docs

We're constantly working on enhancing our docs based on customer feedback 🚀

We've just added these two examples this week.

Broadcasting your content

https://docs.audiostack.ai/docs/broadcasting-your-content

Delivery release 2.3.1:

  • Adding an endpoint to purchase broadcasting rights for an audioform

  • As soon as audioform is stable and released to users, they can get a license and be sure to have the broadcasting rights for a produced asset

New voice provider Respeecher 🇺🇦

AudioForm v1.1.0

Audioform expresses audio production in a JSON format and includes the definition of content, speech, sound design, and production elements. It is a novel approach to audio production and offers a number of advantages:

  • Leverages the most recent developments in AI audio creation
  • Makes audio production scalable
  • A way to "build" audio that is developer-friendly
  • A standardised way to facilitate exchange across platforms

There's still a lot to improve, but this is the first iteration.

  • Added a TTS reduce endpoint to the SonicSell flow in Audioform, so the TTS in v6 always fits within the ad's duration.
  • We've also added it to Audio Playground.
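
To give a feel for the "fit the ad's duration" problem, here is a rough sketch of how you might estimate whether a script fits a slot before sending it to TTS. This is not how the reduce endpoint works internally; the 150 words-per-minute speaking rate and both function names are assumptions made purely for illustration.

```python
def estimate_speech_seconds(text, words_per_minute=150.0):
    """Estimate spoken duration from the word count (assumed speaking rate)."""
    words = len(text.split())
    return words * 60.0 / words_per_minute


def speed_factor_to_fit(text, max_seconds, words_per_minute=150.0):
    """Return the speed-up needed to fit `max_seconds` (1.0 means it already fits)."""
    if max_seconds <= 0:
        raise ValueError("max_seconds must be positive")
    estimated = estimate_speech_seconds(text, words_per_minute)
    return max(1.0, estimated / max_seconds)


script = "word " * 150  # a 150-word script, about one minute at 150 wpm
print(estimate_speech_seconds(script))
print(speed_factor_to_fit(script, max_seconds=30))
```

A factor well above 1.0 usually means the copy should be shortened rather than sped up.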

JS SDK:

  • Added pipeline polling to JS SDK and updated Suite types (related to handling pipeline statuses in Platform that will be released later)
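
For readers new to pipeline polling, the general pattern looks like the sketch below. It is written in Python for illustration only and is not the JS SDK's implementation; the status names ("pending", "completed", "failed") and the fetch_status callback are hypothetical stand-ins for whatever your client returns.

```python
import time


def poll_until_done(fetch_status, interval_seconds=1.0, timeout_seconds=60.0,
                    done_states=("completed", "failed")):
    """Call `fetch_status` repeatedly until it returns a terminal state."""
    deadline = time.monotonic() + timeout_seconds
    while True:
        status = fetch_status()
        if status in done_states:
            return status
        if time.monotonic() >= deadline:
            raise TimeoutError(f"pipeline still '{status}' after {timeout_seconds}s")
        time.sleep(interval_seconds)


# Simulated pipeline that finishes on the third check.
fake_statuses = iter(["pending", "pending", "completed"])
print(poll_until_done(lambda: next(fake_statuses), interval_seconds=0))
```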

Speech to Speech

ms-sts is now live in prod with 3 new customer-facing endpoints:

  • get_voices

  • post_sts

  • get_sts

The first lists the voices available for STS. The others are used to start an STS pipeline and get its status, using our internal mono-audio-suite.
In this release, 11labs public and private voices are available, and files up to 50 MB are supported. Documentation is available here:
https://docs.audiostack.ai/reference/getspeechtospeechvoices
https://docs.audiostack.ai/reference/postspeechtospeech
https://docs.audiostack.ai/reference/getspeechtospeechpipeline
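
Because of the 50 MB limit, it can be worth checking file sizes locally before starting an STS pipeline. A minimal sketch follows; the helper names are ours, and the limit is interpreted conservatively as 50,000,000 bytes, which is an assumption (the release does not say whether MB means 10^6 or 2^20 bytes).

```python
import os

# Assumption: "50 MB" read as decimal megabytes, the stricter interpretation.
MAX_STS_BYTES = 50_000_000


def within_sts_limit(size_bytes, limit=MAX_STS_BYTES):
    """Return True if a non-empty file of `size_bytes` is within the limit."""
    return 0 < size_bytes <= limit


def file_within_sts_limit(path):
    """Check a file on disk against the limit."""
    return within_sts_limit(os.path.getsize(path))


print(within_sts_limit(12_345_678))
print(within_sts_limit(75_000_000))
```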

https://docs.audiostack.ai/docs/examples-production

New platform release

  • We added new developer plans
  • Voice library spatial navigation
  • We made improvements to SonicSell V6 to set us up for the future.
  • We added an endpoint to generate an Audioform template.
  • Only successfully produced ads are now saved
  • 🐛 Bug fix for Voice Library cards returning a 403 error
  • Improved our billing updates for a better billing experience
  • We added enhanced voice permissions for private voices 🔒
  • We added some more examples to the docs, so you should be able to get up and running faster: https://docs.audiostack.ai/docs/examples
  • We added the ability to handle media files on top of sound templates, so you can now upload your own media or speech files and overlay them on top of sound templates
  • We improved several error messages throughout the platform, leading to a better user experience
  • Added error feedback when users upload a UserPicture file that is larger than 1 MB or is not a JPG/PNG ⚠️
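
If you upload profile pictures programmatically, the same rules can be checked on your side first. The sketch below is an assumption-laden illustration, not the Platform's own validation: the function name and error texts are made up, and 1 MB is taken to mean 1,000,000 bytes.

```python
import os

MAX_PICTURE_BYTES = 1_000_000  # assumption: 1 MB read as decimal megabytes
ALLOWED_EXTENSIONS = {".jpg", ".jpeg", ".png"}


def validate_user_picture(filename, size_bytes):
    """Return a list of human-readable problems (empty if the file looks fine)."""
    errors = []
    extension = os.path.splitext(filename)[1].lower()
    if extension not in ALLOWED_EXTENSIONS:
        errors.append("File must be a JPG or PNG image.")
    if size_bytes > MAX_PICTURE_BYTES:
        errors.append("File must be 1 MB or smaller.")
    return errors


print(validate_user_picture("avatar.png", 200_000))
print(validate_user_picture("avatar.gif", 2_000_000))
```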

Better media experience

  • We added startPadding and endPadding in section properties for better control of your production.

Here's an example of it being used:

import audiostack

# `speech` is assumed to be a speech item created earlier in your script.
# Per-section properties, including the new startPadding.
SECTION_PROPERTIES = {
    "section1": {"startPadding": 2.0, "endAt": 5.0, "alignment": "left"},
    "section2": {"startPadding": 2.0, "endAt": 10.0, "alignment": "left"},
    "section3": {"endAt": 40, "alignment": "left"},
}

print("Creating your mix...")
mix = audiostack.Production.Mix.create(
    speechItem=speech,
    soundTemplate="example_sound_template",
    timelineProperties={"fadeIn": 0, "fadeOut": 0},
    masteringPreset="balanced",
    sectionProperties=SECTION_PROPERTIES,
)

You can see more docs at Advanced Timing Parameters

We made a bunch of improvements to the platform.

Website update 💯

We made some big changes to our website, improving our design and adding some great new pages. Thanks to the team who worked on this.

Bug fixes and improvements

  • Updated file regexes and added decodeURIComponent
  • Fixed the error when getting colors from the org logo
  • Aesthetic improvements to the sign-up flow
  • Added error info when uploading an Org Profile Picture
  • Platform Dashboard redesign
  • Beta release of SonicSell V6 (enabled for ReleaseRadar only)
  • The Audioform/sonicsell endpoint now supports creating more than one asset. Exposed through versions to create
  • Updated the TTS endpoint to support maxLength per section. This will be used in the SonicSell Audioform endpoints.

Docs improvements

Reducing bias in our models and systems

One of the challenges with AI models is bias. This is well documented and has been discussed elsewhere, like here.
We shipped significant changes by applying better guard rails and using our proprietary AI guard rails.

These changes lead to:

  • more gender diversity (before: ~1/3 male voices, now: ~50% male/50% female)
  • fewer gender mismatches (avoiding e.g. “Hi, I’m Sarah…” spoken by a male voice)
  • less gender-biased outcomes (e.g. ads for cleaning products have the same chance of a male or female voice)

Improvements and enhancements

🚢 Example files for new/existing organisations 🚀

  • By default, an example folder (audiostack_examples) with media files and assets is created when a new organisation is created. This means you can get started sooner.
  • Suite and recommendation endpoints added to the JS SDK
  • Shipped some modifications to the speed calculation for Spanish ads in SonicSell, which improves ads that were too slow. 🇪🇸
  • We added attribution in the Library to http://generated.photos, who provide the generated images for our voices.

Beta feature

📘

Beta feature

Please contact support[at]audiostack[dot]ai if you want to be added to this beta.

Whitelabel Support

We added a dynamically generated color banner based on org logo

We've been busy refreshing our sound templates to make our licensing simpler and ensure the content we offer is suited to your most common AudioStack use cases. As a result, we've added a whole bunch of new Sound Templates in both the API and platform, and will be removing outdated "demo" templates at the end of May.

🚧

If you're currently using "demo" sound templates either via the platform or the AudioStack API, you will need to switch to alternative sound designs.

If you're struggling to find something that meets your needs, please contact support [at] audiostack.ai.

37 Demo Sound Templates to be Removed ⚠️

Sunny Skies
Surfing Dog
Laid Back Funk
Newsflash
Ethereal Dream
Power Sports
Garage Banger
Vintage Swing
Trailer Beast
Friendly Electro Pop
Bullmarket
Relaxed Vacation
Dark And Unsettling
Wild Beast
Feeling Good
Friendly Fantasy
Runway Rhythm
Cool Industries
Chopper Horns
Hotwheels
Epic Brass Trap
Pulse
Piano Artist
Fascinating Technology
Clap Together
Chopper Strings
Rockabilly Mischief
Relaxmas
Epic Pace
Stepping Up
Welcoming Piano
Indie Disco
Ballsy Rock
Design Grove
Vinylhits
Pulsing Out Score
Rainbow Rock
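
To audit your own scripts or configs for templates on this list, something like the sketch below may help. It is only an illustration: it compares display names case-insensitively, and the identifiers you pass to the API may be spelled differently, so treat the matching rule as an assumption. Only the first few names are included here; extend the set with the full list above.

```python
# First few names from the removal list; extend with the remaining entries.
REMOVED_DEMO_TEMPLATES = {
    "Sunny Skies",
    "Surfing Dog",
    "Laid Back Funk",
    "Newsflash",
}


def find_removed_templates(used, removed=REMOVED_DEMO_TEMPLATES):
    """Return the entries of `used` that match a removed template name."""
    removed_normalised = {name.casefold() for name in removed}
    return [name for name in used if name.strip().casefold() in removed_normalised]


in_use = ["sunny skies", "Liberation", "Newsflash"]
print(find_removed_templates(in_use))
```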

95 New Sound Templates Added 🎉

Liberation
Rising High
Fashion Forward
The Cheer Is Here
New Happy Vibe
Circus
Sword of Omens
Beside the Sea
Technology - Electronic
Drive To The Dream
Calm Groove
Major Winner
Octagon
Free Radical
Growing Flair
Careful Approach
Sprightly
Darbuka Song
Road Blues
Hype City Jam
Golden Moments
In The Willows
The Palace
Road To Marsaille
Hand Clap Tune
World of Wonders
Funky Lounge
Percussion Stomps
The Epic Mind
Hip-Hop Fashion
Heroic Drama
Zen Garden
Love Is All I have Got
The Evolution
O yeah
Street Orchestra
Fuego Party
Spirit Momentum
Neon Phase
Dance All Night
Organic Timelapse
Are We Alone
Future Technology
Bike Riding
Italy Classic
On My Mind
Gracious Journey
For The Happy Times
Spring Day
Green Meadow
Hopeful Future
Soiree a Illange
Oktoberfest
Sunlight
Dancing Youth
Energy of Success
No Rush
Terrace Relaxing
Traveling Together
Void Echo
Still Crazy For You
Lull Me
Keys To Joy
Gold Digger
Golden Sunrays
Holiday Sprinkles
Perfect Grind
Breakbeat Horizon
Rock and Blues
Be strong
Aspire Time
Peaceful Rising
Jazz Night
Ignite the Night
Binaural Aquamarine
Groovy Vibe
Drive
Sunny Dance
Designer Background
The Snow Is Coming
Precious Event
La Ballerina
Glass Of Rum
Like A Dream
Ragtime Rhapsody
A Time To Shine
Cool Cascade
Spring Piano
Fancy Summer
Night Waltz
Grace Is Abound
Visible
Spa Resort
The Quiet Hours
Country Heartbeat

This week we worked hard on improving our SonicSell experience based on customer feedback. All these little changes every week help us move the product forward. 💯

We also shipped a major improvement to our copy generation and we shipped stem separation.

Stem Separation

Improvements and a lot of bug fixes

  • We shipped some major improvements to copy generation in SonicSell, so your advert should now have significantly fewer deviations. We also added retry logic and better examples through improved prompt engineering. 💯
  • 👍 Updated success/error toasts for file uploads
  • 👍 SonicSell header improvements
  • 👍 Removed the credit counter (users complained it was distracting)
  • "Powered by AudioStack" is now clearer
  • 🐛 Fixed workflow inconsistencies in the sidebar
  • 🐛 Added support for special characters in the files table
  • 🐛 Updated copywriting to enhance the user experience

Coming soon 🚧

  • We shipped some great stuff in the backend too, and we'll speak more about that next week 💯
  • We also invested in our ad-serving capabilities and analytics capabilities - please reach out if you want to know more about this.

This week, we've added a few UX improvements to make your experience of AudioStack Platform and workflows smoother.

New ad player in SonicSell

  • You can now give feedback (thumbs up or down) about the quality of ads in SonicSell, to help improve the content you hear in future.

Platform improvements

  • Added a new parrot to the favicon 🐦
  • New workflow cover images on Home page
  • New Navigation - easier to find your workflow

Files Page improvements

  • The file view is now its own page - this makes it easier to use on mobile and allows us to add exciting new features. 👀 Watch this space!

Voice Cloning improvements

  • You can now add your own alias to new instant-cloning voices, making them easier to find in your workflows.
  • Integrated the new voice builder into the platform, deprecating the previous one.

Denoise your noisy audio files

Docs improvements

We made some improvements to the onboarding experience 💯

Improvements

  • We added a zip downloader to the API for exporting zips ⚡

Platform

  • Users can join a Waitlist for the ‘Coming Soon’ Workflows
  • Implemented security headers in our Next.js apps, enhancing our security 🔒
  • Users can now brand the Platform with their logo - a much requested feature
  • Fixed: audio title encoding in links to the player didn’t support special characters 🐛
  • The voice library now shows the best voices at the top
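
If you build links with audio titles yourself, percent-encoding the title avoids the special-character problem mentioned above. This sketch uses Python's standard library; the base URL and the title query parameter are hypothetical and do not reflect the real player link format.

```python
from urllib.parse import quote, unquote


def player_link(base_url, title):
    """Build a link with the title percent-encoded (safe='' also encodes '/')."""
    return f"{base_url}?title={quote(title, safe='')}"


link = player_link("https://example.com/player", "Café & Bar #1")
print(link)
# Decoding the encoded part gives back the original title.
print(unquote("Caf%C3%A9%20%26%20Bar%20%231"))
```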

📘

Check out the library

You can check it out at our Library page!

SonicSell

  • AdCards now provide better error feedback for invalid input

SDK

  • Bug fixes in the voice cloning experience 🐛

Platform improvements

  • Sound Library now shows the sound segments that belong to each template
  • Sound Library page layout is now improved to have more consistent sizing
  • Added Denoise and Source separation functionalities to the File Edit modal
  • Clicking on the sidebar header now redirects to home
  • Updated link to contact about enterprise plans

SonicSell

  • SonicSell now catches bad product descriptions and errors appropriately.
  • SonicSell now calls /content/recommend/ to get better tone and mood selections via ms--text-tagger (ML endpoint fully integrated into a workflow and charging credits)
  • We enhanced our LLM integrations and improved the workflow

New Features

  • Source separation and better performance - you can now upload an audio file containing both speech and music, and separate the voice from the music to use in your workflows (such as voice cloning or sound template creation): https://docs.audiostack.ai/reference/separateaudio

New voice builder in production!

What's new?

  • Central service for all things voice cloning - easy integration of new VC providers
  • Possibility of cloning using multiple files
  • Fixes a bunch of issues we've had so far, e.g. timeouts and file errors
  • Adding more features (SSML, STS in API, fast synthesis)
  • Internal file processing to make sure the user's input matches the provider's requirements
  • Internal tool - a Python package useful for experiments and testing integrations

Improvements

  • Shipped some small improvements to file management, i.e. better validation of broken files and better performance 💯
  • Shipped the ability to use any file ID as a sound template in mastering 🎉
  • Improvements in the speed and reliability of AES 💯
  • mastering improvements for bugs with not file errors 💽