StoryToolkitAI v0.23.1 - stories and speakers alpha
What's new in this release
Main new features:
- Story Editor
make selections from your transcripts to create timelines
that can be imported into your editing or screenwriting software - Speaker Detection
detect speaker changes in audio either while ingesting or directly from the transcription window - Meta Transcription Segments
add speakers, notes, etc. to your transcripts using SHIFT+ENTER while editing a transcript - Major improvements to the Assistant
including the ability to provide new Transcriptions or Stories as responses
A few highlights:
- Export Story Editor content as EDL, XML or Fountain for editing in Resolve, Premiere, Final Cut Pro, Avid, etc.
- Ability to request transcription and story responses from the Assistant using the
[t]
or[st]
keywords - Ability to create new transcriptions, groups and stories based on Assistant responses
- Ability to change the models and settings used for the Assistant
- Ability to edit the Assistant chat history by enabling or disabling messages via right click context menu
- Ability to reuse Assistant prompts via right click context menu
- Ability to copy the Assistant prompts and conversations to clipboard via right click context menu
- Support for storytoolkit.ai Assistant Models
- Automatic Speaker Change Detection via Ingest or from the Transcription Window
- Ability to add transcript meta segments, such as speaker, notes, etc.
- Find and Replace for Transcriptions via Find window
- Improved handling of timecodes for EDL, XML, AVID DS and Fusion Comp exports
Download links
This update is available early for Frequent Users only, to download see this post.
Everyone else can still download version 0.19.5 from here.
Help us make a better tool!
This tool and its code is free and open source.
If you find the tool useful, please consider becoming a patron.
This way, we can speed up development to work on cool features that you might need and keep up with rapid advancements in AI.
It also helps if you report any potential bugs or problems using the Issues tab here on Github: https://github.com/octimot/StoryToolkitAI/issues
Installation
macOS
Requirements
macOS 12.6 Monterey or newer
Intel or arm64 (M1, M2 etc.)
8 GB RAM
Installation Steps
Download the zip file that fits with your platform.
Unzip, copy the app file into your Applications folder, then double-click and it should start.
Please also read Info For All Platforms below.
Unsafe warnings
If you get something like "StoryToolkitAI is damaged and can't be opened", try to right click and then click open in the dropdown menu. It will warn you that it's unsafe, but that's just because we haven't notarized the tool with Apple.
If you still have problems opening it, try to take it out of quarantine: open up Terminal and execute:
xattr -d com.apple.quarantine ./StoryToolkitAI.0.23.1.arm64.app
(or x86_64 if Intel).
More details on the issue here.
Windows
Requirements
Windows 10 or newer
Installation
Just follow the installer to get the tool on your machine.
Notes for Windows
Do not install this version of the tool in Program Files unless you know what you're doing. Some users have experienced permission issues which will make some features unusable.
If the tool returns errors or would not start after updating an older version, we recommend completely uninstalling and re-installing before reporting the issue.
If you get an error saying something like "Microsoft Defender SmartScreen prevented an unrecognized app from starting...", just press More info
and Run anyway
.
If the Speaker Detection process fails the first time you run it, it's most likely due to permission errors. So, close the app, right click and Run as administrator
so that the tool can download the model into the cache folder. Once the first detection is performed, you should be able to run the tool with your normal user next time.
Also, see below:
Info for all platforms
You don't need to have Python installed on your machine for the standalone version to work, only if you want to use the Resolve API with the tool.
Download Permissions
If you get something like PermissionError: [Errno 13] Permission denied: '/Users/[your user]/.cache/torch/hub/trusted_list'
, it's most likely because the new standalone version uses different ssl certificates (issue #77) , so it's best if you delete the old hub cache, like this:
On macOS
Open terminal
, and execute
rm -rf /Users/USERNAME/.cache/hub
rm -rf /Users/USERNAME/.cache/torch
rm -rf /Users/USERNAME/.cache/whisper
- replace USERNAME with your MacOS user
On Windows
Open CMD, and execute
rmdir /s /q C:\Users\USERNAME\.cache\hub
rmdir /s /q C:\Users\USERNAME\.cache\torch
rmdir /s /q C:\Users\USERNAME\.cache\whisper
- replace USERNAME with your Windows user
Keep in mind that if you do this, the first time you transcribe or search something it will need to re-download the models, so it will take a bit longer.
Make sure Resolve scripting is enabled
The tool only works with the Studio version of Davinci Resolve.
Davinci Resolve Preferences -> General, "External Scripting using", set to: Local