Skip to content

StoryToolkitAI v0.23.2 - stories and speakers beta

Latest
Compare
Choose a tag to compare
@octimot octimot released this 26 Jan 12:55
· 62 commits to main since this release
1a20491

What's new in this release

Main new features:

  • Story Editor
    make selections from your transcripts to create timelines
    that can be imported into your editing or screenwriting software
  • Speaker Detection
    detect speaker changes in audio either while ingesting or directly from the transcription window
  • Meta Transcription Segments
    add speakers, notes, etc. to your transcripts using SHIFT+ENTER while editing a transcript
  • Major improvements to the Assistant
    including the ability to provide new Transcriptions or Stories as responses

A few highlights:

  • Export Story Editor content as EDL, XML or Fountain for editing in Resolve, Premiere, Final Cut Pro, Avid, etc.
  • Ability to request transcription and story responses from the Assistant using the [t] or [st] keywords
  • Ability to create new transcriptions, groups and stories based on Assistant responses
  • Ability to change the models and settings used for the Assistant
  • Ability to edit the Assistant chat history by enabling or disabling messages via right click context menu
  • Ability to reuse Assistant prompts via right click context menu
  • Ability to copy the Assistant prompts and conversations to clipboard via right click context menu
  • Support for storytoolkit.ai Assistant Models
  • Automatic Speaker Change Detection via Ingest or from the Transcription Window
  • Ability to add transcript meta segments, such as speaker, notes, etc.
  • Find and Replace for Transcriptions via Find window
  • Improved handling of timecodes for EDL, XML, AVID DS and Fusion Comp exports

See CHANGELOG.md for more info.

Download links

This update is available early for Frequent Users only, to download see this post.

Everyone else can still download version 0.19.5 from here.

Help us make a better tool!

This tool and its code is free and open source.

If you find the tool useful, please consider becoming a patron.

This way, we can speed up development to work on cool features that you might need and keep up with rapid advancements in AI.

It also helps if you report any potential bugs or problems using the Issues tab here on Github: https://github.com/octimot/StoryToolkitAI/issues

Installation

macOS

Requirements

macOS 12.6 Monterey or newer
Intel or arm64 (M1, M2 etc.)
8 GB RAM

Installation Steps

Download the zip file that fits with your platform.
Unzip, copy the app file into your Applications folder, then double-click and it should start.

Please also read Info For All Platforms below.

Unsafe warnings

If you get something like "StoryToolkitAI is damaged and can't be opened", try to right click and then click open in the dropdown menu. It will warn you that it's unsafe, but that's just because we haven't notarized the tool with Apple.

If you still have problems opening it, try to take it out of quarantine: open up Terminal and execute:
xattr -d com.apple.quarantine ./StoryToolkitAI.0.23.1.arm64.app (or x86_64 if Intel).
More details on the issue here.

Windows

Requirements

Windows 10 or newer

Installation

Just follow the installer to get the tool on your machine.

Notes for Windows

Do not install this version of the tool in Program Files unless you know what you're doing. Some users have experienced permission issues which will make some features unusable.

If the tool returns errors or would not start after updating an older version, we recommend completely uninstalling and re-installing before reporting the issue.

If you get an error saying something like "Microsoft Defender SmartScreen prevented an unrecognized app from starting...", just press More info and Run anyway.

If the Speaker Detection process fails the first time you run it, it's most likely due to permission errors. So, close the app, right click and Run as administrator so that the tool can download the model into the cache folder. Once the first detection is performed, you should be able to run the tool with your normal user next time.

Also, see below:

Info for all platforms

You don't need to have Python installed on your machine for the standalone version to work, only if you want to use the Resolve API with the tool.

Download Permissions

If you get something like PermissionError: [Errno 13] Permission denied: '/Users/[your user]/.cache/torch/hub/trusted_list', it's most likely because the new standalone version uses different ssl certificates (issue #77) , so it's best if you delete the old hub cache, like this:

On macOS

Open terminal, and execute

rm -rf /Users/USERNAME/.cache/hub
rm -rf /Users/USERNAME/.cache/torch
rm -rf /Users/USERNAME/.cache/whisper
  • replace USERNAME with your MacOS user

On Windows

Open CMD, and execute

rmdir /s /q C:\Users\USERNAME\.cache\hub
rmdir /s /q C:\Users\USERNAME\.cache\torch
rmdir /s /q C:\Users\USERNAME\.cache\whisper
  • replace USERNAME with your Windows user

Keep in mind that if you do this, the first time you transcribe or search something it will need to re-download the models, so it will take a bit longer.

Make sure Resolve scripting is enabled

The tool only works with the Studio version of Davinci Resolve.

Davinci Resolve Preferences -> General, "External Scripting using", set to: Local