Overview

It would be useful to add the ability for neural to get the token count for a given input. This would help prevent initiating requests that accidentally exceed the maximum token count for a given model source.

This would also be useful when we want to extract the longest possible response from a model, via:

`request_token_num = model_max_token_len - context_tokens_len`
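A minimal sketch of that calculation, assuming the MIT-licensed tiktoken tokenizer and an illustrative per-model limit table (neither is part of neural today):

```python
# Minimal sketch, assuming tiktoken (MIT-licensed) as the tokenizer.
# The limit table below is illustrative, not neural's configuration.
import tiktoken

MODEL_MAX_TOKEN_LEN = {"gpt-3.5-turbo": 4096}  # assumed limits

def request_token_budget(context: str, model: str) -> int:
    """Return the largest response length we could request for this context."""
    encoding = tiktoken.encoding_for_model(model)
    context_tokens_len = len(encoding.encode(context))
    return MODEL_MAX_TOKEN_LEN[model] - context_tokens_len
```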
Implementation
- The tokenizer should be appropriate for the respective model.
- We should use an open-source (ideally MIT-licensed) tokenizer that we can bundle, so that no additional dependencies need to be installed; a sketch follows below.
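As one sketch of per-model selection, tiktoken (MIT-licensed) maps model names to their encodings; treating it as the bundled tokenizer, and falling back to a general-purpose encoding for unknown models, are both assumptions here:

```python
import tiktoken

def count_tokens(text: str, model: str) -> int:
    """Count tokens using the encoding appropriate for the given model."""
    try:
        encoding = tiktoken.encoding_for_model(model)
    except KeyError:
        # Unknown model: fall back to a general-purpose encoding (assumption).
        encoding = tiktoken.get_encoding("cl100k_base")
    return len(encoding.encode(text))

# Example: check a prompt's size before sending a request.
print(count_tokens("Explain this function.", "gpt-3.5-turbo"))
```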