forked from ggml-org/llama.cpp
Open
Description
Hey! I've noticed something a bit weird with the context size. It seems to be running 128 tokens larger than what I'm setting, even when I use the --noshift and --nofastforward flags. This isn't happening with llama.cpp, so I thought I'd bring it up.

--contextsize 8192 --noshift --nofastforward
llama_context: n_ctx_per_seq (8320) < n_ctx_train (131072) -- the full capacity of the model will not be utilized
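For what it's worth, the log is consistent with the observation: the reported n_ctx_per_seq is exactly 128 tokens above the requested value. A minimal sketch of the arithmetic (the variable names here are just for illustration, not from the codebase):

```python
requested = 8192   # value passed via --contextsize
reported = 8320    # n_ctx_per_seq printed by llama_context
extra = reported - requested
print(extra)  # 128 tokens allocated beyond the requested size
```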

The GUI just shows the value I set. Any chance I can actually run the model with just the context size I'm specifying? I'm guessing it might be something to do with context shifting?
Thanks!