Project
vgrep
Description
When text exceeds the context size limit, tokens are silently truncated without any warning to the user. This can lead to:
- Incomplete embeddings for long files/chunks
- Misleading search results
- User unaware that content was lost
Error Message
None - silent truncation.
Debug Logs
System Information
- Bounty Version: 0.1.0
- OS: Ubuntu 24.04 LTS
- Rust: 1.75+
Screenshots
No response
Steps to Reproduce
- Create a very long code file (>2048 tokens)
- Index the file:
vgrep index
- Tokens beyond
n_ctx are silently dropped
- Search only considers truncated content
Expected Behavior
- Log a warning when truncation occurs
- Track truncation statistics during indexing
- Option to fail on truncation instead of silently dropping
- Consider chunking strategy to avoid truncation
Actual Behavior
- Tokens silently dropped
- No indication to user
- No metrics on truncation rate
- Potentially misleading search results
Additional Context
No response
Project
vgrep
Description
When text exceeds the context size limit, tokens are silently truncated without any warning to the user. This can lead to:
Error Message
Debug Logs
System Information
Screenshots
No response
Steps to Reproduce
vgrep indexn_ctxare silently droppedExpected Behavior
Actual Behavior
Additional Context
No response