
[BUG] Llama Context Recreated for Every Batch #159

@olddev94

Project

vgrep

Description

The embed_batch() function creates a new LlamaContext for every batch of texts it embeds. This is expensive, since context creation involves memory allocation and initialization. The context should be created once and reused across batches.
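A minimal Rust sketch of the pattern described above. The names `embed_batch` and `LlamaContext` come from the issue; the struct internals and the placeholder `embed` method are stand-ins, not the real llama.cpp bindings:

```rust
// Stand-in for the real LlamaContext; construction is assumed expensive.
struct LlamaContext {
    n_threads: usize,
}

impl LlamaContext {
    fn new(n_threads: usize) -> Self {
        // In the real bindings this allocates and initializes model state.
        LlamaContext { n_threads }
    }

    fn embed(&self, text: &str) -> Vec<f32> {
        // Placeholder embedding; the real code runs the model.
        vec![text.len() as f32; 4]
    }
}

// Current (problematic) shape: a fresh context per batch,
// and the thread count re-queried from the OS on every call.
fn embed_batch(texts: &[&str]) -> Vec<Vec<f32>> {
    let n_threads = std::thread::available_parallelism()
        .map(|n| n.get())
        .unwrap_or(1);
    let ctx = LlamaContext::new(n_threads); // recreated every call
    texts.iter().map(|t| ctx.embed(t)).collect()
}
```

The per-call `LlamaContext::new` and `available_parallelism` query are the overhead this issue is about.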

Error Message

None; this is a performance issue, not a crash.

Debug Logs

No response

System Information

- Bounty Version: 0.1.0
- OS: Ubuntu 24.04 LTS
- Rust: 1.75+

Screenshots

No response

Steps to Reproduce

  1. Index a large project with many files
  2. Observe that embedding is slower than expected
  3. Each batch call recreates the context

Expected Behavior

  1. Create context once during EmbeddingEngine::new()
  2. Reuse context for all embedding operations
  3. Use configured n_threads instead of recalculating
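The expected behavior above can be sketched as follows. `EmbeddingEngine`, `LlamaContext`, and `n_threads` are names from the issue; the struct internals are stand-ins for illustration, assuming the context can be shared immutably (if the real context needs mutable access, it would be wrapped in a `Mutex` or similar):

```rust
// Stand-in for the real LlamaContext; construction is assumed expensive.
struct LlamaContext {
    n_threads: usize,
}

impl LlamaContext {
    fn new(n_threads: usize) -> Self {
        LlamaContext { n_threads }
    }

    fn embed(&self, text: &str) -> Vec<f32> {
        // Placeholder embedding; the real code runs the model.
        vec![text.len() as f32; 4]
    }
}

// Proposed shape: the engine owns one context, created once in new()
// with the configured thread count, and reuses it for every batch.
struct EmbeddingEngine {
    ctx: LlamaContext,
}

impl EmbeddingEngine {
    fn new(n_threads: usize) -> Self {
        EmbeddingEngine {
            ctx: LlamaContext::new(n_threads), // created exactly once
        }
    }

    fn embed_batch(&self, texts: &[&str]) -> Vec<Vec<f32>> {
        // No per-call allocation or OS query; reuse the cached context.
        texts.iter().map(|t| self.ctx.embed(t)).collect()
    }
}
```

With this shape, many small batches pay the context-creation cost only once, at engine construction.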

Actual Behavior

  1. Every embed_batch() call creates a new context
  2. Thread count is queried from OS every time
  3. Significant overhead for many small batches

Additional Context

No response

Metadata

Assignees

No one assigned

    Labels

    bug (Something isn't working)
    ide (Issues related to IDE)
    invalid (This doesn't seem right)
    vgrep

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests