-
Notifications
You must be signed in to change notification settings - Fork 381
Cannot open model_weights.bin: No such file or directory #8
Copy link
Copy link
Open
Description
cd metal_infer/
make
...
14 warnings generated.
./infer --prompt "Explain quantum computing" --tokens 100
[metal] Device: Apple M4 Pro
[metal] Shader compile: 93 ms
[metal] GPU attention buffers: 15 KV caches (16.8 MB each), scores buf 134.2 MB
[metal] Delta-net GPU buffers: 45 layers (195.4 MB state + 0.2 MB scratch)
[metal] Inference pipelines ready (multi-expert[8] + shared buffers allocated)
=== Qwen3.5-397B-A17B Metal Inference Engine ===
Model: /Users/danielwoods/.cache/huggingface/hub/models--mlx-community--Qwen3.5-397B-A17B-4bit/snapshots/39159bd8aa74f5c8446d2b2dc584f62bb51cb0d3
Weights: model_weights.bin
Manifest: model_weights.json
Vocab: vocab.bin
K: 4 experts/layer
Quant: 4-bit experts (7077888 bytes each)
Linear: fused GPU delta-net
Tokens: 100
Cache: 0 entries (disabled)
ERROR: Cannot open model_weights.bin: No such file or directory
ERROR: Failed to load weights
Reactions are currently unavailable
Metadata
Metadata
Assignees
Labels
No labels