Skip to content

Cannot open model_weights.bin: No such file or directory #8

@mrichman

Description

@mrichman
cd metal_infer/
make
...
14 warnings generated.
./infer --prompt "Explain quantum computing" --tokens 100
[metal] Device: Apple M4 Pro
[metal] Shader compile: 93 ms
[metal] GPU attention buffers: 15 KV caches (16.8 MB each), scores buf 134.2 MB
[metal] Delta-net GPU buffers: 45 layers (195.4 MB state + 0.2 MB scratch)
[metal] Inference pipelines ready (multi-expert[8] + shared buffers allocated)
=== Qwen3.5-397B-A17B Metal Inference Engine ===
Model:    /Users/danielwoods/.cache/huggingface/hub/models--mlx-community--Qwen3.5-397B-A17B-4bit/snapshots/39159bd8aa74f5c8446d2b2dc584f62bb51cb0d3
Weights:  model_weights.bin
Manifest: model_weights.json
Vocab:    vocab.bin
K:        4 experts/layer
Quant:    4-bit experts (7077888 bytes each)
Linear:   fused GPU delta-net
Tokens:   100
Cache:    0 entries (disabled)
ERROR: Cannot open model_weights.bin: No such file or directory
ERROR: Failed to load weights

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions