Skip to content

Commit

Permalink
[Bugfix] Fix assertion in NeuronExecutor (vllm-project#5841)
Browse files Browse the repository at this point in the history
  • Loading branch information
aws-patlange authored and prashantgupta24 committed Jun 28, 2024
1 parent 1c02c31 commit b18f8f4
Showing 1 changed file with 3 additions and 3 deletions.
6 changes: 3 additions & 3 deletions vllm/executor/neuron_executor.py
Original file line number Diff line number Diff line change
Expand Up @@ -48,9 +48,9 @@ def initialize_cache(self, num_gpu_blocks: int,
def execute_model(
self,
execute_model_req: ExecuteModelRequest) -> List[SamplerOutput]:
assert (execute_model_req.blocks_to_swap_in == {}
and execute_model_req.blocks_to_swap_out == {}
and execute_model_req.blocks_to_copy == {}), (
assert (not execute_model_req.blocks_to_swap_in
and not execute_model_req.blocks_to_swap_out
and not execute_model_req.blocks_to_copy), (
"Cache operations are not supported for Neuron backend.")
assert execute_model_req.num_lookahead_slots == 0, (
"lookahead not supported for Neuron backend.")
Expand Down

0 comments on commit b18f8f4

Please sign in to comment.