We have the plan and contact at vLLM team, but just could not find person to implement it, because it would involve considerable efforts and understanding of vLLM internals. So please let us know if you are interested and want to help integrate and do more research onwards.

I'm not familiar with ISPC, after I manually add typedef __fp16 data_t;in dtype.h, I get this error:

[ 33%] Building ISPC object CMakeFiles/pacpu-llama2_13b-tp1.dir/pacpu.ispc.o
Warning: No --target specified on command-line. Using default system target "avx2-i32x8".
/data//NEO/pacpu/dtype.h:12:16: Error: syntax error, unexpected identifier, expecting ',' or ';'.
typedef __fp16 data_t;
               ^^^^^^

/data//NEO/pacpu/pacpu.ispc:10:17: Warning: No type specified in declaration. Assuming int32.
  const uniform data_t q[], // [NUM_Q_HEADS, HEAD_DIM]
                ^^^^^^

/data//NEO/pacpu/pacpu.ispc:10:24: Error: syntax error, unexpected identifier, expecting ')'.
  const uniform data_t q[], // [NUM_Q_HEADS, HEAD_DIM]
                       ^

/data//NEO/pacpu/pacpu.ispc:16:22: Error: Undeclared symbol "seq_len".
  uniform int imax = seq_len / BLOCK_SIZE + 1;
                     ^^^^^^^

/data//NEO/pacpu/pacpu.ispc:18:26: Error: syntax error, unexpected '*', expecting ',' or ';'.
    const uniform data_t * k = k_cache +
                         ^

/data//NEO/pacpu/pacpu.ispc:24:34: Error: Undeclared symbol "seq_len".
      uniform int tmax = min(BLOCK_SIZE, seq_len - i * BLOCK_SIZE);
                                 ^^^^^^^

Plans to support sglang #1

Description

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions