Skip to content

Commit

Permalink
bugfix: fix min-p AOT compilation in #713 (#717)
Browse files Browse the repository at this point in the history
#713 didn't update the function signature of min-p API in AOT pybind
file, this PR fixes the issue.
  • Loading branch information
yzh119 authored Jan 4, 2025
1 parent 989dbfa commit def019e
Showing 1 changed file with 2 additions and 2 deletions.
4 changes: 2 additions & 2 deletions csrc/flashinfer_ops.cu
Original file line number Diff line number Diff line change
Expand Up @@ -174,8 +174,8 @@ void top_k_sampling_from_probs(at::Tensor probs, at::Tensor uniform_samples, at:
unsigned int top_k_val, bool deterministic, int64_t cuda_stream);

void min_p_sampling_from_probs(at::Tensor probs, at::Tensor uniform_samples, at::Tensor samples,
at::Tensor success, std::optional<at::Tensor> maybe_min_p_arr,
double min_p_val, bool deterministic, int64_t cuda_stream);
std::optional<at::Tensor> maybe_min_p_arr, double min_p_val,
bool deterministic, int64_t cuda_stream);

void top_k_top_p_sampling_from_probs(at::Tensor probs, at::Tensor uniform_samples,
at::Tensor samples, at::Tensor success,
Expand Down

0 comments on commit def019e

Please sign in to comment.