You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
有没有什么快速的办法能够统计到predictor的耗时呢,例如输出如下结果:
...
llama_print_timings: eval time = 2082.97 ms / 127 runs ( 16.40 ms per token, 60.97 tokens per second)
llama_print_timings: predictor eval time = xx.xx ms / 127 runs ( xx.xx ms per token, xx.xx tokens per second)
...
The text was updated successfully, but these errors were encountered:
Prerequisites
Before submitting your question, please ensure the following:
Question Details
有没有什么快速的办法能够统计到predictor的耗时呢,例如输出如下结果:
...
llama_print_timings: eval time = 2082.97 ms / 127 runs ( 16.40 ms per token, 60.97 tokens per second)
llama_print_timings: predictor eval time = xx.xx ms / 127 runs ( xx.xx ms per token, xx.xx tokens per second)
...
The text was updated successfully, but these errors were encountered: