You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Traceback (most recent call last):
File "/Users/yaron/projects/llmperf/token_benchmark_ray.py", line 456, in <module>
run_token_benchmark(
File "/Users/yaron/projects/llmperf/token_benchmark_ray.py", line 297, in run_token_benchmark
summary, individual_responses = get_token_throughput_latencies(
File "/Users/yaron/projects/llmperf/token_benchmark_ray.py", line 116, in get_token_throughput_latencies
request_metrics[common_metrics.REQ_OUTPUT_THROUGHPUT] = num_output_tokens / request_metrics[common_metrics.E2E_LAT]
ZeroDivisionError: division by zero
The text was updated successfully, but these errors were encountered:
Running the benchmark script on a llama-3-8b-inst on inferentia 2 (djl-serving) results in:
The text was updated successfully, but these errors were encountered: