Error when running triton server with whisper model #522
Comments
I got the same issue, but it works properly on V100.
Could you have a look at this issue?
@jwkyeongzz You mean it works fine on V100, and the issue only happens with an RTX 3090 GPU?
@jackNhat May I ask which GPU you are using? Also, would you mind attaching more details, e.g. how to reproduce the error?
I thought the test environment might be the problem. At first, since the error occurred in a virtual Ubuntu 20.04 running under Windows, I assumed there was a problem with CUDA memory allocation. It may also have been caused by insufficient memory on the RTX 3090. So the RTX 3090 itself is not necessarily the problem.
When I ran client.py, I got this error message:
tritonclient.utils.InferenceServerException: [StatusCode.INTERNAL] in ensemble 'whisper', Failed to process the request(s) for model instance 'scorer_0', message: AssertionError: <EMPTY MESSAGE>
How can I fix this?
I ran the Triton server with the Whisper model, version large-v2.