Skip to content

Conversation

@louie-tsai
Copy link
Contributor

@louie-tsai louie-tsai commented Mar 11, 2025

Description

fixed a missing package for benchmarking with Dockerfile.
Add benchmark dockerfile to get metrics and do benchmark among micoservices starting by docker compose.

here is the benchmark_putput from ChatQnA on Gaudi
chatqnafixed_20250322_084132.zip

there will be a seperate opea-eval instance along with other docker instances from ChatQnA.
image

Users basically can run benchmark inside the opea-eval instance directly and all are same gaudi_default network.
image

Gaudi2 Snapshot running GenAIEval
https://snapshots.raintank.io/dashboard/snapshot/WC02g0ZugBYibM6I177TGsVEu3PktIZ1?orgId=0&refresh=5s
https://snapshots.raintank.io/dashboard/snapshot/SAJFc4xELoEt8417yXsEvTgb0tKQG8Jk?orgId=0
https://snapshots.raintank.io/dashboard/snapshot/heUhMVzHffbuMKGZH9rp39dYohQmDzhA?orgId=0&refresh=1m
https://snapshots.raintank.io/dashboard/snapshot/vv9k6CqWHdSCG7xSxTWDnz1g1wL1RjgZ?orgId=0
https://snapshots.raintank.io/dashboard/snapshot/iMELRy0JH2KT5cXLYaZ3luvKPYjQqrXE?orgId=0&refresh=5s

Gaudi2 snapshot for 1 request output token length 128

https://snapshots.raintank.io/dashboard/snapshot/oTUYdvDaCJSqV6TG730pDuYp97WyPoNH?orgId=0
https://snapshots.raintank.io/dashboard/snapshot/KEkRh8Pau2mgv04j3C20Fg6kQFrO2UNc?orgId=0&refresh=5s
https://snapshots.raintank.io/dashboard/snapshot/1fjlelsleLm4EhrGXzz5IXAvjHH7v2H9?orgId=0&refresh=5s
https://snapshots.raintank.io/dashboard/snapshot/X9q4GaL2KOL12Q7yktRtf7dFPPeFaj44?orgId=0&refresh=5s

Screenshot 2025-04-02 102713
Screenshot 2025-04-02 102829
Screenshot 2025-04-02 102855

Gaudi2 snapshot for 1 request output token length 560
https://snapshots.raintank.io/dashboard/snapshot/9pGI1IItBrFtkPcqZkrtjNNkjqZCciaF?orgId=0
https://snapshots.raintank.io/dashboard/snapshot/OdK4AhQQApS28wYZNv62eM7oLdZtNIc9?orgId=0&refresh=5s
Screenshot 2025-04-02 103142
Screenshot 2025-04-02 103208

Gaudi2 snapshot for 32 request with 4 concurrency output token length 128
https://snapshots.raintank.io/dashboard/snapshot/WzgByEohhBEd936YcmjuvHhsA6xZ8tbd
https://snapshots.raintank.io/dashboard/snapshot/la2aqQ6weent7QrFchTgD2DWKAkfPsxk
https://snapshots.raintank.io/dashboard/snapshot/9a2Vqiaqq5XdkO1EbrpYYPfMJYcDn5v0
image
image

Gaudi2 snapshot for 128 request with 4 concurrency output token length 128
https://snapshots.raintank.io/dashboard/snapshot/NjqIt6hvSPIQiKbtB2JwFGJGZn34m99q
https://snapshots.raintank.io/dashboard/snapshot/a1QMfo16vB9aWoLfYwlShF6YXu66GxtU
https://snapshots.raintank.io/dashboard/snapshot/U8UMdhoi4PjsCf4ecev0FIZFMiOCZScJ
image
image

Gaudi2 snapshot for 512 request with 4 concurrency output token length 128
https://snapshots.raintank.io/dashboard/snapshot/c1knBpBywmQ0mgIi12JiNSHEA6w8uoIq
https://snapshots.raintank.io/dashboard/snapshot/BxadppaKHnuKIZvnT1jsiv0iB2MKdWMk
https://snapshots.raintank.io/dashboard/snapshot/9OUecRMBQ8vMkmIkrL7dJFrpccuyV1WF
image
image

Issues

n/a.

Type of change

List the type of change like below. Please delete options that are not relevant.

  • Bug fix (non-breaking change which fixes an issue)
  • New feature (non-breaking change which adds new functionality)
  • Breaking change (fix or feature that would break existing design and interface)

Dependencies

NA

Tests

test it on both Gaudi and Xeon

@louie-tsai louie-tsai force-pushed the fix_req branch 2 times, most recently from 21e0d26 to dc75b00 Compare March 21, 2025 23:54
@louie-tsai louie-tsai requested a review from lkk12014402 March 21, 2025 23:57
@louie-tsai louie-tsai changed the title fixed a missing package for benchmarking with Dockerfile an also add a new CPU dockerfile Add benchmark docker image to support getting metrics among microservices and fixed a missing package for benchmarking with Dockerfile Mar 22, 2025
@louie-tsai louie-tsai force-pushed the fix_req branch 3 times, most recently from ba488ac to c8e8973 Compare March 26, 2025 00:38
@louie-tsai
Copy link
Contributor Author

@lvliang-intel @lkk12014402 please help to review it.

@louie-tsai louie-tsai force-pushed the fix_req branch 3 times, most recently from 5cbb1f9 to e556cf4 Compare March 27, 2025 23:29
@louie-tsai
Copy link
Contributor Author

For Gaudi2, lots of time spent on TEI
image

@louie-tsai louie-tsai force-pushed the fix_req branch 2 times, most recently from 8cc6b23 to 9931eb5 Compare April 2, 2025 15:26
Signed-off-by: Tsai, Louie <[email protected]>
Signed-off-by: louie-tsai <[email protected]>
Signed-off-by: Tsai, Louie <[email protected]>
@louie-tsai louie-tsai requested a review from lvliang-intel April 3, 2025 04:09
… compose runs

Signed-off-by: louie-tsai <[email protected]>

Signed-off-by: Tsai, Louie <[email protected]>
@lvliang-intel lvliang-intel requested a review from Zhenzhong1 April 3, 2025 06:15
@lvliang-intel lvliang-intel merged commit dc3409f into opea-project:main Apr 3, 2025
8 checks passed
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

4 participants