Skip to content

[Feature][KubeAI] Creating model profiles for 10 LLMs on Gaudi and Xeon #1076

@joshuayao

Description

@joshuayao

Priority

Undecided

OS type

Ubuntu

Hardware type

Xeon-GNR

Running nodes

Single Node

Description

  • Select 10 models and create model profiles on Gaudi and Xeon
  • Benchmark and tune each model for different replicas (1,2,4,8)

Depends on #284, #286 and #288

Metadata

Metadata

Assignees

Labels

Backlogfeatures in backlogfeatureNew feature or request

Projects

Status

Done

Relationships

None yet

Development

No branches or pull requests

Issue actions