-
Notifications
You must be signed in to change notification settings - Fork 127
add slide and readme for infer #148
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
base: dev
Are you sure you want to change the base?
Conversation
| @@ -0,0 +1,279 @@ | |||
| # MindSpore Transformers & vLLM-MindSpore 插件式服务化部署与评测 | |||
|
|
|||
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
建议补充下之前直播录屏的地址
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
已更新
| # MindSpore Transformers & vLLM-MindSpore 插件式服务化部署与评测 | ||
|
|
||
| ## 目录 | ||
| - [安装部署](#安装部署) |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
这里直接给个链接参考官网环境安装, 不在这里再维护一份了
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
已更新
| ais_bench/benchmark/configs/models/vllm_api/vllm_api_general_chat.py | ||
|
|
||
| # 启动评测 | ||
| python run_benchmark.py --models vllm_api_general --datasets gsm8k_gen_0_shot_cot_str |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
这里的run_benchmark是在哪可以贴一下地址
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
已更新
| ### 精度评测 | ||
| ```bash | ||
| # 修改配置文件 | ||
| ais_bench/benchmark/configs/models/vllm_api/vllm_api_general_chat.py |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
ais_bench地址可以贴一下
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
已更新
2b3cfa5 to
7285069
Compare
| | 4 | xxx | xxx | [PPT](跳转链接) · [代码](跳转链接) · [视频](跳转链接) · [云沙箱实验](跳转链接) · [学习路径](跳转链接) | [中级认证入口](xxxx) | | ||
| | 1 | MindSpore Transformers基础 | 介绍MindSpore Transformers架构及基本使用。 | [PPT](#) · [代码](#) · [视频](#) · [云沙箱实验](#) · [学习路径](#) | | | ||
| | 2 | vLLM-MindSpore服务化部署 | 学习vLLM-MindSpore的安装、启动及参数配置。 | [PPT](#) · [代码](#) · [视频](https://www.bilibili.com/video/BV1Ys1aBxEFD/?share_source=copy_web&vd_source=fd4588b77d7b0209a532d9279088f606) · [云沙箱实验](#) · [学习路径](#) | [初级认证入口](#) | | ||
| | 3 | 大模型推理高级特性 | 深入理解Chunked Prefill、Prefix Caching等优化技术。 | [PPT](#) · [代码](#) · [视频](#) · [云沙箱实验](#) · [学习路径](#) | | |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
ppt 代码 云沙箱啥的,没有的是不是先不写了,就保留有的就好了。视频如果是共用的话可以改成一列
| | 版本名 | Python | MindSpore | MindSpore NLP | | ||
| | :----- | :----- |:------ |:------ | | ||
| | master | xxx | xxx | xxx | | ||
| | r1.0 | xxx | xxx | xxx | | ||
|
|
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
这里版本维护是不是要写一下
| | 3 | 大模型推理高级特性 | 深入理解Chunked Prefill、Prefix Caching等优化技术。 | [PPT](#) · [代码](#) · [视频](#) · [云沙箱实验](#) · [学习路径](#) | | | ||
| | 4 | 混合并行与量化推理 | 掌握混合并行部署及模型量化推理的最佳实践。 | [PPT](#) · [代码](#) · [视频](#) · [云沙箱实验](#) · [学习路径](#) | [中级认证入口](#) | | ||
|
|
||
| # MindSpore Transformers & vLLM-MindSpore 插件式服务化部署与评测 |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
这里标题比较长,直接改成“服务化部署与评测指导”是不是好点
No description provided.