-
Notifications
You must be signed in to change notification settings - Fork 280
Issues: InternLM/lmdeploy
[Benchmark] benchmarks on different cuda architecture with mo...
#815
opened Dec 11, 2023 by
lvhan028
Open
6
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Author
Label
Projects
Milestones
Assignee
Sort
Issues list
[Bug] qwen 2 issue when transformers>4.41.2 for PyTorch Engine
#1885
opened Jun 29, 2024 by
zhyncs
2 tasks done
[Bug] 量化时候采取默认参数能够正常推理量化,设置了--search-scale True --batch-size 8,量化后无法推理
#1883
opened Jun 28, 2024 by
AIFFFENG
2 tasks
[Bug] AttributeError: 'LlavaNextConfig' object has no attribute 'hidden_size'
#1868
opened Jun 27, 2024 by
zhaozeno
1 of 2 tasks
使用pipeline加载Qwen1.5-32B-Chat,tp=4,使用openai prompt格式提示其清洗中文但生成回复都是英文
#1864
opened Jun 26, 2024 by
Yang-bug-star
使用OpenAI format的输入得到的response要如何提取出回复文本,返回的response好像是分段的
#1863
opened Jun 26, 2024 by
Yang-bug-star
[Bug] Segmentation fault: address not mapped to object at address 0x2058
#1849
opened Jun 25, 2024 by
austingg
2 tasks done
[Bug] InternLM2MLP.forward() missing 1 required positional argument: 'im_mask'
#1847
opened Jun 25, 2024 by
jiangjingz
2 tasks done
[Bug] lmdeploy - [31mERROR[0m - Truncate max_new_tokens to 221
#1841
opened Jun 24, 2024 by
tairen99
1 of 2 tasks
[Feature] How to support bf16 when inferencing Internvl-chat
#1839
opened Jun 24, 2024 by
Leo-yang-1020
[Bug] Qwen-7B-Chat 量化报错 AttributeError: 'RMSNorm' object has no attribute 'variance_epsilon'
#1830
opened Jun 23, 2024 by
CodexDive
2 tasks
Previous Next
ProTip!
no:milestone will show everything without a milestone.