New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

Sign up for GitHub

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Jump to bottom

微调效果问题：相同的数据集，在bge-base上微调带来比较好的提升，但是在bge-large上没有提升，loss也不怎么下降。 #1377

Open

meu98 opened this issue Feb 21, 2025 · 0 comments

meu98 commented Feb 21, 2025

背景：在自建的场景上，测试bge-base和bge-largex效果差异不大。
如题的问题，这里相同的数据集和测试集都是同源的，来自线上真实用户的query。请问这种现象正常吗？
在上面的数据的基础上再加上一批其他来源例如通过llm根据doc生成的query的数据，bge-large的效果会有提升，但loss下降依旧不明显。因为llm生成query的加入，初始loss会从3的量级降到0.6。但是最终的提升的效果还是不如微调bge-base。请问有可能的原因和解决办法吗？

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment