V0.1.4.post3: remove flash_attn dependency
github-actions released this on 15 Jul 05:48
What's Changed
- Feature(MInference): add triton-based decoding in case flash_attn is not available by @liyucheng09 in #35
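
For readers wondering what an optional-dependency fallback of this kind can look like, here is a minimal sketch. It is not the code from #35: the helper names `has_module` and `pick_decoding_backend` are hypothetical, and the only assumption taken from the entry above is that `flash_attn` is preferred when installed and a Triton-based path is used otherwise.

```python
import importlib.util


def has_module(name: str) -> bool:
    """Return True if a top-level module can be found, without importing it."""
    return importlib.util.find_spec(name) is not None


def pick_decoding_backend(available=has_module) -> str:
    """Hypothetical helper: choose a decoding backend by installed packages.

    Prefers flash_attn; falls back to triton when flash_attn is missing.
    """
    if available("flash_attn"):
        return "flash_attn"
    if available("triton"):
        return "triton"
    raise ImportError("neither flash_attn nor triton is installed")
```

The `available` parameter exists only so the selection logic can be exercised without either package installed, for example `pick_decoding_backend(lambda name: name == "triton")`.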
Full Changelog: v0.1.4.post2...v0.1.4.post3