Skip to content

V0.1.4.post3: remove flash_attn dependency

Compare
Choose a tag to compare
@github-actions github-actions released this 15 Jul 05:48
· 7 commits to main since this release
50d17d9

What's Changed

  • Feature(MInference): add triton-based decoding in case flash_attn is not available by @liyucheng09 in #35

Full Changelog: v0.1.4.post2...v0.1.4.post3