v0.6.4
What's new in 0.6.4 (2023-11-24)
These are the changes in inference v0.6.4.
New features
- FEAT: Support registering custom embedding model by @ChengjieLi28 in #667
- FEAT: Supports
qwen.cpp
forqwen-chat
withggml
format by @ChengjieLi28 in #675 - FEAT: Xverse by @fengsxy in #678
- FEAT: Support rerank models by @aresnow1 in #672
Enhancements
- ENH: Add
generate
interface forchatglm
withggml
format by @ChengjieLi28 in #671
Bug fixes
- BUG: Fix custom model missing config json by @codingl2k1 in #674
- BUG: Fix http error is not raised by @codingl2k1 in #657
- BUG: Fix pip install xinference[all] by @codingl2k1 in #679
Documentation
- DOC: update pot files by @UranusSeven in #638
- DOC: A more detailed beginner's guide has been created, covering various aspects of the first-time usage experience for new users. by @onesuper in #651
- DOC: documentation for using xinference by @fengsxy in #677
- DOC: Register custom embedding model by @ChengjieLi28 in #683
Others
- Add why xinf section to readme to compare pivitol features with others by @onesuper in #652
- Fix README.md by @aresnow1 in #669
New Contributors
Full Changelog: v0.6.3...v0.6.4