Releases: michaelfeil/infinity
0.0.39
0.0.38
What's Changed
- update poetry lock to latest by @michaelfeil in #234
- Add auth and classify endpoint to openai server by @Rololi in #237
- ENV-Manager update by @michaelfeil in #238
New Contributors
Full Changelog: 0.0.37...0.0.38
0.0.37
What's Changed
- bump sentence-transformers to 3.0 by @michaelfeil in #232
- Add einops as extra dependency by @chiragjn in #233
Full Changelog: 0.0.36...0.0.37
0.0.36
What's Changed
- add infinity server kwargs for device by @michaelfeil in #230
Full Changelog: 0.0.35...0.0.36
0.0.35
What's Changed
- update docs: v2 cli and async request handling by @michaelfeil in #229
Full Changelog: 0.0.34...0.0.35
0.0.34
What's Changed
- Add option to enable permissive CORS headers to allow api access from… by @kir-gadjello in #214
- add v2 to CLI by @michaelfeil in #227
- Add revision and trust_remote_code to from_pretrained calls by @chiragjn in #224
New Contributors
- @kir-gadjello made their first contribution in #214
- @chiragjn made their first contribution in #224
Full Changelog: 0.0.33...0.0.34
0.0.33
What's Changed
- fix-orjson by @michaelfeil in #201
- Add EngineArray Multi-Model [1/3] by @michaelfeil in #200
- Openapi tests by @michaelfeil in #199
- refactor BatchHandler into ModelWorker by @michaelfeil in #202
- Add fp32 as runtime dtype by @michaelfeil in #211
Full Changelog: 0.0.32...0.0.33
0.0.32
What's Changed
You can now run a model under an alias, which makes it easier to refer to the model when communicating with the API.
infinity_emb --served-model-name "your_nickname"
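A minimal sketch of how a client might refer to the alias, assuming the server exposes an OpenAI-style `/embeddings` route that accepts a JSON body with `model` and `input` fields. The base URL, port, route, and the helper name `build_embedding_request` are illustrative assumptions, not taken from these release notes. The helper only builds the request and does not send it.

```python
import json
import urllib.request


def build_embedding_request(base_url, served_model_name, texts):
    """Build (but do not send) a POST request for an OpenAI-style embeddings route.

    Assumption: the server accepts {"model": ..., "input": [...]} at <base_url>/embeddings.
    """
    payload = {"model": served_model_name, "input": list(texts)}
    return urllib.request.Request(
        url=f"{base_url.rstrip('/')}/embeddings",
        data=json.dumps(payload).encode("utf-8"),
        headers={"Content-Type": "application/json"},
        method="POST",
    )


# Hypothetical local address; adjust host, port, and route to your deployment.
request = build_embedding_request(
    "http://localhost:7997", "your_nickname", ["hello world"]
)
```

To actually send it, pass `request` to `urllib.request.urlopen` once the server is running. The point of the alias is that the `model` field carries the nickname rather than the full model path.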
You can now preload models. This acts as a "download and load into RAM" test. Upon execution, all files are cached, which speeds up subsequent loads. For additional speedups, use --no-model-warmup to skip model warmup after loading.
infinity_emb --preload-only --model-name-or-path BAAI/bge-large-en-v1.5
PRs
- feat: add served_model_name argument for the infinity_server by @bufferoverflow in #180
- FIX: import crossencoder without torch installed and git push of creds by @michaelfeil in #181
- update default model_name to be unified name across routes by @michaelfeil in #179
- python39 type hints by @michaelfeil in #182
- pydantic cli / args validation by @michaelfeil in #183
- update defered moving to cpu & type hints improvement by @michaelfeil in #187
- Update README.md - add Contributors by @michaelfeil in #189
- update infinity offline solution by @michaelfeil in #195
- update offline-mode: deployment docs v2 by @michaelfeil in #196
New Contributors
- @bufferoverflow made their first contribution in #180 Thanks!
Full Changelog: 0.0.31...0.0.32
0.0.31
What's Changed
- Create ISSUE_TEMPLATE by @michaelfeil in #168
- bump sentence transformers to v.2.6.0 by @michaelfeil in #169
- Embedding quant by @michaelfeil in #170
- refactor ENUM..TypeHint into a function by @michaelfeil in #172
- refactored more imports by @michaelfeil in #171
- redirect to /docs and optional imports by @michaelfeil in #175
- update typing by @michaelfeil in #176
- update lock by @michaelfeil in #177
Full Changelog: 0.0.30...0.0.31
0.0.30
What's Changed
- remove fastembed by @michaelfeil in #141
- Sentence transformers bump to 2.5.0 by @michaelfeil in #142
- Revert "Sentence transformers bump to 2.5.0" by @michaelfeil in #143
- Update README.md by @michaelfeil in #145
- update poetry lock - sentence-transformers 2.5.0 by @michaelfeil in #144
- Support for Inferentia2 (draft) by @michaelfeil in #118
- Add bettertransformer to cli by @michaelfeil in #152
- Fp8 support by @michaelfeil in #153
- Some docstring and typing fixes by @lckr in #156
- add async tokenization to reranker in torch by @michaelfeil in #154
- Update README.md by @sherwin684 in #167
New Contributors
- @lckr made their first contribution in #156
- @sherwin684 made their first contribution in #167
Full Changelog: 0.0.29...0.0.30