
Releases: michaelfeil/infinity

0.0.7

12 Nov 10:41
6cad81c

What's Changed

  • Docker: Cuda11.8, make dependencies optional by @michaelfeil in #33
  • Breaking change: new install groups; install via pip install infinity-emb[server,logging,onnx-gpu-runtime]
  • add onnx-gpu by @michaelfeil in #26

Full Changelog: 0.0.6...0.0.7

0.0.6

11 Nov 13:29
2b930f1

What's Changed

Full Changelog: 0.0.5...0.0.6

0.0.5

06 Nov 07:58
a1389ce

What's Changed

  • Docker image multi by @michaelfeil in #24
  • patch a missing event, cutting inference latency from ~200 ms to ~7 ms at batch size 1
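A missing-event bug like the one above typically means the batch loop slept on a fixed timer instead of being woken when a request arrived. A minimal sketch of the event-driven pattern (hypothetical names, not Infinity's actual code):

```python
import asyncio


class BatchQueue:
    """Toy batch loop: an asyncio.Event wakes the worker as soon as a
    request arrives, instead of polling on a fixed ~200 ms sleep."""

    def __init__(self) -> None:
        self._items: list[str] = []
        self._new_item = asyncio.Event()

    async def submit(self, item: str) -> None:
        self._items.append(item)
        self._new_item.set()  # wake the batch loop immediately

    async def pop_batch(self) -> list[str]:
        await self._new_item.wait()  # returns as soon as submit() fires
        self._new_item.clear()
        batch, self._items = self._items, []
        return batch


async def demo() -> list[str]:
    q = BatchQueue()
    await q.submit("hello")
    return await q.pop_batch()


print(asyncio.run(demo()))  # prints ['hello']
```

With a fixed sleep, a lone request waits out the full timer before being batched; with the event, the worker wakes as soon as the request lands, which matches the order-of-magnitude latency drop described above.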

Full Changelog: 0.0.4...0.0.5

0.0.4

04 Nov 12:33
be64ff2

What's Changed

Issues:

  • Closes #5: ONNX support via https://github.com/qdrant/fastembed/
  • Closes #22: make pytorch an optional dependency

tl;dr

  • fastembed as backend besides ct2 or torch
  • v1/models returns "backend"
  • makes torch an optional dependency
  • calculates the minimum sleep time dynamically on startup -> slightly faster
  • default model is now "BAAI/bge-small-en-v1.5"

Full Changelog: 0.0.3...0.0.4

0.0.3

30 Oct 13:58
8116680

What's Changed

  • add Flash-Attention+ optimum-BetterTransformers by @michaelfeil in #20
  • Improve real-time / sleep strategy, async await for queues and result futures - reducing latency a bit by @michaelfeil in #12
  • add a better FIFO queueing strategy - your requests now have an upper bound on how long they queue by @michaelfeil in #19
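The upper bound on queueing time can be sketched with asyncio.wait_for around a bounded queue; this is an illustrative pattern, not Infinity's implementation:

```python
import asyncio


async def enqueue_with_deadline(queue: asyncio.Queue, item: str,
                                max_wait_s: float) -> bool:
    """Try to enqueue FIFO-style, but give up after max_wait_s so a
    request never waits unboundedly behind a full queue."""
    try:
        await asyncio.wait_for(queue.put(item), timeout=max_wait_s)
        return True
    except asyncio.TimeoutError:
        return False  # caller can reject the request, e.g. with HTTP 503


async def demo() -> tuple[bool, bool]:
    q: asyncio.Queue[str] = asyncio.Queue(maxsize=1)
    ok_first = await enqueue_with_deadline(q, "a", max_wait_s=0.05)
    # The queue is now full, so the second put times out instead of
    # blocking forever.
    ok_second = await enqueue_with_deadline(q, "b", max_wait_s=0.05)
    return ok_first, ok_second


print(asyncio.run(demo()))  # prints (True, False)
```

The point of the bound is back-pressure: under overload a client gets a fast rejection it can retry, rather than a request that sits in the queue indefinitely.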

Full Changelog: 0.0.2rc0...0.0.3

0.0.2

22 Oct 10:51

What's Changed

Full Changelog: 0.0.1...0.0.2rc0

0.0.1

12 Oct 16:41

Initial release of Infinity

0.0.1-dev3

12 Oct 16:07
Pre-release

What's Changed

Full Changelog: 0.0.1-dev2...0.0.1-dev3

0.0.1-dev2 - Speedups

12 Oct 01:46
3ed24bb
Pre-release

  • adds a new dependency (orjson) for faster response serialization - ~300% faster
  • uses torch.inference_mode() and delayed moving to CPU - ~10% faster
  • adds uvicorn[standard] - slightly faster (2-5%?)
  • updates the README
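The orjson change amounts to swapping the JSON serializer used for response bodies. A minimal sketch with a stdlib fallback (illustrative, not Infinity's actual code):

```python
import json

try:
    # orjson is a fast Rust-backed JSON serializer; dumps() returns bytes.
    import orjson

    def dumps(obj: object) -> bytes:
        return orjson.dumps(obj)
except ImportError:  # fall back to the stdlib if orjson is absent
    def dumps(obj: object) -> bytes:
        return json.dumps(obj).encode()

# Hypothetical embedding-style payload, just to exercise the serializer.
payload = {"embedding": [0.1, 0.2, 0.3], "model": "BAAI/bge-small-en-v1.5"}
print(dumps(payload))
```

For embedding responses, which are dominated by large float lists, the serializer can be a measurable share of request latency, which is why swapping it yields the speedup quoted above.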

#2

0.0.1-dev1

11 Oct 18:08
Pre-release

This is a release for testing the CI of Infinity.