gputopia-worker-0.3.0
- extended support for /v1/embeddings
- use model "fastembed:BAAI/bge-base-en-v1.5" for the most recent and fastest models
- llama model embeddings work as expected
- download https://gputopia.s3.us-east-2.amazonaws.com/bin/gputopia-worker-cuda-torch-linux-64.tar.gz to support fine tuning