How GPTCache Works #16

Open
humyna opened this issue Mar 21, 2024 · 0 comments

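GPTCache exploits the data locality of online services: it stores frequently accessed data to cut retrieval time and reduce load on backend servers. Unlike traditional caching systems, GPTCache performs semantic caching, identifying and storing similar or related queries to raise the cache hit rate.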

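GPTCache converts each query into a vector with an embedding algorithm and runs a similarity search in a vector database to retrieve related queries from the cache. GPTCache has a modular design, so users can customize each module as needed.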
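
To make the embed-then-search idea concrete, here is a minimal sketch in plain Python. It is not GPTCache's API: `embed`, `cosine`, `SemanticCache`, and `answer_with_cache` are hypothetical names, the bag-of-words "embedding" is a toy stand-in for a real embedding model, the linear scan stands in for a vector database, and the 0.8 threshold is an arbitrary assumption.

```python
import math
from collections import Counter


def embed(text):
    """Toy embedding: lowercase bag-of-words counts (stand-in for a real model)."""
    return Counter(text.lower().split())


def cosine(a, b):
    """Cosine similarity between two sparse count vectors."""
    dot = sum(a[t] * b[t] for t in a if t in b)
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


class SemanticCache:
    """Stores (vector, answer) pairs and returns the closest match above a threshold."""

    def __init__(self, threshold=0.8):
        self.threshold = threshold  # assumed cut-off; tune per workload
        self.entries = []

    def get(self, query):
        vec = embed(query)
        best_score, best_answer = 0.0, None
        # Linear scan; a vector database would do this search at scale.
        for stored_vec, answer in self.entries:
            score = cosine(vec, stored_vec)
            if score > best_score:
                best_score, best_answer = score, answer
        return best_answer if best_score >= self.threshold else None

    def put(self, query, answer):
        self.entries.append((embed(query), answer))


def answer_with_cache(cache, query, llm):
    """Return (answer, was_cache_hit); call the backend only on a miss."""
    cached = cache.get(query)
    if cached is not None:
        return cached, True
    answer = llm(query)
    cache.put(query, answer)
    return answer, False
```

Lowering the threshold raises the hit rate but also makes it more likely that an unrelated query is served a cached answer, which is the trade-off semantic caching has to manage.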

Although semantic caching can return false positives and false negatives, GPTCache provides three performance metrics to help developers tune their caching system.
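
The text above does not name the three metrics. As an illustration only, the sketch below assumes they are hit ratio, latency, and recall, and shows how they could be computed from a labeled evaluation run. `Lookup` and `summarize` are hypothetical names, and the `correct` and `should_hit` labels are assumed to come from manual or offline judgment, not from the cache itself.

```python
from dataclasses import dataclass


@dataclass
class Lookup:
    hit: bool          # the cache returned an answer
    correct: bool      # the returned answer was acceptable (only meaningful on a hit)
    should_hit: bool   # an equivalent query was already cached
    latency_ms: float  # end-to-end time for this request


def summarize(lookups):
    """Aggregate assumed cache metrics over a list of labeled lookups."""
    total = len(lookups)
    hits = sum(l.hit for l in lookups)
    true_hits = sum(l.hit and l.correct for l in lookups)
    expected = sum(l.should_hit for l in lookups)
    return {
        "hit_ratio": hits / total if total else 0.0,
        "avg_latency_ms": sum(l.latency_ms for l in lookups) / total if total else 0.0,
        # Share of queries that should have been served from cache and were, correctly.
        "recall": true_hits / expected if expected else 0.0,
        # Hits that returned an unacceptable answer.
        "false_positives": hits - true_hits,
        # Misses for queries that had an equivalent cached entry.
        "false_negatives": sum(l.should_hit and not l.hit for l in lookups),
    }
```

Tracking false positives and false negatives next to the hit ratio helps show whether a threshold change improved the cache or only made it more permissive.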

Through the process described above, GPTCache can find and recall similar or related queries from the cache.

https://zilliz.com.cn/what-is-gptcache
