Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

如何获取tool_quality_classifier模块中[chinese,code,gtp3]这3个模型的权重? #467

Open
3 tasks done
yaun248 opened this issue Oct 30, 2024 · 1 comment
Open
3 tasks done
Assignees
Labels
question Further information is requested

Comments

@yaun248
Copy link

yaun248 commented Oct 30, 2024

Before Asking 在提问之前

  • I have read the README carefully. 我已经仔细阅读了 README 上的操作指引。

  • I have pulled the latest code of main branch to run again and the problem still existed. 我已经拉取了主分支上最新的代码,重新运行之后,问题仍不能解决。

Search before asking 先搜索,再提问

  • I have searched the Data-Juicer issues and found no similar questions. 我已经在 issue列表 中搜索但是没有发现类似的问题。

Question

在使用tool_quality_classifier工具过程中如何获取到分类器模型的权重?我在/root/.cache/data_juicer/models/gpt3_quality_model这个路径下找了下存储大小只有4M,这应该不是一个模型的权重。

Additional 额外信息

No response

@yaun248 yaun248 added the question Further information is requested label Oct 30, 2024
@yaun248 yaun248 changed the title 如何获取tool_quality_classifier模块中[chinese,code,gtp3]折3个模型的权重? 如何获取tool_quality_classifier模块中[chinese,code,gtp3]这3个模型的权重? Oct 30, 2024
@HYLcool
Copy link
Collaborator

HYLcool commented Oct 31, 2024

@yaun248 ,感谢你的关注与使用~

这三个模型均为spark的逻辑斯蒂回归分类器,你找到的那个路径中保存的就是模型的权重

@HYLcool HYLcool self-assigned this Oct 31, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
question Further information is requested
Projects
None yet
Development

No branches or pull requests

2 participants