-
Notifications
You must be signed in to change notification settings - Fork 5
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Generate & Gather CDM Model Training Data #17
Comments
CDM example java project - navigate through rosetta-source/src/main/resources/result-json-files to find test pack samples for anonymized CDM trade representations coming from different contributions. Initially I would suggest you use the |
Questions Examples:
|
I'm adding a collection of question samples we got during CDM/DRR trainings and modeling. I know they may not address completely your request but at least they show the main interests we could observe for participants to extend the model or create their own implementations. We didn't add the answers since we had no capacity but all questions should be found in the CDM Documentation portal and other resources that will get published by FINOS in the near future.
|
Collaboration Call to Action
Description of Problem:
The AI4Finance team has the domain expertise and resources to train, fine-tune, and benchmark LLMs on CDM, but they do not have domain expertise in order to gather existing data/documentation/faqs/etc or to generate a list of "reference meaningful fact based questions & answers" in order to provide a source of truth in their training efforts.
Potential Solutions:
Note: Need to generate V1 ideally by end of third week of August in order to review and get it to the AI4Finance team so that they can begin training with time for OSFF.
The text was updated successfully, but these errors were encountered: