Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Dutch Quora, Stack Overflow, Alpaca dataset released #34

Closed
BramVanroy opened this issue Apr 10, 2023 · 3 comments
Closed

Dutch Quora, Stack Overflow, Alpaca dataset released #34

BramVanroy opened this issue Apr 10, 2023 · 3 comments

Comments

@BramVanroy
Copy link

BramVanroy commented Apr 10, 2023

Hello

I saw that you released your dataset for everyone to use, so I translated it with OpenAI's model and released it on the HF Hub. I hope it helps others who want to work on Dutch.

You can find the Quora chat set and the Stack Overflow dataset in Dutch. I've also translated the Alpaca Cleaned dataset into Dutch and also converted it into the Baize format. Feel free to add it here or anywhere with a reference to the repository.

Best

Bram

@BramVanroy BramVanroy changed the title Dutch Quora Chat dataset released Dutch Quora and Stack Overflow Chat dataset released Apr 11, 2023
@BramVanroy BramVanroy changed the title Dutch Quora and Stack Overflow Chat dataset released Dutch Quora, Stack Overflow, Alpaca dataset released Apr 12, 2023
@guoday
Copy link
Collaborator

guoday commented Apr 13, 2023

Thanks Bram.

@JetRunner
Copy link
Contributor

We just added a dedicated section in README for community efforts: 24d1516

@BramVanroy
Copy link
Author

Great, thanks!

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

3 participants