-
Notifications
You must be signed in to change notification settings - Fork 57
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Create dataset loader for MongabayConservationDataset #63
Comments
#self-assign |
Hi all sorry for late response, I think it's better if I assign myself to create the dataloader for this dataset. |
Hello @megasiska86, no problem if you want to handle the dataloader for this (I haven't started on it anyways). You can assign yourself now |
thank you @elyanah-aco , anyway how to assign myself here? I can't click |
@megasiska86 Please comment "#self-assign" just like I did in first comment |
#self-assign |
Hi @megasiska86, may I know if you are still working on this issue? Please let @holylovenia @SamuelCahyawijaya @sabilmakbar know if you need any help. |
yeah still working on it |
Okay then, @megasiska86. Feel free to let us know if you need any help! |
Hi, may I know if you are still working on this issue? Please let @holylovenia @SamuelCahyawijaya @sabilmakbar know if you need any help. |
Hi @megasiska86, may I know the progress of this dataloader creation? Since it has passed 2+2 weeks of expected completion of dataloader, I will clear the assignee if no update is received by Monday 12 PM UTC. |
#self-assign |
Hi @, may I know if you are still working on this issue? Please let @holylovenia @SamuelCahyawijaya @sabilmakbar know if you need any help. |
yes |
There is a problem with the softlabel column in the training data. For "TEXT" schema feature, it is required to specify the possible values for the string. For validation and test data, it is ["negatif", "positif", "netral"]. However, softlabel column value in the traning data is in the form of array of floats, which I currently don't know what rule to turn it into the possible 3 values. How can I resolve this issue? @sabilmakbar . I might also need help on the citation. |
Also for multi-label subset, the softlabel in validation and test data is also in the form of array |
Hi @megasiska86, could you please help answer @Enliven26's questions as the dataset owner? I also have a related question, @megasiska86. The huggingface dataset doesn't seem to have the labels for the topic classification task, so where do we get them? |
May I get an update about the question? I apologize since I haven't gotten any time to read the paper to find the rule to convert the softlabel into single-value label. Also, since in tags classification subset the label is in the form of array of tags, I think "TEXT" schema cant be used (?). |
Let me try mentioning @megasiska86 again in case she missed it, @Enliven26. |
I apologize for missed it for a long time @holylovenia I've updated the homepage of my dataset
|
Okay, I'll try move it to text2text schema. Thank you |
#self-assign |
Hi @holylovenia
|
Hi @megasiska86, can you make a pull request instead of trying to push to On another note, this issue had been Sorry for the inconvenience, @Enliven26. 🙏 |
Yeah I've tried to push it to the new branch and will create PR from that branch, but got this permission error. This permission error also comes up when I tried to clone the repository
Really sorry for the inconvenience 🙏 . I tried to #self-assign because I thought it would give me access to push to the branch, but it didn't. I also creeated the dataloader to solve issue raised in this comment
Again sorry for the inconvenience. |
Have you tried forking (not cloning) the SEACrowd/seacrowd-datahub repo? Here is a detailed guide on how to fork the repo and submit the dataloader. |
I will try it, thank you 🙏 |
Dataloader name:
mongabay/mongabay.py
DataCatalogue: http://seacrowd.github.io/seacrowd-catalogue/card.html?mongabay
The text was updated successfully, but these errors were encountered: