Expanding Emoji to German words #5

Open
PiotrCzapla opened this issue Sep 29, 2018 · 5 comments

@PiotrCzapla
Member

As discussed on the forum, I would love to see how this improves the model.

@MicPie

MicPie commented Oct 1, 2018

Dear Piotr,

Thank you for organizing the ulmfit4de repo!

I am currently trying to scrape the German emoji descriptions into Python with Beautiful Soup and then write them to a CSV file.

I will get back to you once I have them and have checked them.

Best regards
Michael
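
For illustration, the plan above (German emoji descriptions stored in a CSV and then used to expand emoji into words) could look roughly like the sketch below. It is not code from the repo: the three sample descriptions, the column names `emoji` and `description_de`, and the helper names are assumptions, and an in-memory buffer stands in for the CSV file that the scraping step would produce.

```python
import csv
import io

# Hypothetical sample data; real descriptions would come from the scraped source.
EMOJI_DE = {
    "😀": "grinsendes Gesicht",
    "👍": "Daumen hoch",
    "❤": "rotes Herz",
}


def write_emoji_csv(mapping, fh):
    """Write emoji/description pairs to an open text file handle."""
    writer = csv.writer(fh)
    writer.writerow(["emoji", "description_de"])  # assumed column names
    for emoji, description in mapping.items():
        writer.writerow([emoji, description])


def read_emoji_csv(fh):
    """Read the CSV back into an emoji -> description dictionary."""
    reader = csv.DictReader(fh)
    return {row["emoji"]: row["description_de"] for row in reader}


def expand_emoji(text, mapping):
    """Replace each known emoji with its German description."""
    # Longer keys first, so multi-code-point emoji are matched before their parts.
    for emoji in sorted(mapping, key=len, reverse=True):
        text = text.replace(emoji, f" {mapping[emoji]} ")
    # Collapse the extra spaces introduced by the padding above.
    return " ".join(text.split())


buffer = io.StringIO()  # stands in for a real CSV file
write_emoji_csv(EMOJI_DE, buffer)
buffer.seek(0)
loaded = read_emoji_csv(buffer)
print(expand_emoji("Super Film 👍😀", loaded))
```

Padding each replacement with spaces keeps adjacent emoji from fusing into a single token, which matters if the expanded text is tokenized on whitespace afterwards.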

@MicPie

MicPie commented Oct 5, 2018

Dear Piotr,

See the link Marcin posted on the fastai forum to the XML files with the different emoji translations:
http://forums.fast.ai/t/ulmfit-german/22529/49

Best regards
Michael
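
The thread does not show what the linked XML files look like. As a rough sketch, assuming they follow a layout similar to the Unicode CLDR annotation files (an `annotation` element per emoji, with the emoji in a `cp` attribute and the short name marked `type="tts"`), the German names could be read with the standard library. The sample document and the function name `load_tts_names` are made up for illustration.

```python
import xml.etree.ElementTree as ET

# Hypothetical sample in an assumed CLDR-like layout; the real files may differ.
SAMPLE_XML = """<ldml>
  <annotations>
    <annotation cp="😀">gesicht | grinsen</annotation>
    <annotation cp="😀" type="tts">grinsendes Gesicht</annotation>
    <annotation cp="👍" type="tts">Daumen hoch</annotation>
  </annotations>
</ldml>"""


def load_tts_names(xml_text):
    """Return an emoji -> short name mapping from annotation XML."""
    root = ET.fromstring(xml_text)
    names = {}
    for node in root.iter("annotation"):
        # Keep only the short name entries, skipping the keyword lists.
        if node.get("type") == "tts" and node.text:
            names[node.get("cp")] = node.text.strip()
    return names


names = load_tts_names(SAMPLE_XML)
print(names)
```

If the real files use different element or attribute names, only the filter inside the loop would need to change.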

@PiotrCzapla
Member Author

Thanks. Have you had time to give it a try?

@MicPie

MicPie commented Nov 6, 2018

Dear Piotr,

Sorry for my late reply. I was busy getting started with the fastai v1 library.

I am also following your ULMFiT threads on the fastai forums.
What is your plan with the upcoming fastai text v1 and Google BERT?

I will try to set aside some time on the weekend to look into the ulmfit4de repo.

Best regards
Michael

@PiotrCzapla
Member Author

Don't worry. Following fastai v1 closely and doing all the projects will get you a long way, so it is a good choice. Regarding BERT, we want to see how much better or worse it is than ULMFiT on classification problems.
ULMFiT is much faster and easier to train than BERT, so I think it still has some good use cases. But we need to compare them to see what we are losing or gaining.

By the way, I think DE is pretty much done: we exceeded SOTA for GE17 and we are very close to SOTA for GE18. The only remaining things are to consolidate the languages in fastai v1 and maybe to try biLM training, but that is for later.
