Hi! I am David Dale, research engineer in natural language processing. 👋
You can read more about me in Russian or English.
I am open to collaboration, especially on creating NLP tools (such as machine translation models) for lower-resourced languages.
Some of my best repos are:
- dialogic - for developing multiplatform chatbots and voice skills in Python
- compress-fasttext - for bringing lightweight, fast and accurate word embeddings to your project
- python-ruwordnet - for those who want to understand language beyound embeddings and need a Russian thesaurus
- dependency-paraphraser - a simple tool for paraphrasing that respect sentence structure
- word-mover-grammar - a constituency grammar parser that supports word embeddings
- weirdMath - a collection of small Python etudes, mostly about data science
You can also take a look at my HugginFace contributions, including:
- a tiny Russian BERT
- the first public Russian NLI model
- the only Russian multitask T5 model
- one of the largest Russian paraphrase datasets
My Telegram channels:
- https://t.me/izolenta_mebiusa - about programming and NLP
- https://t.me/matchast - about applied math and data science
Contacts: