Aleksei Dorkin PRO
adorkin
AI & ML interests
Computational Linguistics
Recent Activity
liked a dataset about 2 hours ago
jhu-clsp/ManyIH-Bench
upvoted an article about 11 hours ago
How we OCR'ed 30,000 papers using Codex, open OCR models and Jobs
liked a dataset 1 day ago
allenai/dolma3_pool
Organizations
spaces 6
Sleeping
1
NLI Zero Shot Classification
🔍
Zero-shot classification based on natural language inference
Sleeping
2
GliLem
🤓
Lemmatization disambiguation for Estonian with GliNER
Running
SigLIP2 + Clothes
🤔
Text-to-image clothing search using SigLIP2
Sleeping
1
M-CLIP + Clothes
🦀
Text-to-image clothing search using multilingual CLIP
Sleeping
1
Tweet Emoji Predictor
🧐
Predict an emoji for your tweet (...your X?)
Sleeping
Sõnajaht Demo
🐠
Cross-lingual reverse dictionary
datasets 17
adorkin/Ling-Coder-DPO-filtered
Viewer • Updated • 93.3k • 14
adorkin/OpenCodeInstruct-filtered-sft
Viewer • Updated • 445k • 32
adorkin/tulu-3-sft-mixture
Viewer • Updated • 939k • 7
adorkin/extended_tweet_emojis
Viewer • Updated • 52.7k • 105 • 3
adorkin/cosmopedia-v2-translate-append-instructions-et
Viewer • Updated • 6.85k • 8
adorkin/flan-v2-converted-en
Viewer • Updated • 58.2k • 8
adorkin/mala-bilingual-et-en-scores
Viewer • Updated • 50.9M • 32
adorkin/dclm-sample-13k-en-et-translation
Viewer • Updated • 13.7k • 4
adorkin/nllb-et-en-scores
Viewer • Updated • 22M • 16
adorkin/Magpie-Llama-3.1-Pro-300K-Filtered-18K-sample-et
Viewer • Updated • 36.6k • 4 • 1