Aleksei Dorkin PRO
adorkin
AI & ML interests
Computational Linguistics
Recent Activity
liked a dataset 3 days ago
nvidia/OCR-Synthetic-Multilingual-v1
upvoted a changelog 4 days ago
Introducing Kernels
liked a model 9 days ago
laion/whisper-captioning-ensemble
Organizations
Multilingual Text Embedding Models
- tencent/KaLM-Embedding-Gemma3-12B-2511
  Sentence Similarity • Updated • 34.7k • 89
- nvidia/llama-embed-nemotron-8b
  Feature Extraction • 8B • Updated • 41.1k • 155
- Qwen/Qwen3-Embedding-8B
  Feature Extraction • 8B • Updated • 1.85M • 656
- Qwen/Qwen3-Embedding-4B
  Feature Extraction • Updated • 1.82M • 251
Multilingual Text Encoders
Spaces 6
Sleeping
Agents
1
NLI Zero Shot Classification
🔍
Zero-shot classification based on natural language inference
Sleeping
Agents
2
GliLem
🤓
Lemmatization disambiguation for Estonian with GliNER
Running
Agents
SigLIP2 + Clothes
🤔
Text-to-image clothing search using SigLIP2
Sleeping
Agents
1
M-CLIP + Clothes
🦀
Text-to-image clothing search using multilingual CLIP
Sleeping
Agents
1
Tweet Emoji Predictor
🧐
Predict an emoji for your tweet (...your X?)
Sleeping
Agents
Sõnajaht Demo
🐠
Cross-lingual reverse dictionary
Datasets 17
adorkin/Ling-Coder-DPO-filtered
Viewer • Updated • 93.3k • 5
adorkin/OpenCodeInstruct-filtered-sft
Viewer • Updated • 445k • 13
adorkin/tulu-3-sft-mixture
Viewer • Updated • 939k • 16
adorkin/extended_tweet_emojis
Viewer • Updated • 52.7k • 162 • 3
adorkin/cosmopedia-v2-translate-append-instructions-et
Viewer • Updated • 6.85k • 12
adorkin/flan-v2-converted-en
Viewer • Updated • 58.2k • 9
adorkin/mala-bilingual-et-en-scores
Viewer • Updated • 50.9M • 21
adorkin/dclm-sample-13k-en-et-translation
Viewer • Updated • 13.7k • 7
adorkin/nllb-et-en-scores
Viewer • Updated • 22M • 16
adorkin/Magpie-Llama-3.1-Pro-300K-Filtered-18K-sample-et
Viewer • Updated • 36.6k • 9 • 1