2 10 71

Sofi Casadei

sofdog

sofi444

AI & ML interests

NLP

Recent Activity

liked a Space 16 days ago

piimb/pii-masking-benchmark-leaderboard

upvoted a changelog 19 days ago

Spaces agents.md for your coding agents

upvoted an article 4 months ago

How We Built a Semantic Highlight Model To Save Token Cost for RAG

View all activity

Organizations

liked a Space 16 days ago

Pii Masking Benchmark Leaderboard

🥇

PII Masking Benchmark Leaderboard

upvoted a changelog 19 days ago

Hugging Face Changelog

Spaces agents.md for your coding agents

27 days ago

• 267

upvoted an article 4 months ago

Article

How We Built a Semantic Highlight Model To Save Token Cost for RAG

zilliz

•

Jan 15

• 67

liked a Space 4 months ago

ObjectClear

🪄

158

Remove objects from images by clicking

liked 2 Spaces 5 months ago

The Ultra-Scale Playbook

🌌

3.84k

The ultimate guide to training LLM on large GPU Clusters

Autotools

📈

Automate tool use with language models

liked a model 6 months ago

Nashhz/SBERT_KFOLD_Job_Descriptions_to_Skills

liked a model 7 months ago

PaddlePaddle/PaddleOCR-VL

Image-Text-to-Text • 1.0B • Updated 14 days ago • 11k • 1.6k

upvoted an article 7 months ago

Article

Supercharge your OCR Pipelines with Open Models

merve, ariG23498, davanstrien, hynky, andito, reach-vb, pcuenq

•

Oct 21, 2025

• 312

New activity in knowledgator/gliner-pii-base-v1.0 8 months ago

Model Sizes and Supported Languages

👀 1

#3 opened 8 months ago by

sofdog

liked a Space 8 months ago

Pteredactyl PII

🐨

Anonymize clinical text to protect patient information

upvoted 2 articles 8 months ago

Article

RexBERT: Encoders for a brave new world of E-Commerce

thebajajra

•

Sep 20, 2025

• 50

Article

Large-scale Near-deduplication Behind BigCode

chenghao

•

May 16, 2023

• 37

upvoted a collection 8 months ago

Multilingual LLM Evaluation

Collection

Multilingual Evaluation Benchmarks • 8 items • Updated Jul 31, 2025 • 34

liked a dataset 8 months ago

CohereLabs/include-base-44

Viewer • Updated Apr 15, 2025 • 23k • 13.6k • 49

updated a model 9 months ago

sofdog/splade-en-fr-eurobert-v1

0.3B • Updated Aug 17, 2025 • 1

liked a model 9 months ago

rednote-hilab/dots.ocr

Image-Text-to-Text • 3B • Updated Oct 31, 2025 • 206k • 1.3k

liked a Space 10 months ago

OCR Time Machine

📚

Extract text from images and XML files using OCR models

upvoted an article 10 months ago

Article

Xet is on the Hub

assafvayner, brianronan, seanses, jgodlewski, sirahd, jsulz

•

Mar 18, 2025

• 80

liked a Space 10 months ago

Checkbox Detector

🏆

Analyze scanned documents to detect and classify checkboxes

Sofi Casadei

AI & ML interests

Recent Activity

Organizations

sofdog's activity

Pii Masking Benchmark Leaderboard

Spaces agents.md for your coding agents

How We Built a Semantic Highlight Model To Save Token Cost for RAG

ObjectClear

The Ultra-Scale Playbook

Autotools

Supercharge your OCR Pipelines with Open Models

Model Sizes and Supported Languages

Pteredactyl PII

RexBERT: Encoders for a brave new world of E-Commerce

Large-scale Near-deduplication Behind BigCode

OCR Time Machine

Xet is on the Hub

Checkbox Detector