view article Article How We Built a Semantic Highlight Model To Save Token Cost for RAG zilliz • Jan 15 • 67
Running 3.84k The Ultra-Scale Playbook 🌌 3.84k The ultimate guide to training LLM on large GPU Clusters
Nashhz/SBERT_KFOLD_Job_Descriptions_to_Skills Sentence Similarity • 22.7M • Updated Dec 23, 2024 • 15 • 1
view article Article Supercharge your OCR Pipelines with Open Models +5 merve, ariG23498, davanstrien, hynky, andito, reach-vb, pcuenq • Oct 21, 2025 • 312
view article Article RexBERT: Encoders for a brave new world of E-Commerce thebajajra • Sep 20, 2025 • 50
Multilingual LLM Evaluation Collection Multilingual Evaluation Benchmarks • 8 items • Updated Jul 31, 2025 • 34
Running on Zero Agents 66 OCR Time Machine 📚 66 Extract text from images and XML files using OCR models
view article Article Xet is on the Hub +4 assafvayner, brianronan, seanses, jgodlewski, sirahd, jsulz • Mar 18, 2025 • 80