-
TokDrift: When LLM Speaks in Subwords but Code Speaks in Grammar
Paper • 2510.14972 • Published • 35 -
LightMem: Lightweight and Efficient Memory-Augmented Generation
Paper • 2510.18866 • Published • 115 -
Every Attention Matters: An Efficient Hybrid Architecture for Long-Context Reasoning
Paper • 2510.19338 • Published • 117 -
The Smol Training Playbook
📚3.11kThe secrets to building world-class LLMs
Jonatan Borkowski
j14i
AI & ML interests
None yet
Recent Activity
liked a model 1 day ago
HauhauCS/Qwen3.6-35B-A3B-Uncensored-HauhauCS-Aggressive upvoted a paper 2 days ago
Dive into Claude Code: The Design Space of Today's and Future AI Agent Systems upvoted an article 16 days ago
Welcome Gemma 4: Frontier multimodal intelligence on device