WTF GENIUS PAPERS Collection Papers that made me appreciate my major and my life a little more. obs=Observation, innov=Innovation. Most papers are abt improving tiny models. β’ 113 items β’ Updated 19 minutes ago β’ 14
River-LLM: Large Language Model Seamless Exit Based on KV Share Paper β’ 2604.18396 β’ Published 4 days ago β’ 4
MARCO: Navigating the Unseen Space of Semantic Correspondence Paper β’ 2604.18267 β’ Published 4 days ago β’ 3
RDP LoRA: Geometry-Driven Identification for Parameter-Efficient Adaptation in Large Language Models Paper β’ 2604.19321 β’ Published 3 days ago β’ 5
WTF GENIUS PAPERS Collection Papers that made me appreciate my major and my life a little more. obs=Observation, innov=Innovation. Most papers are abt improving tiny models. β’ 113 items β’ Updated 19 minutes ago β’ 14
Target-Oriented Pretraining Data Selection via Neuron-Activated Graph Paper β’ 2604.15706 β’ Published 7 days ago β’ 9
ShadowPEFT: Shadow Network for Parameter-Efficient Fine-Tuning Paper β’ 2604.19254 β’ Published 3 days ago β’ 24
WTF GENIUS PAPERS Collection Papers that made me appreciate my major and my life a little more. obs=Observation, innov=Innovation. Most papers are abt improving tiny models. β’ 113 items β’ Updated 19 minutes ago β’ 14
TEMPO: Scaling Test-time Training for Large Reasoning Models Paper β’ 2604.19295 β’ Published 3 days ago β’ 29
Self-Evolving LLM Memory Extraction Across Heterogeneous Tasks Paper β’ 2604.11610 β’ Published 11 days ago β’ 5
Convergent Evolution: How Different Language Models Learn Similar Number Representations Paper β’ 2604.20817 β’ Published 2 days ago β’ 5
SWE-chat: Coding Agent Interactions From Real Users in the Wild Paper β’ 2604.20779 β’ Published 2 days ago β’ 7 β’ 1
SWE-chat: Coding Agent Interactions From Real Users in the Wild Paper β’ 2604.20779 β’ Published 2 days ago β’ 7
WTF GENIUS PAPERS Collection Papers that made me appreciate my major and my life a little more. obs=Observation, innov=Innovation. Most papers are abt improving tiny models. β’ 113 items β’ Updated 19 minutes ago β’ 14
LayerSkip: Enabling Early Exit Inference and Self-Speculative Decoding Paper β’ 2404.16710 β’ Published Apr 25, 2024 β’ 81
HUMAN-WRITTEN & LEGALLY-SOURCED* Collection Datasets written by humans and/or reverse-engineered from text with deterministic algorithms. No illegal scraping or unethical synthesis *...mostly. β’ 168 items β’ Updated 1 day ago β’ 2