Qwen3 checkpoints fine-tuned to a 128k context length using the OctoLong recipe
OctoLong
community
models 14
OctoLong/Qwen3-14B-Base-Extended
Text Generation • 15B • Updated
OctoLong/Qwen3-8B-Base-Extended
Text Generation • 8B • Updated • 23
OctoLong/Qwen3-4B-Base-Extended
Text Generation • 4B • Updated • 20
OctoLong/Qwen3-1.7B-Base-Extended
Text Generation • 2B • Updated • 21
OctoLong/Qwen3-0.6B-Base-Extended
Text Generation • 0.6B • Updated • 24
OctoLong/Qwen3-8B-Base-Merged
Text Generation • 8B • Updated • 32
OctoLong/Qwen3-4B-Base-Merged
Text Generation • 4B • Updated • 52
OctoLong/Qwen3-1.7B-Base-Merged
Text Generation • 2B • Updated • 26
OctoLong/Qwen3-14B-Base
Text Generation • 15B • Updated • 63
OctoLong/Qwen3-8B-Base
Text Generation • 8B • Updated • 96 • 1
datasets 14
OctoLong/OctoLong-SFT
Viewer • Updated • 3.65M • 45
OctoLong/OctoLong-SFT-Swift
Viewer • Updated • 3.65M • 63
OctoLong/OctoLong-LCFT
Viewer • Updated • 19.4M • 119
OctoLong/temp-collection-meta-complete-64
Viewer • Updated • 68.1k • 3
OctoLong/temp-collection-raw-complete-64
Viewer • Updated • 68.7k • 3
OctoLong/temp-collection-mix-64
Viewer • Updated • 22.8k • 3
OctoLong/temp-collection-meta-64
Viewer • Updated • 22.7k • 3
OctoLong/temp-collection-meta-32
Viewer • Updated • 22.7k • 3
OctoLong/temp-collection-meta-16
Viewer • Updated • 22.7k • 3
OctoLong/temp-collection-raw-64
Viewer • Updated • 22.9k • 1