smcleish/tuo-prod-0.6b-embed-4b-instruct-cs-4-summary-mean-1024-mlp-ov0-causal-3e-5 Updated 3 days ago
smcleish/0.6b-embed-4b-instruct-cs-8-summary-mean-1024-mlp-ov0-causal-1e-5-post-train-3e-5 Updated 3 days ago
smcleish/tuo-prod-0.6b-embed-4b-instruct-cs-16-summary-mean-1024-mlp-ov0-causal-1e-5-post-train-5e-5 Updated 5 days ago
smcleish/tuo-prod-0.6b-embed-4b-instruct-cs-16-summary-mean-1024-mlp-ov0-causal-1e-5-post-train-3e-5 Updated 7 days ago
smcleish/tuo-prod-0.6b-embed-4b-instruct-cs-8-summary-mean-1024-mlp-ov0-causal-2e-5 Updated 8 days ago
smcleish/tuo-prod-0.6b-embed-4b-instruct-cs-16-summary-mean-1024-mlp-ov0-causal-1e-5-post-train-2e-5 Updated 19 days ago
smcleish/tuo-prod-0.6b-embed-4b-instruct-cs-16-summary-mean-1024-mlp-ov0-causal-1e-5 Updated 21 days ago
smcleish/deepscaler-1.5b-8k-reproduce-first-run-with-shuffle-8k-300-chkpt-step-100 Text Generation • 2B • Updated Jan 22 • 4
smcleish/deepscaler-1.5b-8k-reproduce-first-run-with-shuffle-8k-300-chkpt-step-200 Text Generation • 2B • Updated Jan 22 • 4
smcleish/deepscaler-1.5b-8k-reproduce-first-run-with-shuffle-8k-300-chkpt-step-300 Text Generation • 2B • Updated Jan 22 • 2
smcleish/deepscaler-1.5b-8k-reproduce-first-run-with-shuffle-8k-300-chkpt-step-400 Text Generation • 2B • Updated Jan 22 • 3
smcleish/deepscaler-1.5b-8k-reproduce-first-run-with-shuffle-8k-400-chkpt-step-100 Text Generation • 2B • Updated Jan 22 • 6
smcleish/deepscaler-1.5b-8k-reproduce-first-run-with-shuffle-8k-400-chkpt-step-200 Text Generation • 2B • Updated Jan 22 • 3
smcleish/deepscaler-1.5b-8k-reproduce-first-run-with-shuffle-8k-400-chkpt-step-300 Text Generation • 2B • Updated Jan 22 • 2
smcleish/deepscaler-1.5b-8k-hard-first-run-with-shuffle-8k-400-chkpt-16k-400-chkpt-step-200 Text Generation • 2B • Updated Jan 21 • 3
smcleish/deepscaler-1.5b-8k-easy-first-run-with-shuffle-8k-400-chkpt-16k-200-chkpt-step-200 Text Generation • 2B • Updated Jan 21 • 3
smcleish/deepscaler-1.5b-8k-easy-first-run-with-shuffle-8k-400-chkpt-16k-400-chkpt-step-400 Text Generation • 2B • Updated Jan 21 • 2
smcleish/deepscaler-1.5b-8k-easy-first-run-with-shuffle-8k-400-chkpt-16k-400-chkpt-step-200 Text Generation • 2B • Updated Jan 21 • 4
smcleish/deepscaler-1.5b-8k-reproduce-first-run-with-shuffle-step-400 Text Generation • 2B • Updated Jan 20 • 2
smcleish/deepscaler-1.5b-8k-reproduce-first-run-with-shuffle-step-500 Text Generation • 2B • Updated Jan 20 • 2