📊 Add SWE-bench evaluation results for princeton-nlp/SWE-bench_Verified

#1
No description provided.

dummy -- test_commit.
Closing this PR.

prithivMLmods changed pull request status to closed

Hey! Just in case you haven't seen it: we revamped how to add results to models, it uses a YAML file now 🤗

https://huggingface.co/docs/hub/en/eval-results#benchmark-datasets
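For context, the docs describe embedding eval results in the model card's `model-index` metadata. A minimal sketch of what that YAML might look like for a SWE-bench Verified run (the model name and metric value here are hypothetical placeholders, not real results):

```yaml
model-index:
- name: my-model  # hypothetical model name
  results:
  - task:
      type: text-generation
    dataset:
      name: SWE-bench_Verified
      type: princeton-nlp/SWE-bench_Verified
    metrics:
    - type: pass@1
      value: 0.0  # placeholder value, replace with the actual score
      name: resolved rate
```

This block goes in the YAML front matter of the model's README.md; the Hub then renders the results on the model page.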

Hi @SaylorTwift, thanks for the link!
I've read the docs, by the way.

I just checked the results formatter for Transformers inference, particularly the framework side.
So I'll follow the .yaml format if I merge the evals.

Thank you!

@SaylorTwift

Haha,
I've just completed an experimental template for fully evaluating a language model using Transformers.

sandbox

And I just saw that you have a blog post about it: benchmarking-on-the-hub.
Awesome!

Ahah, hope you'll find it useful :)
