Hugging Face
Enrico Shippole
conceptofmind
160 followers · 3 following
https://www.teraflopai.com/
EnricoShippole
conceptofmind
AI & ML interests
None yet
Recent Activity
reacted to tomaarsen's post with 🔥 1 day ago
🤗 Announcing the Ettin Reranker family: six new state-of-the-art CrossEncoder rerankers for search from 17M to 1B parameters, plus the full training data and the ~150-line recipe. Built on the Ettin ModernBERT encoders, Apache 2.0.

Details:

All six were trained with the same single-stage pointwise MSE distillation recipe, with mixedbread-ai/mxbai-rerank-large-v2 (1.54B) as the teacher. Only the learning rate and per-device batch size change between sizes. The 1B student matches the teacher within 0.0001 NDCG@10 on MTEB(eng, v2) Retrieval, the 150M is the strongest reranker I tested in the under-600M range, and the 17M beats the 33M ms-marco-MiniLM-L12-v2 by +0.051 NDCG@10 at roughly half the parameter count.

Speed matters as much as quality for a reranker, since it determines whether the model fits the latency budget between retrieval and showing results. Our 17M is the fastest reranker in the whole comparison at 7517 pairs/sec on an H100. Our 150M runs 2.3x faster than the two other 150M ModernBERT-base rerankers (gte-reranker-modernbert-base and granite-embedding-reranker-english-r2) because the modular Transformer module propagates unpadded inputs through every layer rather than just the FA2 attention kernel. And our 1B is 2.4x faster than its 1.5B teacher while matching it on quality.

I bootstrapped the training recipe with the new train-sentence-transformers Agent Skill shipped in Sentence Transformers v5.5.0. Install it with `hf skills add train-sentence-transformers --claude` and ask Claude Code (or Codex / Cursor / Gemini CLI) to fine-tune a SentenceTransformer, CrossEncoder, or SparseEncoder model on your data.

I wrote a blog post walking through usage, results across six embedder pairings, the speed story, and the complete training script. Check it out, or just point your Agent to the URL: https://huggingface.co/blog/ettin-reranker

Collection: https://huggingface.co/collections/cross-encoder/ettin-rerankers
updated a dataset 1 day ago
TeraflopAI/caselaw-evaluation
published a dataset 1 day ago
TeraflopAI/caselaw-evaluation
conceptofmind's datasets (27)
Sort: Recently updated
conceptofmind/test_sample • Updated Aug 18, 2025 • 3
conceptofmind/minnesota-caselaw • Viewer • Updated Mar 13, 2025 • 249k • 2
conceptofmind/test-minn • Viewer • Updated Mar 7, 2025 • 249k • 2
conceptofmind/joined_minn • Preview • Updated Mar 6, 2025 • 2
conceptofmind/minn-data • Viewer • Updated Mar 6, 2025 • 8.63k • 2
conceptofmind/minn-annotations • Viewer • Updated Mar 6, 2025 • 8.63k • 2
conceptofmind/koala-partitions • Viewer • Updated Feb 5, 2025 • 36.1M • 2
conceptofmind/smithsonian-batch-2 • Viewer • Updated Jan 17, 2025 • 10.8M • 3 • 1
conceptofmind/smithsonian-batch-1 • Viewer • Updated Jan 13, 2025 • 5.94M • 3
conceptofmind/smithsonian-batch-1-old • Viewer • Updated Jan 11, 2025 • 3.5M • 3
conceptofmind/wikicommons-cc-pd-mark • Updated Jan 7, 2025 • 3
conceptofmind/daft-video-audio • Updated Sep 7, 2024 • 8
conceptofmind/minipile_3plus_queries • Viewer • Updated Sep 5, 2024 • 15.4k • 2
conceptofmind/test_batch_oa • Viewer • Updated Sep 4, 2024 • 16k • 9
conceptofmind/test_batch • Viewer • Updated Sep 4, 2024 • 35k • 10
conceptofmind/sali_tags • Viewer • Updated Sep 1, 2024 • 17.2k • 8
conceptofmind/CAP • Preview • Updated Jul 7, 2024 • 3 • 1
conceptofmind/yt-pleais-2 • Viewer • Updated Jun 21, 2024 • 4.49M • 3
conceptofmind/test_arxiv • Viewer • Updated May 27, 2024 • 1.28k • 3
conceptofmind/test_merge • Viewer • Updated May 10, 2024 • 20k • 2
conceptofmind/test3 • Viewer • Updated Apr 18, 2024 • 100k • 3
conceptofmind/test2 • Viewer • Updated Apr 12, 2024 • 100k • 3
conceptofmind/test • Preview • Updated Apr 12, 2024 • 3
conceptofmind/my_dataset • Viewer • Updated Apr 8, 2024 • 1.62k • 2
conceptofmind/100k-no-markdown • Viewer • Updated Aug 26, 2023 • 203 • 3 • 2
conceptofmind/edgar_text_10k • Viewer • Updated Aug 13, 2023 • 220k • 12
conceptofmind/r_stack_clean • Viewer • Updated Aug 6, 2023 • 27.9k • 2 • 1