Massive Text Embedding Benchmark

non-profit

https://github.com/embeddings-benchmark

embeddings-benchmark

Activity Feed

AI & ML interests

Massive Text Embeddings Benchmark


Papers

MAEB: Massive Audio Embedding Benchmark

HUME: Measuring the Human-Model Performance Gap in Text Embedding Task


Samoed

in mteb/leaderboard about 2 hours ago

New #1 English model pending review — PR #4707

#184 opened about 3 hours ago by

JCorners

Samoed

in mteb/leaderboard about 8 hours ago

MTEB Leaderboard is not loading / throwing an error


#183 opened about 17 hours ago by

jueunkim

orionweller

updated a dataset about 14 hours ago

mteb/results

Updated about 14 hours ago • 1.37M • 15

orionweller

updated a Space 1 day ago

MTEB Leaderboard

🥇

7.4k

Embedding Leaderboard

Samoed

in mteb/results 2 days ago

Add JCorners/Ingot-8B-R3 MTEB(eng, v2) results (41 tasks)

#9 opened 2 days ago by

JCorners

KennethEnevoldsen

in mteb/results 2 days ago

Add JCorners/Ingot-8B-R3 MTEB(eng, v2) results (41 tasks)

#9 opened 2 days ago by

JCorners

tomaarsen

posted an update 2 days ago

Post

292

🤗 Announcing the Ettin Reranker family: six new state-of-the-art CrossEncoder rerankers for search from 17M to 1B parameters, plus the full training data and the ~150-line recipe. Built on the Ettin ModernBERT encoders, Apache 2.0. Details:

All six were trained with the same single-stage pointwise MSE distillation recipe, with mixedbread-ai/mxbai-rerank-large-v2 (1.54B) as the teacher. Only the learning rate and per-device batch size change between sizes. The 1B student matches the teacher within 0.0001 NDCG@10 on MTEB(eng, v2) Retrieval, the 150M is the strongest reranker I tested in the under-600M range, and the 17M beats the 33M ms-marco-MiniLM-L12-v2 by +0.051 NDCG@10 at roughly half the parameter count.
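
As a rough illustration of what "pointwise MSE distillation" means, the sketch below computes the mean squared error between student and teacher relevance scores for the same (query, document) pairs. The function name and the toy scores are hypothetical; this is not the training script from the blog post, only the core objective under the assumption that both models emit one scalar score per pair.

```python
# Illustrative sketch of a pointwise MSE distillation objective for a reranker.
# All names and numbers are hypothetical, not taken from the Ettin recipe.

def pointwise_mse_loss(student_scores, teacher_scores):
    """Mean squared error between student and teacher relevance scores.

    Each position corresponds to one (query, document) pair that both
    models scored independently, which is what makes the loss "pointwise".
    """
    if len(student_scores) != len(teacher_scores):
        raise ValueError("student and teacher must score the same pairs")
    if not student_scores:
        raise ValueError("need at least one scored pair")
    squared_errors = [
        (s - t) ** 2 for s, t in zip(student_scores, teacher_scores)
    ]
    return sum(squared_errors) / len(squared_errors)


# Toy scores for three (query, document) pairs.
teacher = [0.5, 0.0, 1.0]
student = [0.5, 0.5, 0.0]
loss = pointwise_mse_loss(student, teacher)
print(f"distillation loss: {loss:.4f}")
```

In an actual training loop the student scores would come from the model being trained and the loss would be minimized by gradient descent; the sketch only shows the quantity being minimized.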

Speed matters as much as quality for a reranker, since it determines whether the model fits the latency budget between retrieval and showing results. Our 17M is the fastest reranker in the whole comparison at 7517 pairs/sec on an H100. Our 150M runs 2.3x faster than the two other 150M ModernBERT-base rerankers (gte-reranker-modernbert-base and granite-embedding-reranker-english-r2) because the modular Transformer module propagates unpadded inputs through every layer rather than just the FA2 attention kernel. And our 1B is 2.4x faster than its 1.5B teacher while matching it on quality.
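
To give a feel for why skipping padding can matter, here is a small counting sketch. It compares how many token positions a batch occupies when every sequence is padded to the longest one versus when sequences are packed without padding. The sequence lengths are made up, and the sketch says nothing about how the Transformer module implements unpadding or about the measured speedups quoted above.

```python
# Counting sketch: token positions processed with and without padding.
# The lengths below are hypothetical example values.

def padded_token_count(lengths):
    """Positions processed when each sequence is padded to the batch maximum."""
    return len(lengths) * max(lengths) if lengths else 0


def unpadded_token_count(lengths):
    """Positions processed when sequences are packed with no padding."""
    return sum(lengths)


lengths = [12, 40, 7, 128]  # tokens per (query, document) pair in one batch
padded = padded_token_count(lengths)
unpadded = unpadded_token_count(lengths)
wasted_fraction = 1 - unpadded / padded
print(f"padded={padded}, unpadded={unpadded}, wasted={wasted_fraction:.1%}")
```

The more the lengths in a batch vary, the larger the share of positions that are pure padding, which is the work an unpadded forward pass avoids.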

I bootstrapped the training recipe with the new train-sentence-transformers Agent Skill shipped in Sentence Transformers v5.5.0. Install it with hf skills add train-sentence-transformers --claude and ask Claude Code (or Codex / Cursor / Gemini CLI) to fine-tune a SentenceTransformer, CrossEncoder, or SparseEncoder model on your data.

I wrote a blog post walking through usage, results across six embedder pairings, the speed story, and the complete training script. Check it out, or just point your Agent to the URL:

https://huggingface.co/blog/ettin-reranker

Collection: https://huggingface.co/collections/cross-encoder/ettin-rerankers

Samoed

updated a dataset 2 days ago

mteb/ucf-crime

Viewer • Updated 2 days ago • 1.9k • 31

Samoed

published a dataset 3 days ago

mteb/ucf-crime

Viewer • Updated 2 days ago • 1.9k • 31

Samoed

updated a dataset 7 days ago

mteb/SoundDescsA2TRetrieval

Updated 7 days ago • 261

Samoed

published a dataset 7 days ago

mteb/SoundDescsA2TRetrieval

Updated 7 days ago • 261

Samoed

updated a dataset 7 days ago

mteb/SoundDescsT2ARetrieval

Viewer • Updated 7 days ago • 14.8k • 563

Samoed

published a dataset 7 days ago

mteb/SoundDescsT2ARetrieval

Viewer • Updated 7 days ago • 14.8k • 563

Samoed

updated 5 datasets 7 days ago

posted an update 9 days ago

Post

362

🤖 I've just published Sentence Transformers v5.5.0, headlined by a new train-sentence-transformers Agent Skill that lets your AI coding agent (Claude Code, Codex, Cursor, Gemini CLI, ...) train and finetune embedding, reranker, and sparse encoder models for you. Plus training losses & fixes. Details:

The skill bundles curated guidance for the whole training workflow across all three model types: base model selection, loss and evaluator choice, hard-negative mining, distillation, LoRA, Matryoshka, multilingual training, static embeddings, etc. It also ships production-ready training template scripts the agent can adapt. Install it with hf skills add train-sentence-transformers, then just describe what you want, e.g. "finetune a reranker on my (question, answer) pairs, mine hard negatives, and push it to the Hub".

On the loss side: EmbedDistillLoss is a new embedding-level distillation loss for SentenceTransformer. Instead of distilling teacher scores like MarginMSELoss, it aligns the student's embeddings directly with pre-computed teacher embeddings, with an optional learnable projection for when the student and teacher dimensions differ. Second, ADRMSELoss is a new listwise learning-to-rank loss for CrossEncoder from the Rank-DistilLLM paper, aimed at the LLM-distillation reranking setting.
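
The idea of embedding-level distillation with an optional projection can be sketched in a few lines of plain Python. This is not the EmbedDistillLoss implementation or its API: the function names are hypothetical, the projection is a fixed matrix rather than a learned one, and mean squared error is an assumed distance chosen only for illustration.

```python
# Conceptual sketch of embedding-level distillation with a projection.
# Hypothetical helper names; MSE is an assumed distance, not confirmed
# to be what EmbedDistillLoss uses.

def project(vector, matrix):
    """Apply a linear map; `matrix` has one row per output dimension."""
    return [sum(w * x for w, x in zip(row, vector)) for row in matrix]


def embedding_distill_loss(student_emb, teacher_emb, projection=None):
    """MSE between a (possibly projected) student embedding and the teacher's."""
    if projection is not None:
        # Map the student embedding into the teacher's dimensionality.
        student_emb = project(student_emb, projection)
    if len(student_emb) != len(teacher_emb):
        raise ValueError("dimensions differ; a projection is required")
    return sum(
        (s - t) ** 2 for s, t in zip(student_emb, teacher_emb)
    ) / len(teacher_emb)


student = [1.0, 2.0]            # 2-dimensional student embedding
teacher = [1.0, 2.0, 3.0]       # 3-dimensional pre-computed teacher embedding
projection = [[1.0, 0.0],       # 3x2 matrix mapping student space to teacher space
              [0.0, 1.0],
              [1.0, 1.0]]
loss = embedding_distill_loss(student, teacher, projection)
print(loss)
```

In training, the projection weights would be learned jointly with the student so that the loss can be driven down even when the two embedding sizes differ.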

encode() and predict() also gained a per-call processing_kwargs override, so you can change processor settings like max_length, a vision-language model's image resolution, or a video's fps, for a single call without rebuilding the model.
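
The per-call override pattern described here can be illustrated with a toy class. `ToyEncoder` is a hypothetical stand-in and not the Sentence Transformers API; it only borrows the `processing_kwargs` parameter name and the `max_length` setting from the text above, and "encoding" is reduced to truncating whitespace tokens so the effect is easy to see.

```python
# Toy illustration of a per-call settings override that leaves the
# model's defaults untouched. ToyEncoder is hypothetical.

class ToyEncoder:
    def __init__(self, **default_processing_kwargs):
        # Defaults fixed at construction time.
        self.default_processing_kwargs = dict(default_processing_kwargs)

    def encode(self, texts, processing_kwargs=None):
        # Merge per-call overrides over the defaults without mutating them.
        settings = {**self.default_processing_kwargs, **(processing_kwargs or {})}
        max_length = settings["max_length"]
        # Stand-in for real encoding: keep at most max_length tokens.
        return [text.split()[:max_length] for text in texts]


model = ToyEncoder(max_length=4)
texts = ["one two three four five six"]

default_out = model.encode(texts)                                   # uses max_length=4
short_out = model.encode(texts, processing_kwargs={"max_length": 2})  # one-off override
after_out = model.encode(texts)                                     # defaults unchanged
```

The point of the design is that the override applies to a single call only, so the next call falls back to the settings the model was built with.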

The Agent Skill is the part of this release I'm most keen for people to try. Curious to hear how it works for you. I've been using it myself a lot to quickly set up some training runs that immediately use a bunch of best practices.

> pip install sentence-transformers==5.5.0
> hf skills add train-sentence-transformers

The full release notes: https://github.com/huggingface/sentence-transformers/releases/tag/v5.5.0