In a Training Loop 🔄

Yuichi Tateno PRO

hotchpotch

https://secon.dev/

AI & ML interests

Information Retrieval with LLMs

Recent Activity

updated a model about 9 hours ago

hotchpotch/japanese-reranker-base-v2

updated a model about 9 hours ago

hotchpotch/japanese-reranker-small-v2

updated a model about 9 hours ago

hotchpotch/japanese-reranker-xsmall-v2

View all activity

Organizations

upvoted an article 12 days ago

Article

Nano-BEIR: A Multilingual Information Retrieval Benchmark with Quality-Enhanced Queries

sionic-ai

•

Dec 22, 2025

• 10

upvoted an article 25 days ago

Article

Training and Finetuning Multimodal Embedding & Reranker Models with Sentence Transformers

tomaarsen

•

26 days ago

• 70

upvoted an article about 1 month ago

Article

Multimodal Embedding & Reranker Models with Sentence Transformers

tomaarsen

•

Apr 9

• 58

upvoted an article 2 months ago

Article

Introducing Storage Buckets on the Hugging Face Hub

Wauplin, coyotte508, XciD, victor, julien-c, lhoestq, pierric, Sylvestre, hlarcher, rajatarya, seanses, assafvayner

•

Mar 10

• 194

upvoted a paper 3 months ago

Diffusion-Pretrained Dense and Contextual Embeddings

Paper • 2602.11151 • Published Feb 11 • 24

upvoted 2 collections 3 months ago

ColBERT-Zero 🐶

Collection

First large-scale fully pre-trained ColBERT model using only public data, outperforming GTE-ModernColBERT and GTE-ModernBERT • 10 items • Updated Apr 7 • 21

Bharat-NanoBEIR: Indian Language Retrieval Benchmarks

Collection

NanoBEIR retrieval benchmarks translated into 22 Indian languages across 13 datasets. • 22 items • Updated Dec 13, 2025 • 5

upvoted an article 3 months ago

Article

Transformers.js v4: Now Available on NPM!

Xenova, nico-martin

•

Feb 9

• 94

upvoted a collection 3 months ago

CoRNStack

Collection

State-of-the-art code retrieval and re-ranking models and datasets • 9 items • Updated Mar 26, 2025 • 21

upvoted an article 4 months ago

Article

ModernVBERT: Towards Smaller Visual Document Retrievers

paultltc

•

Oct 3, 2025

• 45

upvoted 2 collections 5 months ago

NanoBEIR datasets

Collection

These datasets are compatible with the (Sparse)NanoBEIREvaluator with Sentence Transformers v5.2+. Also CrossEncoderNanoBEIREvaluator if bm25 column • 16 items • Updated Mar 2 • 17

Embedding Model Datasets

Collection

A curated subset of the datasets that work out of the box with Sentence Transformers: https://huggingface.co/datasets?other=sentence-transformers • 70 items • Updated Dec 10, 2025 • 168

upvoted an article 5 months ago

Article

Granite 4.0 Nano: Just how small can you go?

ibm-granite

•

Oct 28, 2025

• 124

upvoted an article 6 months ago

Article

Streaming datasets: 100x More Efficient

andito, lhoestq, burtenshaw, pcuenq, merve

•

Oct 27, 2025

• 86

upvoted 4 articles 7 months ago

Article

Provence: efficient and robust context pruning for retrieval-augmented generation

nadiinchi

•

Jan 28, 2025

• 26

Article

huggingface_hub v1.0: Five Years of Building the Foundation of Open Machine Learning

Wauplin, celinah, lysandre, julien-c

•

Oct 27, 2025

• 75

Article

Sentence Transformers is joining Hugging Face!

tomaarsen

•

Oct 22, 2025

• 88

Article

Introducing RTEB: A New Standard for Retrieval Evaluation

fzliu, KennethEnevoldsen, Samoed, isaacchung, tomaarsen, fzoll

•

Oct 1, 2025

• 143

upvoted 2 articles 8 months ago

Article

Nemotron-Personas-Japan: Synthesized Data for Sovereign AI

nvidia

•

Sep 23, 2025

• 27

Article

mmBERT: ModernBERT goes Multilingual

mmarone, orionweller, will-fleshman, eugene-yang, dlawrie, vandurme

•

Sep 9, 2025

• 146

Yuichi Tateno PRO

AI & ML interests

Recent Activity

Organizations

hotchpotch's activity

Nano-BEIR: A Multilingual Information Retrieval Benchmark with Quality-Enhanced Queries

Training and Finetuning Multimodal Embedding & Reranker Models with Sentence Transformers

Multimodal Embedding & Reranker Models with Sentence Transformers

Introducing Storage Buckets on the Hugging Face Hub

Transformers.js v4: Now Available on NPM!

ModernVBERT: Towards Smaller Visual Document Retrievers

Granite 4.0 Nano: Just how small can you go?

Streaming datasets: 100x More Efficient

Provence: efficient and robust context pruning for retrieval-augmented generation

huggingface_hub v1.0: Five Years of Building the Foundation of Open Machine Learning

Sentence Transformers is joining Hugging Face!

Introducing RTEB: A New Standard for Retrieval Evaluation

Nemotron-Personas-Japan: Synthesized Data for Sovereign AI

mmBERT: ModernBERT goes Multilingual