Seongtae Hong

hongst

https://scholar.google.com/citations?user=6uU-QJAAAAAJ&hl=en

tate-hong-nlp

AI & ML interests

NLP

Recent Activity

liked a dataset 7 days ago

HuggingFaceFW/finetranslations

upvoted a paper 29 days ago

Attention Sink in Transformers: A Survey on Utilization, Interpretation, and Mitigation

authored a paper 29 days ago

Cross-Lingual Optimization for Language Transfer in Large Language Models

upvoted a paper 29 days ago

Attention Sink in Transformers: A Survey on Utilization, Interpretation, and Mitigation

Paper • 2604.10098 • Published Apr 11 • 81

upvoted 3 papers about 1 month ago

DMax: Aggressive Parallel Decoding for dLLMs

Paper • 2604.08302 • Published Apr 9 • 51

Beyond Hard Negatives: The Importance of Score Distribution in Knowledge Distillation for Dense Retrieval

Paper • 2604.04734 • Published Apr 6 • 12

Improving Semantic Proximity in Information Retrieval through Cross-Lingual Alignment

Paper • 2604.05684 • Published Apr 7 • 9

upvoted a collection 3 months ago

ConTEB evaluation datasets

Collection

Evaluation datasets of the ConTEB benchmark. Use "test" split where available, otherwise "validation", otherwise "train". • 8 items • Updated Jun 2, 2025 • 3

upvoted a paper 3 months ago

Diffusion-Pretrained Dense and Contextual Embeddings

Paper • 2602.11151 • Published Feb 11 • 24

upvoted an article 5 months ago

Nano-BEIR: A Multilingual Information Retrieval Benchmark with Quality-Enhanced Queries

Article • sionic-ai • Dec 22, 2025 • 10

upvoted a collection 5 months ago

🦢SWIM-IR Dataset [NAACL'24]

Collection

29 million Synthetic Wikipedia-based Multilingual Retrieval Training Pairs. • 4 items • Updated Mar 31, 2025 • 8

upvoted a collection about 1 year ago

Embedding Model Datasets

Collection

A curated subset of the datasets that work out of the box with Sentence Transformers: https://huggingface.co/datasets?other=sentence-transformers • 70 items • Updated Dec 10, 2025 • 169
