The Personalization Trap: How User Memory Alters Emotional Reasoning in LLMs Paper • 2510.09905 • Published Oct 10 • 6
SATA-Bench Collection SATA-Bench is a multi-domain benchmark designed for 'Select-all-that-apply' questions. • 4 items • Updated Jun 3 • 2
Synthesizing Conversations from Unlabeled Documents using Automatic Response Segmentation Paper • 2406.03703 • Published Jun 6, 2024 • 2
SATA-BENCH: Select All That Apply Benchmark for Multiple Choice Questions Paper • 2506.00643 • Published May 31 • 6
FalseReject: A Resource for Improving Contextual Safety and Mitigating Over-Refusals in LLMs via Structured Reasoning Paper • 2505.08054 • Published May 12 • 3
DeTiME: Diffusion-Enhanced Topic Modeling using Encoder-decoder based LLM Paper • 2310.15296 • Published Oct 23, 2023 • 3