Reinforcement Learning for LLM-based Multi-Agent Systems through Orchestration Traces Paper • 2605.02801 • Published 2 days ago • 2
HeavySkill: Heavy Thinking as the Inner Skill in Agentic Harness Paper • 2605.02396 • Published 2 days ago • 6
Beyond SFT-to-RL: Pre-alignment via Black-Box On-Policy Distillation for Multimodal RL Paper • 2604.28123 • Published 5 days ago • 22
google/gemma-4-26B-A4B-it-assistant Any-to-Any • 0.4B • Updated about 14 hours ago • 470 • 44
Post: I made a visualizer for Hugging Face models: https://hfviewer.com Simply paste a Hugging Face URL to get an interactive visualization of the architecture! The recent Qwen3.6-27B model as an example: https://hfviewer.com/Qwen/Qwen3.6-27B Feel free to try it out and give me feedback on how it can be improved!
ReAct: Synergizing Reasoning and Acting in Language Models Paper • 2210.03629 • Published Oct 6, 2022 • 35
Let ViT Speak: Generative Language-Image Pre-training Paper • 2605.00809 • Published 5 days ago • 20
End-to-End Autoregressive Image Generation with 1D Semantic Tokenizer Paper • 2605.00503 • Published 5 days ago • 5