LeRobot Humanoid: An Open, Low-Cost, 3D-Printed Humanoid for Robot Learning VirgileBatto β’ 7 days ago β’ 48
Borealis β open data, code, weights recipe for training Audio LLM AlexWortega β’ 3 days ago β’ 14
Fine-Tuning NVIDIA Cosmos Predict 2.5 with LoRA/DoRA for Robot Video Generation nvidia β’ 10 days ago β’ 20
A Guide to Reinforcement Learning Post-Training for LLMs: PPO, DPO, GRPO, and Beyond karina-zadorozhny β’ Jan 19 β’ 25
Navigating the RLHF Landscape: From Policy Gradients to PPO, GAE, and DPO for LLM Alignment NormalUhr β’ Feb 11, 2025 β’ 124
MONET: Lowering the bar for World-Class Image Generation research. jasperai β’ about 6 hours ago β’ 3
Eight Days in China: What I Learned from the AI Labs, Robotics Startups and Academia matthew-d-white β’ 6 days ago β’ 3
Should we use genetics instead of system prompts for AI Agents & Personas? nyxia β’ 3 days ago β’ 3
We're open-sourcing "The Amazing Hand", a fully 3D printed robotic hand for less than $200 βοΈβοΈβοΈ pollen-robotics β’ Jul 8, 2025 β’ 53
LeRobot Humanoid: An Open, Low-Cost, 3D-Printed Humanoid for Robot Learning VirgileBatto β’ 7 days ago β’ 48
Borealis β open data, code, weights recipe for training Audio LLM AlexWortega β’ 3 days ago β’ 14
Fine-Tuning NVIDIA Cosmos Predict 2.5 with LoRA/DoRA for Robot Video Generation nvidia β’ 10 days ago β’ 20
A Guide to Reinforcement Learning Post-Training for LLMs: PPO, DPO, GRPO, and Beyond karina-zadorozhny β’ Jan 19 β’ 25
Navigating the RLHF Landscape: From Policy Gradients to PPO, GAE, and DPO for LLM Alignment NormalUhr β’ Feb 11, 2025 β’ 124
MONET: Lowering the bar for World-Class Image Generation research. jasperai β’ about 6 hours ago β’ 3
Eight Days in China: What I Learned from the AI Labs, Robotics Startups and Academia matthew-d-white β’ 6 days ago β’ 3
Should we use genetics instead of system prompts for AI Agents & Personas? nyxia β’ 3 days ago β’ 3
We're open-sourcing "The Amazing Hand", a fully 3D printed robotic hand for less than $200 βοΈβοΈβοΈ pollen-robotics β’ Jul 8, 2025 β’ 53