Pratyay Banerjee's picture

In a Training Loop 🔄

Pratyay Banerjee

Neilblaze

·

https://neilblaze.live

AI & ML interests

HCI, Computer Vision, Object Detection, Pattern Recognition, NLP, Supervised Learning

Recent Activity

upvoted an article about 12 hours ago

Introducing North Mini Code: Cohere’s First Model For Developers

liked a model about 12 hours ago

CohereLabs/North-Mini-Code-1.0-fp8

liked a dataset about 17 hours ago

nanotron/ultrascale-playbook-data

View all activity

Organizations

upvoted an article about 12 hours ago

Article

Introducing North Mini Code: Cohere’s First Model For Developers

CohereLabs

•

1 day ago

• 47

upvoted a collection 2 days ago

Deepseek Papers

Deepseek papers collection • 31 items • Updated 3 days ago • 350

upvoted a collection 4 days ago

Laguna XS.2

Designed for agentic coding and long-horizon work on a local machine. Apache 2.0. • 5 items • Updated May 7 • 25

upvoted 4 collections 5 days ago

Gemma 4 QAT

Gemma 4 QAT (Quantization-Aware Training) for 3x less memory use and near original accuracy. • 16 items • Updated 5 days ago • 78

Gemma 4 QAT Q4_0

19 items • Updated 5 days ago • 106

Gemma 4 QAT Mobile

4 items • Updated 5 days ago • 31

Bonsai Image

6 items • Updated 6 days ago • 85

upvoted 13 papers 6 days ago

MemTrain: Self-Supervised Context Memory Training

Paper • 2606.03197 • Published 9 days ago • 17

Joint Agent Memory and Exploration Learning via Novelty Signals

Paper • 2606.01528 • Published 10 days ago • 15

Skill is Not One-Size-Fits-All: Model-Aware Skill Alignment for LLM Agents

Paper • 2605.30723 • Published 13 days ago • 16

Self-Distilled Policy Gradient

Paper • 2606.04036 • Published 9 days ago • 24

When Does Multi-Agent RL Improve LLM Workflows? Workflow, Scale, and Policy-Sharing Tradeoffs

Paper • 2605.24202 • Published 20 days ago • 17

Harness Updating Is Not Harness Benefit: Disentangling Evolution Capabilities in Self-Evolving LLM Agents

Paper • 2605.30621 • Published 14 days ago • 22

Streaming Communication in Multi-Agent Reasoning

Paper • 2606.05158 • Published 8 days ago • 29

AutoMedBench: Towards Medical AutoResearch with Agentic AI Models

Paper • 2606.01961 • Published 10 days ago • 27

SkillAdaptor: Self-Adapting Skills for LLM Agents from Trajectories

Paper • 2606.01311 • Published 11 days ago • 35

NITP: Next Implicit Token Prediction for LLM Pre-training

Paper • 2605.24956 • Published 18 days ago • 35

Task-Focused Memorization for Multimodal Agents

Paper • 2605.31075 • Published 13 days ago • 38

dMoE: dLLMs with Learnable Block Experts

Paper • 2605.30876 • Published 13 days ago • 36

Harness-1: Reinforcement Learning for Search Agents with State-Externalizing Harnesses

Paper • 2606.02373 • Published 10 days ago • 51