Omkar Pangarkar

omkarenator

AI & ML interests

None yet

Recent Activity

Organizations

New activity in LLM360/TxT360 17 days ago

Will the code/scripts be released?

#10 opened over 1 year ago by Leon-Leee

upvoted an article 6 months ago

Article

Mixture of Experts Explained

osanseviero, lewtun, philschmid, smangrul, ybelkada, pcuenq • Dec 11, 2023 • 1.13k

upvoted a collection 6 months ago

🤖 Agents

Collection

21 items • Updated Dec 31, 2024 • 173

upvoted an article 6 months ago

Article

SmolLM3: smol, multilingual, long-context reasoner

eliebak, cmpatino, anton-l, edbeeching, m-ric, nouamanetazi, akseljoonas, guipenedo, hynky, clefourrier, SaylorTwift, kashif, qgallouedec, hlarcher, glutamatt, Xenova, reach-vb, ngxson, craffel, lewtun, loubnabnl, lvwerra, thomwolf • Jul 8, 2025 • 775

liked a Space 7 months ago

The Smol Training Playbook

📚

3.18k

The secrets to building world-class LLMs

liked a dataset 7 months ago

bigcode/the-stack-github-issues

Viewer • Updated Mar 20, 2023 • 31M • 559 • 48

upvoted a paper 7 months ago

StarCoder 2 and The Stack v2: The Next Generation

Paper • 2402.19173 • Published Feb 29, 2024 • 156

upvoted a collection 8 months ago

The Ultimate Collection of Code Classifiers

Collection

🔥 15 classifiers, 124M parameters, one per programming language— for assessing the educational value of GitHub code • 15 items • Updated May 5, 2025 • 16

upvoted a paper 9 months ago

Essential-Web v1.0: 24T tokens of organized web data

Paper • 2506.14111 • Published Jun 17, 2025 • 46

upvoted an article 10 months ago

Article

nanoJAXGPT: A pedagogical introduction to JAX/Equinox

sachithgunasekara • Oct 23, 2024 • 7

liked a Space about 1 year ago

Predict Memory

🧮

108

Calculate and visualize memory usage for model training

upvoted a paper about 1 year ago

CLIMB: CLustering-based Iterative Data Mixture Bootstrapping for Language Model Pre-training

Paper • 2504.13161 • Published Apr 17, 2025 • 98

liked a dataset about 1 year ago

WebOrganizer/Corpus-200B

Preview • Updated Feb 19, 2025 • 5.16k • 11

liked a Space about 1 year ago

TxT360: Trillion Extracted Text

📖

134

Explore the TxT360 LLM pre‑training dataset

liked a model about 1 year ago

mlfoundations/fasttext-oh-eli5

Updated Aug 1, 2024 • 30

liked a Space about 1 year ago

The Ultra-Scale Playbook

🌌

3.85k

The ultimate guide to training LLM on large GPU Clusters

New activity in LLM360/TxT360 over 1 year ago

fix-deps

#7 opened over 1 year ago by omkarenator

updated a Space over 1 year ago

TxT360: Trillion Extracted Text

📖

134

Explore the TxT360 LLM pre‑training dataset

New activity in LLM360/TxT360 over 1 year ago

code-formatting

#6 opened over 1 year ago by omkarenator

liked a Space over 1 year ago

Scaling FineWeb to 1000+ languages: Step 1: finding signal in 100s of evaluation tasks

📝

Evaluate multilingual models using FineTasks
