16 7 204

Sourab Mangrulkar

smangrul

https://www.linkedin.com/in/sourab-m/

pacman100

AI & ML interests

Machine Learning, Deep Learning, Natural Language Processing, Natural Language Generation, Computer Vision, Reinforcement Learning

Recent Activity

upvoted a collection 2 months ago

NVIDIA Nemotron v3

liked a model 3 months ago

Qwen/Qwen3.5-2B

liked a model 3 months ago

Qwen/Qwen3-VL-2B-Instruct

View all activity

Organizations

published an article about 2 years ago

Article

GaLore: Advancing Large Model Training on Consumer-grade Hardware

Titus-von-Koeller, jiaweizhao, mdouglas, hiyouga, ybelkada, muellerzr, amyeroberts, smangrul, BenjaminB

•

Mar 20, 2024

• 32

published an article over 2 years ago

Article

🤗 PEFT welcomes new merging methods

smangrul, sayakpaul

•

Feb 19, 2024

• 30

published an article over 2 years ago

Article

Mixture of Experts Explained

osanseviero, lewtun, philschmid, smangrul, ybelkada, pcuenq

•

Dec 11, 2023

• 1.13k

published an article over 2 years ago

Article

Personal Copilot: Train Your Own Coding Assistant

smangrul, sayakpaul

•

Oct 27, 2023

• 79

published an article over 2 years ago

Article

Fine-tuning Llama 2 70B using PyTorch FSDP

smangrul, sgugger, lewtun, philschmid

•

Sep 13, 2023

• 32

published an article almost 3 years ago

Article

The Falcon has landed in the Hugging Face ecosystem

lvwerra, ybelkada, smangrul, lewtun, olivierdehaene, pcuenq, philschmid, osanseviero

•

Jun 5, 2023

• 17

published an article almost 3 years ago

Article

Making LLMs even more accessible with bitsandbytes, 4-bit quantization and QLoRA

ybelkada, timdettmers, artidoro, sgugger, smangrul

•

May 24, 2023

• 180

published an article about 3 years ago

Article

Fine-tuning 20B LLMs with RLHF on a 24GB consumer GPU

edbeeching, ybelkada, lvwerra, smangrul, lewtun, kashif

•

Mar 9, 2023

• 72

published an article over 3 years ago

Article

Parameter-Efficient Fine-Tuning using 🤗 PEFT

smangrul, sayakpaul

•

Feb 10, 2023

• 119

published an article over 3 years ago

Article

Parameter-Efficient Fine-Tuning using 🤗 PEFT

smangrul, sayakpaul

•

Feb 10, 2023

• 119

published an article almost 4 years ago

Article

Accelerate Large Model Training using DeepSpeed

smangrul, sgugger

•

Jun 28, 2022

• 7

published an article about 4 years ago

Article

Accelerate Large Model Training using PyTorch Fully Sharded Data Parallel

smangrul, sgugger

•

May 2, 2022

• 9

published an article about 4 years ago

Article

Accelerate Large Model Training using PyTorch Fully Sharded Data Parallel

smangrul, sgugger

•

May 2, 2022

• 9

Sourab Mangrulkar

AI & ML interests

Recent Activity

Organizations

smangrul's activity

GaLore: Advancing Large Model Training on Consumer-grade Hardware

🤗 PEFT welcomes new merging methods

Mixture of Experts Explained

Personal Copilot: Train Your Own Coding Assistant

Fine-tuning Llama 2 70B using PyTorch FSDP

The Falcon has landed in the Hugging Face ecosystem

Making LLMs even more accessible with bitsandbytes, 4-bit quantization and QLoRA

Fine-tuning 20B LLMs with RLHF on a 24GB consumer GPU

Parameter-Efficient Fine-Tuning using 🤗 PEFT

Parameter-Efficient Fine-Tuning using 🤗 PEFT

Accelerate Large Model Training using DeepSpeed

Accelerate Large Model Training using PyTorch Fully Sharded Data Parallel

Accelerate Large Model Training using PyTorch Fully Sharded Data Parallel