Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
20
9
51
DongJae Shin
ShinDJ
Follow
faceradix's profile picture
huijelee's profile picture
kreamsoup's profile picture
17 followers
·
28 following
faizman31
dongjae-shin-967450238
AI & ML interests
NLP, LLM, Vision-Langauge Model
Recent Activity
upvoted
an
article
10 days ago
We Got Claude to Fine-Tune an Open Source LLM
reacted
to
sergiopaniego
's
post
with 🔥
10 days ago
NEW: @mistralai released a fantastic family of multimodal models, Ministral 3. You can fine-tune them for free on Colab using TRL ⚡️, supporting both SFT and GRPO Link to the notebooks: - SFT: https://colab.research.google.com/github/huggingface/trl/blob/main/examples/notebooks/sft_ministral3_vl.ipynb - GRPO: https://colab.research.google.com/github/huggingface/trl/blob/main/examples/notebooks/grpo_ministral3_vl.ipynb - TRL and more examples: https://huggingface.co/docs/trl/index
reacted
to
sergiopaniego
's
post
with 👍
20 days ago
Interested in RL training environments? We just released a beginner-friendly walkthrough notebook! Train a model to play Wordle using TRL + OpenEnv (TextArena) + GRPO + vLLM. happy learning! 🌱 Notebook: https://github.com/huggingface/trl/blob/main/examples/notebooks/openenv_wordle_grpo.ipynb OpenEnv guide in TRL: https://huggingface.co/docs/trl/main/en/openenv
View all activity
Organizations
ShinDJ
's datasets
1
Sort: Recently updated
ShinDJ/bllossom_vision_stage2_datasets
Viewer
•
Updated
Sep 28
•
707k
•
10