A collection of Gemma3n E4B models fine-tuned on the PIPPA and NPC-dialogue datasets, specifically intended to be embedded in a game.
Hexi Wang PRO
chimbiwide
AI & ML interests
Fine-tuning small models for Roleplaying
Gemma3NPC-it-beta
A new attempt at training Gemma3NPC, with less conservative training parameters
GemmaReLe
A collection of all the GemmaReLe models, a special version of Gemma3NPC, made specifically for ReLe
CXR DenseNet Classifiers
Collection of custom-trained DenseNet-121 classifiers for pneumonia.
Gemma3NPC-Filtered-v2
Uses the same dataset as Gemma3NPC-Filtered, but with a much higher learning rate and 3 epochs instead of 1.
GemmaThink
A collection of Gemma3-1b-it models that we post-trained with SFT and GRPO, using Google's new Tunix library, to enhance their reasoning capabilities.
- chimbiwide/gemma-3-1b-it-thinking-32k-grpo-merged-Q8_0-GGUF (1.0B)
- chimbiwide/gemma-3-1b-it-thinking-32k-sft-base-Q8_0-GGUF (Text Generation, 1.0B)
- chimbiwide/gemma-3-1b-it-thinking-32k-sft-base (Text Generation)
- chimbiwide/gemma-3-1b-it-thinking-32k-grpo-merged (Text Generation)
Gemma3NPC-1b
Collection of Gemma3NPC-1b models. A smaller model for edge use cases.
Gemma3NPC
A collection of Gemma3n E4B models fine-tuned on the PIPPA dataset, intended as a general roleplaying model.
CXR X-Ray Diffusers
Collection of Diffusers trained to generate realistic chest x-ray images with or without pneumonia.
Gemma3NPC-Filtered
A collection of Gemma3n E4B models fine-tuned on the pippa_filtered dataset, intended as a roleplaying model, though a pretty bad one.
Gemma3NPC-it
A collection of Gemma3n E4B models fine-tuned on the PIPPA and NPC-dialogue datasets, specifically intended to be embedded in a game.