TurkishCodeMan/Nanbeige4.1-3B-Gmail-Tool-Use-Datasets
Viewer • Updated • 1.7k • 633
Fine-tuned version of Nanbeige/Nanbeige4.1-3B for Gmail tool-calling tasks using a two-stage training pipeline.
A hyper-aligned 3B parameter agent matching GPT-4o-mini performance inside LangGraph.
Training datasets: TurkishCodeMan/Nanbeige4.1-3B-Gmail-Tool-Use-Datasets
sft/traces_chatml_clean.jsonl)dpo/dpo_dataset.jsonl) — 3 rejection strategies:wrong_tool — incorrect tool selected (~34%)missing_args — required arguments omitted (~32%)bad_answer — poor final response (~34%)ref_model=None (PEFT implicit ref)| Tool | Description |
|---|---|
search_emails |
Search Gmail inbox with filters |
read_email |
Read full email content by ID |
send_email |
Send a new email |
draft_email |
Create a draft |
modify_email |
Add/remove labels, mark read/unread |
download_attachment |
Download email attachment |
from transformers import AutoTokenizer, AutoModelForCausalLM
import torch
model = AutoModelForCausalLM.from_pretrained(
"TurkishCodeMan/Nanbeige4.1-3B-Gmail-Tool-Use",
torch_dtype=torch.bfloat16,
trust_remote_code=True,
)
tokenizer = AutoTokenizer.from_pretrained(
"TurkishCodeMan/Nanbeige4.1-3B-Gmail-Tool-Use",
trust_remote_code=True,
)
| Parameter | Value |
|---|---|
| Base model | Nanbeige/Nanbeige4.1-3B |
| SFT LoRA rank | 16 |
| DPO LoRA rank | 16 |
| DPO β | 0.1 |
| Max length | 2682 tokens |
| GPU | 1× RTX 4090 24GB |
| Framework | TRL 0.22 · Transformers 4.57 · PEFT 0.18 |