AI & ML interests: None yet
Organizations: None yet
models (32)
fh1628/MNLP_M3_dpo_model_test • Text Generation • 0.6B • Updated • 1
fh1628/MNLP_M3_open_model_test • Text Generation • 0.6B • Updated • 1
fh1628/MNLP_M3_dpo_model2_v3 • Text Generation • 0.6B • Updated • 1
fh1628/MNLP_M3_dpo_model2_v2 • Text Generation • 0.6B • Updated
fh1628/MNLP_M3_dpo_model1_v3 • Text Generation • 0.6B • Updated
fh1628/MNLP_M3_dpo_model1_v2 • Text Generation • 0.6B • Updated
Text Generation • 0.6B • Updated • 1
fh1628/base-qwen-dpo-random-epfl • Text Generation • 0.6B • Updated • 1
fh1628/base-qwen-dpo-filtered-epfl-with-eval • Text Generation • 0.6B • Updated • 1
fh1628/base-qwen-dpo-filtered-epfl • Text Generation • 0.6B • Updated • 3
datasets (31)
fh1628/GPT_judgement_base_vs_sft_dpo • Viewer • Updated • 48 • 2
fh1628/GPT_judgement_sft_dpo_vs_base_dpo • Viewer • Updated • 50 • 5
fh1628/m1_data_generated_answers • Viewer • Updated • 50 • 3
fh1628/GPT_judgement_sft_dpo • Viewer • Updated • 50 • 10
fh1628/GPT_judgement_base_vs_sft • Viewer • Updated • 42 • 3
Viewer • Updated • 50 • 47
Viewer • Updated • 3
fh1628/GPT_judgement_base_vs_dpo • Viewer • Updated • 47 • 5
Viewer • Updated • 167k • 6
fh1628/MNLP_M3_dpo_dataset • Viewer • Updated • 46.7k • 7