Ik-hwan Kim
12kimih
0 followers · 11 following
https://github.com/12kimih
12kimih
ik-hwan-kim-083419330
AI & ML interests
Large Language Models, Reinforcement Learning, Multimodal AI, AI Agents, Mechanistic Interpretability
Recent Activity
updated a dataset 2 days ago: 12kimih/r1qa-refined-rollouts
updated a model 2 days ago: 12kimih/Qwen3-0.6B-r1qa-v1
published a model 2 days ago: 12kimih/Qwen3-0.6B-r1qa-v1
Organizations
None yet
models
7
12kimih/Qwen3-0.6B-r1qa-v1 • Text Generation • 0.6B • Updated 2 days ago • 38
12kimih/Qwen3-1.7B-r1qa-v1 • Text Generation • 2B • Updated 2 days ago • 38
12kimih/Qwen3-4B-r1qa-v1 • Text Generation • 4B • Updated 2 days ago • 56
12kimih/Qwen3-0.6B-r1qa-gpt-oss-distill • Text Generation • 0.6B • Updated 2 days ago • 39
12kimih/Qwen3-1.7B-r1qa-gpt-oss-distill • Text Generation • 2B • Updated 2 days ago • 37
12kimih/Qwen3-4B-r1qa-gpt-oss-distill • Text Generation • 4B • Updated 2 days ago • 70
12kimih/Llama-3.2-3B-HiCUPID • Updated Jun 3
datasets
8
12kimih/r1qa-refined-rollouts • Viewer • Updated 2 days ago • 633k • 116
12kimih/r1qa-2wikimultihopqa • Viewer • Updated 5 days ago • 1.08M • 22
12kimih/r1qa-musique • Viewer • Updated 5 days ago • 134k • 12
12kimih/r1qa-hotpotqa • Viewer • Updated 5 days ago • 587k • 39
12kimih/r1qa-guided-rollouts • Viewer • Updated Nov 19 • 1.08M • 880
12kimih/r1qa-clip-and-guide-using-Qwen3-8B • Viewer • Updated Sep 12 • 2.97k • 15
12kimih/r1qa-clip-with-perplexity • Viewer • Updated Sep 9 • 2.97k • 18
12kimih/HiCUPID • Viewer • Updated Jun 3 • 918k • 144