Benjamin Minixhofer
benjamin
40 followers · 8 following
https://github.com/bminixhofer
bminixhofer
AI & ML interests
NLP, Efficiency, Machine Learning in Rust, Multilinguality, Transfer Learning
benjamin's activity
updated
a model
21 days ago
benjamin/dolma2-tokenizer_superbpe_olmo2_p99_truncate_10G__extend_400K
Updated
21 days ago
published
a model
21 days ago
benjamin/dolma2-tokenizer_superbpe_olmo2_p99_truncate_10G__extend_400K
Updated
21 days ago
updated
a model
21 days ago
benjamin/dolma2-tokenizer_superbpe_olmo2_p99_truncate_10G__extend_200K
Updated
21 days ago
published
a model
21 days ago
benjamin/dolma2-tokenizer_superbpe_olmo2_p99_truncate_10G__extend_200K
Updated
21 days ago
updated
a dataset
27 days ago
benjamin/SeaExam-formatted
Viewer
•
Updated
27 days ago
•
9.73k
•
39
published
a dataset
27 days ago
benjamin/SeaExam-formatted
Viewer
•
Updated
27 days ago
•
9.73k
•
39
updated
a dataset
3 months ago
benjamin/execute
Viewer
•
Updated
Sep 18
•
172k
•
99
published
a dataset
3 months ago
benjamin/execute
Viewer
•
Updated
Sep 18
•
172k
•
99
published
2 models
3 months ago
benjamin/Llama-3.2-1B-Instruct-flax
Text Generation
•
Updated
Nov 13, 2024
•
46
benjamin/Llama-3.2-1B-flax
Text Generation
•
Updated
Nov 18, 2024
•
1.53k
liked
a model
4 months ago
allenai/dolma2-tokenizer
Updated
Jul 13, 2024
•
4
updated
a model
5 months ago
benjamin/Qwen3-14B-flax
Text Generation
•
Updated
Jul 27
•
7
published
a model
5 months ago
benjamin/Qwen3-14B-flax
Text Generation
•
Updated
Jul 27
•
7
updated
a model
5 months ago
benjamin/gemma-3-1b-it-flax
Text Generation
•
Updated
Jul 25
•
16
published
a model
5 months ago
benjamin/gemma-3-1b-it-flax
Text Generation
•
Updated
Jul 25
•
16
updated
a model
5 months ago
benjamin/gemma-3-12b-it-flax
Text Generation
•
Updated
Jul 25
•
5
published
a model
5 months ago
benjamin/gemma-3-12b-it-flax
Text Generation
•
Updated
Jul 25
•
5
published
a model
6 months ago
benjamin/gemma-3-1b-pt-flax
Text Generation
•
Updated
May 27
•
18
upvoted
a paper
6 months ago
Inference-Time Hyper-Scaling with KV Cache Compression
Paper
•
2506.05345
•
Published
Jun 5
•
27
published
a model
6 months ago
benjamin/Qwen3-4B-Base-flax
Text Generation
•
Updated
May 27
•
9
•
1