Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
Dimitri von Rütte's picture
24 4

Dimitri von Rütte

dvruette
gentlebowl's profile picture Awillia91's profile picture 21world's profile picture
·
  • dvruette
  • dvruette

AI & ML interests

None yet

Recent Activity

updated a collection about 2 months ago
OpenWebText BPE
updated a collection about 2 months ago
OpenWebText BPE
updated a collection about 2 months ago
OpenWebText BPE
View all activity

Organizations

OpenAssistant's profile picture ETH Zurich's profile picture

updated 2 collections about 2 months ago

OpenWebText BPE

Collection
BPE tokenizers with vocab sizes between 1k and 131k trained on OpenWebText, as well as the pre-tokenized dataset for each of them. • 16 items • Updated Jan 19

Generalized Interpolating Discrete Diffusion

Collection
7 items • Updated Jan 19 • 4
updated a dataset 3 months ago

dvruette/openwebtext-tokenized-131k

Viewer • Updated Dec 19, 2025 • 8.01M • 148
published a dataset 3 months ago

dvruette/openwebtext-tokenized-131k

Viewer • Updated Dec 19, 2025 • 8.01M • 148
updated a dataset 3 months ago

dvruette/openwebtext-tokenized-66k

Viewer • Updated Dec 19, 2025 • 8.01M • 251
Company
TOS Privacy About Careers
Website
Models Datasets Spaces Pricing Docs