Jackmin108 commited on
Commit
7df9506
Β·
verified Β·
1 Parent(s): bf71a86

Add header section

Browse files
Files changed (1) hide show
  1. README.md +18 -0
README.md CHANGED
@@ -4,6 +4,10 @@ tags:
4
  - prime-rl
5
  - verifiers
6
  - prime-intellect
 
 
 
 
7
  license: mit
8
  language:
9
  - en
@@ -14,6 +18,20 @@ pipeline_tag: text-generation
14
 
15
  # INTELLECT-3
16
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
17
  **INTELLECT-3** is a 106B (A12B) parameter Mixture-of-Experts reasoning model post-trained from [GLM-4.5-Air-Base](https://huggingface.co/zai-org/GLM-4.5-Air-Base) using supervised fine-tuning (SFT) followed by large-scale reinforcement learning (RL).
18
 
19
  Training was performed with [prime-rl](https://github.com/PrimeIntellect-ai/prime-rl) using environments built with the [verifiers](https://github.com/PrimeIntellect-ai/verifiers) library. All training and evaluation environments are available on the [Environments Hub](https://app.primeintellect.ai/dashboard/environments).
 
4
  - prime-rl
5
  - verifiers
6
  - prime-intellect
7
+ - reinforcement-learning
8
+ - reasoning
9
+ - agentic
10
+ - mixture-of-experts
11
  license: mit
12
  language:
13
  - en
 
18
 
19
  # INTELLECT-3
20
 
21
+ <div align="center">
22
+ <img src="pi_logo.png" alt="Prime Intellect Logo" width="240px"/>
23
+ </div>
24
+
25
+ <p align="center">
26
+ πŸš€ <strong>State-of-the-art 100B+ parameter Mixture-of-Experts model trained with large-scale reinforcement learning</strong>
27
+ <br><br>
28
+ πŸ“š Trained with <a href="https://github.com/PrimeIntellect-ai/prime-rl">prime-rl</a> infra and <a href="https://github.com/PrimeIntellect-ai/verifiers">verifiers</a> environments | 🌐 Environments on <a href="https://app.primeintellect.ai/dashboard/environments">Environments Hub</a>
29
+ <br>
30
+ πŸ“– Read the <a href="https://primeintellect.ai/blog/intellect-3">Technical Report</a> | πŸ’¬ Join our <a href="https://discord.gg/RC5GvMbfDf">Discord</a>
31
+ </p>
32
+
33
+ ## Introduction
34
+
35
  **INTELLECT-3** is a 106B (A12B) parameter Mixture-of-Experts reasoning model post-trained from [GLM-4.5-Air-Base](https://huggingface.co/zai-org/GLM-4.5-Air-Base) using supervised fine-tuning (SFT) followed by large-scale reinforcement learning (RL).
36
 
37
  Training was performed with [prime-rl](https://github.com/PrimeIntellect-ai/prime-rl) using environments built with the [verifiers](https://github.com/PrimeIntellect-ai/verifiers) library. All training and evaluation environments are available on the [Environments Hub](https://app.primeintellect.ai/dashboard/environments).