Update README.md
Browse files
README.md
CHANGED
|
@@ -73,6 +73,7 @@ The BF16 version can be served on 2x H200s:
|
|
| 73 |
```bash
|
| 74 |
vllm serve PrimeIntellect/INTELLECT-3 \
|
| 75 |
--tensor-parallel-size 2 \
|
|
|
|
| 76 |
--tool-call-parser qwen3_coder \
|
| 77 |
--reasoning-parser deepseek_r1
|
| 78 |
```
|
|
@@ -81,6 +82,7 @@ The FP8 version can be served on a single H200:
|
|
| 81 |
|
| 82 |
```bash
|
| 83 |
vllm serve PrimeIntellect/INTELLECT-3-FP8 \
|
|
|
|
| 84 |
--tool-call-parser qwen3_coder \
|
| 85 |
--reasoning-parser deepseek_r1
|
| 86 |
```
|
|
|
|
| 73 |
```bash
|
| 74 |
vllm serve PrimeIntellect/INTELLECT-3 \
|
| 75 |
--tensor-parallel-size 2 \
|
| 76 |
+
--enable-auto-tool-choice \
|
| 77 |
--tool-call-parser qwen3_coder \
|
| 78 |
--reasoning-parser deepseek_r1
|
| 79 |
```
|
|
|
|
| 82 |
|
| 83 |
```bash
|
| 84 |
vllm serve PrimeIntellect/INTELLECT-3-FP8 \
|
| 85 |
+
--enable-auto-tool-choice \
|
| 86 |
--tool-call-parser qwen3_coder \
|
| 87 |
--reasoning-parser deepseek_r1
|
| 88 |
```
|