Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
OpenNLPLab
/
TransNormerLLM-385M
like
10
Text Generation
Transformers
PyTorch
English
Chinese
TransNormerLLM
custom_code
arxiv:
2307.14995
arxiv:
2009.03300
License:
other
Model card
Files
Files and versions
xet
Community
1
Deploy
Use this model
e423fc2
TransNormerLLM-385M
799 MB
1 contributor
History:
3 commits
OpenNLPLab
Publish 385M Model
e423fc2
over 2 years ago
.gitattributes
1.52 kB
initial commit
over 2 years ago
README.md
98 Bytes
Update README.md
over 2 years ago
config.json
1.03 kB
Publish 385M Model
over 2 years ago
configuration_transnormer.py
2.27 kB
Publish 385M Model
over 2 years ago
generation_config.json
110 Bytes
Publish 385M Model
over 2 years ago
lightning_attention.py
15.3 kB
Publish 385M Model
over 2 years ago
modeling_transnormer.py
40.3 kB
Publish 385M Model
over 2 years ago
norm.py
1.25 kB
Publish 385M Model
over 2 years ago
pytorch_model.bin
798 MB
xet
Publish 385M Model
over 2 years ago
special_tokens_map.json
410 Bytes
Publish 385M Model
over 2 years ago
srmsnorm_triton.py
5.75 kB
Publish 385M Model
over 2 years ago
tokenization_baichuan.py
9.82 kB
Publish 385M Model
over 2 years ago
tokenizer.model
1.14 MB
xet
Publish 385M Model
over 2 years ago
tokenizer_config.json
819 Bytes
Publish 385M Model
over 2 years ago
utils.py
3.77 kB
Publish 385M Model
over 2 years ago