---
base_model:
- hfl/llama-3-chinese-8b-instruct-v3
- UnicomLLM/Unichat-llama3-Chinese-8B
- shenzhi-wang/Llama3-8B-Chinese-Chat
- yaojialzc/Gigi-Llama3-8B-Chinese-zh
- Rookie/Llama-3-8B-Instruct-Chinese
- FlagAlpha/Llama3-Chinese-8B-Instruct
library_name: transformers
tags:
- mergekit
- merge
license: llama3
language:
- en
- zh
---
# Llama3-zhcn
English
# A Merged Llama 3 Model for Enhanced Chinese Understanding
This model is a merge of several pre-trained Llama 3 8B language models specifically focused on Simplified Chinese (China), created using [mergekit](https://github.com/cg123/mergekit).
## Purpose
Llama3-zhcn aims to deliver a Llama 3 model with deep Chinese cultural, historical, and linguistic comprehension. This model serves dual purposes: handling diverse everyday tasks and providing a solid foundation for additional merging and fine-tuning. As Chinese model development trends away from the Llama series, this merge strives to maintain and improve Llama 3's Chinese language capabilities
## Limitations
* **No Vision Capabilities:** This model is based on Llama 3 and does not include vision capabilities found in some later models like 3.1 and 3.2.
* **Historical Accuracy:** While the model possesses a good understanding of Chinese history, fact-checking is still recommended to ensure accuracy.
* **Translation and Revision:** The model is capable of performing translations and revisions in English and Chinese. However, optimal results may require prompt engineering.
## Merge Details
### Merge Method
This model was created using the [Linear](https://arxiv.org/abs/2203.05482) merge method, utilizing the Meta-Llama-3-8B-Instruct tokenizer.
### Models Merged
The following models were included in the merge:
* [hfl/llama-3-chinese-8b-instruct-v3](https://huggingface.co/hfl/llama-3-chinese-8b-instruct-v3)
* [UnicomLLM/Unichat-llama3-Chinese-8B](https://huggingface.co/UnicomLLM/Unichat-llama3-Chinese-8B)
* [shenzhi-wang/Llama3-8B-Chinese-Chat](https://huggingface.co/shenzhi-wang/Llama3-8B-Chinese-Chat)
* [yaojialzc/Gigi-Llama3-8B-Chinese-zh](https://huggingface.co/yaojialzc/Gigi-Llama3-8B-Chinese-zh)
* [Rookie/Llama-3-8B-Instruct-Chinese](https://huggingface.co/Rookie/Llama-3-8B-Instruct-Chinese)
* [FlagAlpha/Llama3-Chinese-8B-Instruct](https://huggingface.co/FlagAlpha/Llama3-Chinese-8B-Instruct)
Chinese
# 增强中文理解的合并Llama 3模型
该模型是针对简体中文(中国)的多个预训练了8B语言模式进行融合,使用[mergekit](https://github.com/cg123/mergekit)创建。
## 目标:
Llama3-zhcn旨在提供具有深入的中华文化、历史和语法理解能力。该模型有两个目的:处理各种日常任务,并为进一步组合或微调奠定坚实基础。在中国语言模式发展趋势从Llama系列转向时,这个融合尝试维护并改进Llama 3的中文语言功能。
## 限制:
* **无视觉能力:**该模型基于Llama 3,不包括一些后续版本如3.1和3.2中具有的一些图像处理能力。
* **历史准确性:**虽然这个模型对中国有良好的理解,但仍然建议进行事实核查以保证精度。
* **翻译与修订:**该模型可以在英语和中文之间执行翻译和修改。然而,最佳结果可能需要引导工程。
## 合并细节:
### 组合方法
使用线性(Linear)组合法创建此模型,并利用Meta-Llama-3-8B-Instruct分词器进行处理。
### 被融入的模型
以下是被包含在内的模型:
* [hfl/llama-3-chinese-8b-instruct-v3](https://huggingface.co/hfl/llama-3-chinese-8b-instruct-v3)
* [UnicomLLM/Unichat-llama3-Chinese-8B](https://huggingface.co/UnicomLLM/Unichat-llama3-Chinese-8B)
* [shenzhi-wang/Llama3-8B-Chinese-Chat](https://huggingface.co/shenzhi-wang/Llama3-8B-Chinese-Chat)
* [yaojialzc/Gigi-Llama3-8B-Chinese-zh](https://huggingface.co/yaojialzc/Gigi-Llama3-8B-Chinese-zh)
* [Rookie/Llama-3-8B-Instruct-Chinese](https://huggingface.co/Rookie/Llama-3-8B-Instruct-Chinese)
* [FlagAlpha/Llama3-Chinese-8B-Instruct](https://huggingface.co/FlagAlpha/Llama3-Chinese-8B-Instruct)