Improve model card: Add pipeline tag, library name, paper, code, and detailed description
#1
by
nielsr
HF Staff
- opened
This PR significantly enhances the model card for ExGRPO by:
- Adding
library_name: transformersto enable the automated "How to use" widget, as evidenced byconfig.json(Qwen2ForCausalLMarchitecture). - Adding
pipeline_tag: text-generationfor better discoverability on the Hub, consistent with the model's architecture and purpose. - Incorporating a detailed introduction, key highlights, installation guide, data preparation steps, main results, and the official citation, all sourced directly from the paper's abstract and the GitHub README.
- Linking directly to the research paper ExGRPO: Learning to Reason from Experience and the GitHub repository
https://github.com/ElliottYan/LUFFY/tree/main/ExGRPO.
Please review and merge these enhancements.
rzzhan
changed pull request status to
merged