Improve model card: Add pipeline tag, library name, paper, code, and detailed description

by nielsr HF Staff - opened Oct 4

←

nielsr

Oct 4

This PR significantly enhances the model card for ExGRPO by:

Adding library_name: transformers to enable the automated "How to use" widget, as evidenced by config.json (Qwen2ForCausalLM architecture).
Adding pipeline_tag: text-generation for better discoverability on the Hub, consistent with the model's architecture and purpose.
Incorporating a detailed introduction, key highlights, installation guide, data preparation steps, main results, and the official citation, all sourced directly from the paper's abstract and the GitHub README.
Linking directly to the research paper ExGRPO: Learning to Reason from Experience and the GitHub repository https://github.com/ElliottYan/LUFFY/tree/main/ExGRPO.

Please review and merge these enhancements.

rzzhan changed pull request status to merged Oct 24

Upload images, audio, and videos by dragging in the text input, pasting, or clicking here.

Tap or paste here to upload images

· Sign up or log in to comment