Improve model card: Add pipeline tag, library name, paper, code, and detailed description

#1
by nielsr HF Staff - opened

This PR significantly enhances the model card for ExGRPO by:

  • Adding library_name: transformers to enable the automated "How to use" widget, as evidenced by config.json (Qwen2ForCausalLM architecture).
  • Adding pipeline_tag: text-generation for better discoverability on the Hub, consistent with the model's architecture and purpose.
  • Incorporating a detailed introduction, key highlights, installation guide, data preparation steps, main results, and the official citation, all sourced directly from the paper's abstract and the GitHub README.
  • Linking directly to the research paper ExGRPO: Learning to Reason from Experience and the GitHub repository https://github.com/ElliottYan/LUFFY/tree/main/ExGRPO.

Please review and merge these enhancements.

rzzhan changed pull request status to merged

Sign up or log in to comment