Improve model card, add link to code

This PR improves the model card by adding a link to the paper [GuardReasoner: Towards Reasoning-based LLM Safeguards](https://huggingface.co/papers/2501.18492).

It also adds a license and changes the pipeline tag to text generation since the model generates text. It also links to the Github repository.

Please review and merge this PR if everything looks good.

Files changed (1) hide show

README.md +4 -2

README.md CHANGED Viewed

@@ -1,6 +1,6 @@
 ---
 library_name: transformers
-license: other
 base_model: meta-llama/Llama-3.2-1B
 tags:
 - llama-factory
@@ -9,7 +9,7 @@ tags:
 model-index:
 - name: GuardReasoner 1B
   results: []
-pipeline_tag: text-classification
 language:
 - en
 metrics:
@@ -20,6 +20,8 @@ metrics:
 This model is a fine-tuned version of [meta-llama/Llama-3.2-1B](https://huggingface.co/meta-llama/Llama-3.2-1B) via R-SFT and HS-DPO. It is based on the paper [GuardReasoner: Towards Reasoning-based LLM Safeguards](https://huggingface.co/papers/2501.18492).
 The training data of R-SFT can be found in [GuardReasonerTrain](https://huggingface.co/datasets/yueliu1999/GuardReasonerTrain).

 ---
 library_name: transformers
+license: apache-2.0
 base_model: meta-llama/Llama-3.2-1B
 tags:
 - llama-factory
 model-index:
 - name: GuardReasoner 1B
   results: []
+pipeline_tag: text-generation
 language:
 - en
 metrics:
 This model is a fine-tuned version of [meta-llama/Llama-3.2-1B](https://huggingface.co/meta-llama/Llama-3.2-1B) via R-SFT and HS-DPO. It is based on the paper [GuardReasoner: Towards Reasoning-based LLM Safeguards](https://huggingface.co/papers/2501.18492).
+Code: https://github.com/yueliu1999/GuardReasoner/
 The training data of R-SFT can be found in [GuardReasonerTrain](https://huggingface.co/datasets/yueliu1999/GuardReasonerTrain).