Improve model card

#2
by nielsr HF staff - opened
Files changed (1)
  1. README.md +2 -5
README.md CHANGED
@@ -15,7 +15,7 @@ pipeline_tag: text-generation
  # GuardReasoner 3B
 
  This model is a fine-tuned version of [meta-llama/Llama-3.2-3B](https://huggingface.co/meta-llama/Llama-3.2-3B) via R-SFT and HS-DPO, as described in [GuardReasoner: Towards Reasoning-based LLM Safeguards](https://huggingface.co/papers/2501.18492).
-
+ Code: https://github.com/yueliu1999/GuardReasoner/.
 
  The training data of R-SFT can be found in [GuardReasonerTrain](https://huggingface.co/datasets/yueliu1999/GuardReasonerTrain).
 
@@ -59,8 +59,6 @@ It is not an easy decision to make, but it is one that I believe is necessary if
 
  I'm incredibly thankful for the opportunity to invest so much time and energy into building a platform that brought people closer and changed the world, but I hope that this decision will serve as a fresh start with a renewed focus on personal relationships and human connection.
 
- Thank you to all of you who have been a part of this journey. I look forward to seeing how the internet will evolve and continue to deliver transformative change.
-
  Sincerely,
  Mark
  """]
@@ -80,5 +78,4 @@ print(output)
  journal={arXiv preprint arXiv:2501.18492},
  year={2025}
  }
- ```
-
+ ```