Improve model card
#2 opened by nielsr (HF staff)
README.md CHANGED
@@ -15,7 +15,7 @@ pipeline_tag: text-generation
 # GuardReasoner 3B
 
 This model is a fine-tuned version of [meta-llama/Llama-3.2-3B](https://huggingface.co/meta-llama/Llama-3.2-3B) via R-SFT and HS-DPO, as described in [GuardReasoner: Towards Reasoning-based LLM Safeguards](https://huggingface.co/papers/2501.18492).
-
+Code: https://github.com/yueliu1999/GuardReasoner/.
 
 The training data of R-SFT can be found in [GuardReasonerTrain](https://huggingface.co/datasets/yueliu1999/GuardReasonerTrain).
 
@@ -59,8 +59,6 @@ It is not an easy decision to make, but it is one that I believe is necessary if
 
 I'm incredibly thankful for the opportunity to invest so much time and energy into building a platform that brought people closer and changed the world, but I hope that this decision will serve as a fresh start with a renewed focus on personal relationships and human connection.
 
-Thank you to all of you who have been a part of this journey. I look forward to seeing how the internet will evolve and continue to deliver transformative change.
-
 Sincerely,
 Mark
 """]
@@ -80,5 +78,4 @@ print(output)
 journal={arXiv preprint arXiv:2501.18492},
 year={2025}
 }
-```
-
+```