Update README.md
Browse files
README.md
CHANGED
@@ -63,7 +63,10 @@ The model demonstrates that learning to critique is more effective than learning
|
|
63 |
## Evaluation Results
|
64 |
|
65 |
|
|
|
|
|
66 |
![image/png](https://cdn-uploads.huggingface.co/production/uploads/636a35eff8d9af4aea181608/ifPVcA7-aAdzbxX8U6wat.png)
|
|
|
67 |
|
68 |
For more details about the model architecture, methodology, and comprehensive evaluation results, please visit our [project webpage](https://tiger-ai-lab.github.io/CritiqueFineTuning).
|
69 |
|
|
|
63 |
## Evaluation Results
|
64 |
|
65 |
|
66 |
+
|
67 |
+
|
68 |
![image/png](https://cdn-uploads.huggingface.co/production/uploads/636a35eff8d9af4aea181608/ifPVcA7-aAdzbxX8U6wat.png)
|
69 |
+
*Table 1: Performance comparison of Qwen2.5-Math-7B-CFT vs. other reasoning-specialized models.*
|
70 |
|
71 |
For more details about the model architecture, methodology, and comprehensive evaluation results, please visit our [project webpage](https://tiger-ai-lab.github.io/CritiqueFineTuning).
|
72 |
|