zhao1iang committed on
Commit
0302a27
·
verified ·
1 Parent(s): 3c30432

Update README.md

Files changed (1)
  1. README.md +1 -1
README.md CHANGED
@@ -37,7 +37,7 @@ As of September 2024, Skywork-Critic-Llama3.1-70B **ranks first** on RewardBench
 
 | Model | Chat | Chat Hard | Safety | Reasoning | Overall Score |
 | ------------------------------- | :---: | :-------: | :----: | :-------: | :---: |
-| **Skywork-Critic-Llama3.1-70B** * | **96.9** | **88.4** | **93.2** | **95.4** | **93.4** |
+| **Skywork-Critic-Llama3.1-70B** * | **96.6** | **87.9** | **93.1** | **95.5** | **93.3** |
 | Salesforce/SFR-LLaMa-3.1-70B-Judge-r | 96.9 | 84.8 | 91.6 | 97.6 | 92.7 |
 | Salesforce/SFR-nemo-12B-Judge-r | 97.2 | 82.2 | 86.5 | 95.1 | 90.3 |
 | **Skywork-Critic-Llama3.1-8B** * | **93.6** | **81.4** | **91.1** | **89.8** | **89.0** |
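
A quick sanity check on the updated row, as a minimal sketch: it assumes (this is not stated in the README) that the Overall Score is roughly the unweighted mean of the four category scores. The numbers are copied from the table in this diff; the helper name `mean_gap` is hypothetical.

```python
# Category scores (Chat, Chat Hard, Safety, Reasoning) and the reported
# Overall Score, copied from the table as it stands after this commit.
rows = {
    "Skywork-Critic-Llama3.1-70B": ([96.6, 87.9, 93.1, 95.5], 93.3),
    "Salesforce/SFR-LLaMa-3.1-70B-Judge-r": ([96.9, 84.8, 91.6, 97.6], 92.7),
    "Salesforce/SFR-nemo-12B-Judge-r": ([97.2, 82.2, 86.5, 95.1], 90.3),
    "Skywork-Critic-Llama3.1-8B": ([93.6, 81.4, 91.1, 89.8], 89.0),
}

# The 70B row as it stood before this commit.
old_70b = ([96.9, 88.4, 93.2, 95.4], 93.4)


def mean_gap(scores, reported):
    """Absolute gap between the unweighted category mean and the reported overall.

    Assumption: overall = simple mean of the four categories. The README
    does not confirm this; the check only shows whether the numbers fit it.
    """
    return abs(sum(scores) / len(scores) - reported)


gaps = {name: mean_gap(scores, reported) for name, (scores, reported) in rows.items()}
old_gap = mean_gap(*old_70b)

for name, gap in gaps.items():
    print(f"{name}: gap to simple mean = {gap:.3f}")
print(f"70B row before this commit: gap to simple mean = {old_gap:.3f}")
```

Under that assumption, every row in the updated table sits within rounding distance of its simple mean, while the replaced 70B row sits slightly further away. This only shows that the numbers fit the assumption; it says nothing about how the scores were actually produced.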