anindya64 committed on
Commit
0d4834c
·
verified ·
1 Parent(s): c7d2d02

Update README.md

Files changed (1)
  1. README.md +46 -5
README.md CHANGED
@@ -1,18 +1,59 @@
1
  ---
2
  library_name: transformers
3
- tags: []
4
  ---
5
 
6
  # Prem-1B-SQL
7
 
8
- Prem-1B-SQL is the one of the very first series of fully local Text-to-SQL models developed by Prem AI. Being a 1B parameter model
9
  it easily fits on low GPU devices (and CPU devices when quantized). We believe that AI assisted data analysis should be a Local first
10
- approach. Because exposing Databases to third party closed source models can lead to data security breaches. We will be publishing some
11
- of the public benchmarks results of this model very soon. We will also be iterating on this model for more better results.
12
 
13
  - **Developed by:** [Prem AI](https://www.premai.io/)
14
  - **License:** [MIT]
15
 
16
 
17
  ## How to use Prem-1B-SQL
18
 
@@ -142,4 +183,4 @@ Additionally we made error handling datasets on top of these datasets to make th
142
 
143
  ## Evaluation results of Prem-1B-SQL
144
 
145
- The results of Prem-1B-SQL on some public benchmarks will be published soon.
 
1
  ---
2
  library_name: transformers
3
+ datasets:
4
+ - premai-io/spider
5
+ - premai-io/domains
6
+ - premai-io/birdbench
7
+ - gretelai/synthetic_text_to_sql
8
+
9
+ metrics:
10
+ - accuracy
11
+ base_model:
12
+ - deepseek-ai/deepseek-coder-1.3b-instruct
13
+ pipeline_tag: text2text-generation
14
  ---
15
 
16
  # Prem-1B-SQL
17
 
18
+ Prem-1B-SQL is one of the very first series of fully local Text-to-SQL models developed by Prem AI. Being a 1B parameter model
19
  it easily fits on low GPU devices (and CPU devices when quantized). We believe that AI assisted data analysis should be a Local first
20
+ approach, because exposing databases to third-party closed-source models can lead to data security breaches. We will be publishing some
21
+ of the public benchmark results of this model very soon. We will also be iterating on this model for better results.
22
 
23
  - **Developed by:** [Prem AI](https://www.premai.io/)
24
  - **License:** [MIT]
25
 
26
+ ## Results
27
+
28
+ We evaluated our model on two popular benchmark datasets: BirdBench and Spider. BirdBench consists of a public validation dataset (with 1534 data points) and a private test dataset. Spider provides only a public validation dataset. Here are the results:
29
+
30
+ | Dataset | Execution Accuracy |
31
+ |--------------------------|--------------------|
32
+ | BirdBench (validation) | 46% |
33
+ | BirdBench (private test) | 51.54% |
34
+ | Spider | 85% |
35
+
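Execution accuracy is commonly understood as the share of predicted queries whose execution result matches that of the reference query on the same database. The snippet below is a minimal sketch of that idea, assuming SQLite and an order-insensitive row comparison. The function names `execution_match` and `execution_accuracy` and the toy `users` table are hypothetical illustrations, not the official BirdBench or Spider evaluators.

```python
import sqlite3
from collections import Counter


def execution_match(conn, predicted_sql, gold_sql):
    """Return True if both queries yield the same multiset of rows."""
    try:
        predicted_rows = conn.execute(predicted_sql).fetchall()
    except sqlite3.Error:
        # A predicted query that fails to execute counts as a miss.
        return False
    gold_rows = conn.execute(gold_sql).fetchall()
    # Compare as multisets so that row order does not matter.
    return Counter(predicted_rows) == Counter(gold_rows)


def execution_accuracy(conn, pairs):
    """Percentage of (predicted, gold) pairs whose results match."""
    matches = sum(execution_match(conn, pred, gold) for pred, gold in pairs)
    return 100.0 * matches / len(pairs)


# Toy in-memory database (assumption: any SQLite database would work here).
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE users (id INTEGER, name TEXT)")
conn.executemany("INSERT INTO users VALUES (?, ?)", [(1, "ada"), (2, "bob")])

pairs = [
    # Different SQL text, same result: counts as correct.
    ("SELECT name FROM users WHERE id = 1", "SELECT name FROM users WHERE id < 2"),
    # Valid SQL, different result: counts as incorrect.
    ("SELECT name FROM users", "SELECT name FROM users WHERE id = 1"),
    # Misspelled column makes the query fail: counts as incorrect.
    ("SELECT nme FROM users", "SELECT name FROM users"),
]
score = execution_accuracy(conn, pairs)
```

Comparing results rather than SQL text is what lets two differently written queries both count as correct; real evaluators may differ in how they treat row order, duplicates, and timeouts.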
36
+ The BirdBench dataset is split into difficulty levels. Here is a detailed view of the private test results at each level.
37
+
38
+ | Difficulty | Count | EX | Soft F1 |
39
+ |-------------|-------|---------|---------|
40
+ | Simple | 949 | 60.70 | 61.48 |
41
+ | Moderate | 555 | 47.39 | 49.06 |
42
+ | Challenging | 285 | 29.12 | 31.83 |
43
+ | Total | 1789 | 51.54 | 52.90 |
44
+
45
+
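As a quick consistency check, the "Total" row of the difficulty table above can be reproduced as the count-weighted average of the per-difficulty scores. This is only a sketch: the numbers are copied from that table, and the assumption that the total is a count-weighted mean is ours, not something the README states.

```python
# (count, EX, Soft F1) per difficulty level, copied from the table above.
rows = {
    "Simple": (949, 60.70, 61.48),
    "Moderate": (555, 47.39, 49.06),
    "Challenging": (285, 29.12, 31.83),
}

total_count = sum(count for count, _, _ in rows.values())

# Weight each level's score by the number of examples at that level.
weighted_ex = sum(count * ex for count, ex, _ in rows.values()) / total_count
weighted_f1 = sum(count * f1 for count, _, f1 in rows.values()) / total_count

print(f"count={total_count} EX={weighted_ex:.2f} SoftF1={weighted_f1:.2f}")
# prints: count=1789 EX=51.54 SoftF1=52.90
```

The weighted values agree with the reported totals (1789, 51.54, 52.90), which explains why the overall score sits closer to the "Simple" score than to the unweighted mean of the three levels.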
46
+ Here is a more detailed comparison of popular closed- and open-source models.
47
+
48
+ | Model | # Params (billions) | BirdBench Test Score |
49
+ |-------------------------------|-----------------------|-----------------------|
50
+ | AskData + GPT-4o (current winner) | NA | 72.39 |
51
+ | DeepSeek coder 236B | 236 | 56.68 |
52
+ | GPT-4 (2023) | NA | 54.89 |
53
+ | **PremSQL 1B (ours)** | 1 | 51.4 |
54
+ | Qwen 2.5 7B Instruct | 7 | 51.1 |
55
+ | Claude 2 Base (2023) | NA | 49.02 |
56
+
57
 
58
  ## How to use Prem-1B-SQL
59
 
 
183
 
184
  ## Evaluation results of Prem-1B-SQL
185
 
186
+ The results of Prem-1B-SQL on some public benchmarks will be published soon.