rajab7 commited on
Commit
b963f7b
·
verified ·
1 Parent(s): 27de13f

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +11 -1
README.md CHANGED
@@ -22,4 +22,14 @@ widget:
22
  3. [Limitations](##limitations)
23
  4. [Training](##training)
24
  5. [License](##license)
25
- 6. [Citation](##citation)
 
 
 
 
 
 
 
 
 
 
 
22
  3. [Limitations](##limitations)
23
  4. [Training](##training)
24
  5. [License](##license)
25
+ 6. [Citation](##citation)
26
+
27
+ ## Model Summary
28
+
29
+ The JavaCoder models are !B parameter models trained on 80+ programming languages from [The Stack (v1.2)](https://huggingface.co/datasets/bigcode/the-stack), with opt-out requests excluded. The model uses [Multi Query Attention](https://arxiv.org/abs/1911.02150), [a context window of 8192 tokens](https://arxiv.org/abs/2205.14135), and was trained using the [Fill-in-the-Middle objective](https://arxiv.org/abs/2207.14255) on 1 trillion tokens.
30
+
31
+ - **Repository:**
32
+ - **Project Website:**
33
+ - **Paper:**
34
+ - **Point of Contact:**
35
+ - **Languages:** 80+ Programming languages