m3rg-iitd commited on
Commit
407931f
·
verified ·
1 Parent(s): fb5c144

Create README.md

Browse files
Files changed (1) hide show
  1. README.md +55 -0
README.md ADDED
@@ -0,0 +1,55 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ license: llama3
3
+ language:
4
+ - en
5
+ base_model:
6
+ - meta-llama/Meta-Llama-3-8B
7
+ tags:
8
+ - materials science
9
+ - large language model
10
+ ---
11
+ # Model Card for LLaMat-3
12
+
13
+ **LLaMat-3** is a specialized large language model designed to be a foundational large language model for materials science.
14
+
15
+ ---
16
+
17
+ ## Overview
18
+
19
+ - **Model Type:** Large Language Model (LLM)
20
+ - **Base Model:** LLaMat-3 (continued pretraining of LLaMA-3 on material science data)
21
+ - **Language:** English
22
+ - **License:** LLaMA-3 License
23
+ - **Tags:** Material Science, Domain Adaptation, Table Understanding, Scientific Data Parsing, Materials Copilot
24
+
25
+ ---
26
+
27
+ ## Model Details
28
+
29
+ ### Key Features
30
+
31
+ - **Applications:** Can be finetuned for information extraction, table understanding, parsing data for research tasks, and crystal structure generation.
32
+
33
+ ### Development and Support
34
+ - **Developed by:** M3RG, IIT Delhi
35
+ - **Compute Support:**
36
+ - **Edinburgh International Data Facility (EIDF):** Provided access to Cerebras CS2 clusters for pretraining.
37
+ - **IIT Delhi High-Performance Computing Cluster:** Supported fine-tuning and inference stages.
38
+
39
+ ---
40
+
41
+ ## Technical Specifications
42
+
43
+ ### Hardware Infrastructure
44
+ - **Pretraining:** 2 Cerebras CS-2 Wafer-Scale Engines (WSE-2)
45
+
46
+ ### Software Stack
47
+ - **Frameworks:** PyTorch, Hugging Face Transformers
48
+
49
+ ---
50
+
51
+ ## Model Sources
52
+ - **Repository:** [LLaMat on GitHub](https://github.com/M3RG-IITD/llamat)
53
+ - **Compute Resources:** [EIDF Cerebras CS Clusters](https://edinburgh-international-data-facility.ed.ac.uk/services/computing/cerebras-cs)
54
+
55
+ ---