amazingvince
committed on
Update README.md
README.md
CHANGED
@@ -4,4 +4,52 @@ datasets:
 - BEE-spoke-data/fineweb-cryptid-5k
 ---

-New model tuning strategy. Adding text to make this long enough.
+New model tuning strategy. Adding text to make this long enough.
+
+Model Description
+
+The Cryptid Detection Model is designed to generate and classify text related to cryptids, which are creatures from folklore and urban legends. The model is trained on a diverse dataset of cryptid-related content, including descriptions, stories, sightings, and various folklore sources.
+Intended Use
+
+Primary Use Case: Generating and classifying text about cryptids for entertainment, research, and educational purposes.
+Secondary Use Cases: Assisting in the creation of cryptid-related content for books, articles, and media.
+
+Input and Output
+
+Input: Text prompts or descriptions.
+Output: Generated text about cryptids or classifications of the input text as related to specific cryptids.
+
+Training Data
+
+The model was trained on a curated dataset of cryptid-related text, including but not limited to:
+
+Books and articles about cryptids.
+Online forums and discussion boards.
+Folklore databases.
+User-submitted stories and sightings.
+
+Data Preprocessing
+
+Text cleaning: Removal of special characters, HTML tags, and excessive whitespace.
+Tokenization: Breaking down text into tokens for training.
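The two preprocessing steps above could look roughly like the following. This is a minimal sketch using only the Python standard library, not the pipeline actually used for this model: the function names `clean_text` and `tokenize`, the set of characters treated as "special", and the word-level tokenizer are all assumptions made for illustration (a real training run would most likely use the model's own subword tokenizer).

```python
import html
import re

# Assumption: anything that looks like <...> is an HTML tag.
TAG_RE = re.compile(r"<[^>]+>")
# Assumption: "special characters" means everything except word characters,
# whitespace, and a small set of common punctuation marks.
SPECIAL_RE = re.compile(r"[^\w\s.,;:!?()'-]")
WHITESPACE_RE = re.compile(r"\s+")


def clean_text(raw: str) -> str:
    """Strip HTML tags, drop special characters, and collapse whitespace."""
    text = TAG_RE.sub(" ", raw)         # remove tags first
    text = html.unescape(text)          # then decode entities such as &amp;
    text = SPECIAL_RE.sub("", text)     # drop characters outside the allowed set
    return WHITESPACE_RE.sub(" ", text).strip()


def tokenize(text: str) -> list[str]:
    """Hypothetical word-level stand-in for the real tokenizer."""
    return re.findall(r"\w+|[^\w\s]", text)


if __name__ == "__main__":
    cleaned = clean_text("<p>Bigfoot   was seen near the   lake!</p>")
    print(cleaned)
    print(tokenize(cleaned))
```

Tags are removed before entities are decoded so that escaped markup such as `&lt;b&gt;` is not mistaken for a real tag.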
+
+Model Performance
+
+Metrics: [Accuracy, F1 Score, Precision, Recall, etc.]
+Evaluation: The model was evaluated on a validation set consisting of [describe the validation set].
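For reference, the metrics named above can be computed from a list of true and predicted labels as in the sketch below. The helper `binary_metrics` and the toy labels are hypothetical and exist only to show the formulas; they are not results for this model, and the validation set is still unspecified in the card.

```python
def binary_metrics(y_true, y_pred, positive):
    """Accuracy plus precision, recall, and F1 for one class treated as positive."""
    pairs = list(zip(y_true, y_pred))
    tp = sum(t == positive and p == positive for t, p in pairs)  # true positives
    fp = sum(t != positive and p == positive for t, p in pairs)  # false positives
    fn = sum(t == positive and p != positive for t, p in pairs)  # false negatives
    correct = sum(t == p for t, p in pairs)

    precision = tp / (tp + fp) if tp + fp else 0.0
    recall = tp / (tp + fn) if tp + fn else 0.0
    f1 = (2 * precision * recall / (precision + recall)
          if precision + recall else 0.0)
    accuracy = correct / len(pairs) if pairs else 0.0
    return {"accuracy": accuracy, "precision": precision,
            "recall": recall, "f1": f1}


if __name__ == "__main__":
    # Toy labels, invented for illustration only.
    y_true = ["mothman", "bigfoot", "mothman", "other"]
    y_pred = ["mothman", "mothman", "bigfoot", "other"]
    print(binary_metrics(y_true, y_pred, positive="mothman"))
```

For a multi-class cryptid classifier, the same function can be called once per class and the per-class scores averaged.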
+
+Limitations and Biases
+
+Biases: The model may reflect biases present in the training data, such as regional biases in folklore or common myths.
+Limitations: The model may not accurately generate or classify less common or very specific cryptids.
+
+Ethical Considerations
+
+The model is intended for entertainment and educational purposes. It should not be used as a factual source for scientific research or investigation.
+Users should be aware of the potential for generating content that might be misinterpreted as factual.
+
+Future Work
+
+Expanding the training dataset to include more diverse sources.
+Improving classification accuracy for less common cryptids.
+Adding functionality for multilingual support.