Update README.md
README.md
CHANGED
@@ -14,8 +14,7 @@ tags:
 
 This LoRA was trained on 250k post and response pairs from 43 different financial, investing, and crypto subreddits. It is not an instruct model; it is designed to generate a reply to a reddit text post. It was an experiment in fine-tuning for specific tasks. **Use it responsibly**
 
-* Training
-* Dataset and tools for building the dataset will be released soon.
+* Training, dataset and tools available here: <https://github.com/getorca/ProfitsBot_V0_OLLM>
 
 ## Training Details
 
@@ -25,7 +24,7 @@ Base Model: llama 7b
 
 * One noteworthy change I will mention now is that this was trained with causal LM rather than seq2seq, as a number of the other instruct models have been. I can't explain why they used seq2seq data collators, other than that's what alpaca-lora originally used. LLaMA as a generative model was trained for causal LM, so to me it makes sense to use that when fine-tuning.
 
-
+Training code is available here: <https://github.com/getorca/ProfitsBot_V0_OLLM/tree/main/training>
 
 ### Training Hyperparams
 
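For readers unfamiliar with the distinction in that hunk: with Hugging Face `transformers`, a causal-LM setup typically means `DataCollatorForLanguageModeling` with `mlm=False` instead of a seq2seq collator. A minimal sketch follows; the tokenizer checkpoint id is an assumption, not necessarily what the repo's training code loads:

```python
# Sketch of a causal-LM data collator setup, as contrasted with seq2seq.
# Assumption: "huggyllama/llama-7b" stands in for whatever llama 7b
# checkpoint the actual training code uses.
from transformers import AutoTokenizer, DataCollatorForLanguageModeling

tokenizer = AutoTokenizer.from_pretrained("huggyllama/llama-7b")
tokenizer.pad_token = tokenizer.eos_token  # llama ships without a pad token

# mlm=False makes this a plain next-token-prediction (causal LM) collator:
# labels are copies of input_ids, and the model shifts them internally,
# rather than building separate encoder/decoder inputs as seq2seq does.
collator = DataCollatorForLanguageModeling(tokenizer=tokenizer, mlm=False)
```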
@@ -59,7 +58,9 @@ Base Model: llama 7b
 | crypto_currency | 186 | 0.694596 | 1.101901 | 0.026738 |
 | StocksAndTrading | 93 | 0.184637 | 1.704545 | 0.019066 |
 
-
+The dataset is available here: <https://huggingface.co/datasets/winddude/reddit_finance_43_250k>
+
+Code for recreating the dataset is here: <https://github.com/getorca/ProfitsBot_V0_OLLM/tree/main/ds_builder>
 
 ## Usage
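A minimal sketch of pulling the linked dataset with the `datasets` library; the split name and record layout are assumptions, so check the dataset card for the actual schema:

```python
# Sketch: load the 250k post/reply dataset from the Hugging Face hub.
from datasets import load_dataset

# split="train" is an assumption about how the dataset is published.
ds = load_dataset("winddude/reddit_finance_43_250k", split="train")
print(ds[0])  # inspect one record to see the actual fields
```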
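Since the diff ends at the `## Usage` heading, here is a minimal sketch of applying a LoRA adapter like this one to llama 7b with `peft`; the base checkpoint id, adapter path, and prompt format are all placeholders, not the repo's documented usage:

```python
# Sketch: attach the LoRA adapter to a llama 7b base model and generate
# a reply to a reddit-style post. Adapter id below is a placeholder.
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer
from peft import PeftModel

base = AutoModelForCausalLM.from_pretrained(
    "huggyllama/llama-7b", torch_dtype=torch.float16, device_map="auto"
)
tokenizer = AutoTokenizer.from_pretrained("huggyllama/llama-7b")
model = PeftModel.from_pretrained(base, "path/to/this-lora-adapter")  # placeholder

post = "What do you think about dollar cost averaging into index funds?"
inputs = tokenizer(post, return_tensors="pt").to(model.device)
out = model.generate(**inputs, max_new_tokens=128, do_sample=True, top_p=0.9)
print(tokenizer.decode(out[0], skip_special_tokens=True))
```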