netcat420
/

MFANN3bv0.16.11

Text Generation

text-generation-inference

Inference Endpoints

Model card Files Files and versions Community

netcat420 commited on Jul 17, 2024

Commit

60d8d4d

·

verified ·

1 Parent(s): cc9c53c

Update README.md

Files changed (1) hide show

README.md +8 -0

README.md CHANGED Viewed

@@ -1,3 +1,11 @@
 task vector optimization checkpoint ready for merging.
 trained on MFANN for 12000 steps, however due to a slightly higher training loss, im going to merge this model with the last version and retrain, the goal was to use DARE-TIES to reduce the parameters used per vector, and this model will now be merged with the last model before DARE using TIES alone, and will be subsequently retrained.

+---
+license: mit
+datasets:
+- netcat420/MFANN
+language:
+- en
+pipeline_tag: text-generation
+---
 task vector optimization checkpoint ready for merging.
 trained on MFANN for 12000 steps, however due to a slightly higher training loss, im going to merge this model with the last version and retrain, the goal was to use DARE-TIES to reduce the parameters used per vector, and this model will now be merged with the last model before DARE using TIES alone, and will be subsequently retrained.