README.md · netcat420/MFANN3bv0.16.11 at main

metadata

license: mit
datasets:
  - netcat420/MFANN
language:
  - en
pipeline_tag: text-generation

Task vector optimization checkpoint, ready for merging.

Trained on MFANN for 12,000 steps. Because of a slightly higher training loss, I'm going to merge this model with the last version and retrain. The goal was to use DARE-TIES to reduce the number of parameters used per task vector. This model will now be merged with the last pre-DARE model using TIES alone, and the result will then be retrained.
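For readers unfamiliar with the two merge methods named above, here is a minimal NumPy sketch of the general idea on toy arrays. It is not the pipeline used to produce this checkpoint: the function names `dare` and `ties_merge`, the `drop_rate` and `density` parameters, and the example values are all hypothetical and chosen only for illustration. A task vector here means the difference between fine-tuned weights and base weights.

```python
import numpy as np


def dare(delta, drop_rate, rng):
    """DARE step: randomly zero a fraction of task-vector entries, rescale the rest."""
    keep = rng.random(delta.shape) >= drop_rate
    # Rescaling by 1 / (1 - drop_rate) keeps the expected value of each entry unchanged.
    return delta * keep / (1.0 - drop_rate)


def ties_merge(deltas, density):
    """TIES step: trim by magnitude, elect a sign per entry, average the agreeing values."""
    trimmed = []
    for d in deltas:
        k = max(1, int(round(density * d.size)))
        thresh = np.sort(np.abs(d).ravel())[-k]  # k-th largest magnitude
        trimmed.append(np.where(np.abs(d) >= thresh, d, 0.0))
    stacked = np.stack(trimmed)
    # The elected sign is the sign carrying the larger total magnitude.
    elected = np.sign(stacked.sum(axis=0))
    agree = (np.sign(stacked) == elected) & (stacked != 0)
    counts = agree.sum(axis=0)
    summed = (stacked * agree).sum(axis=0)
    return np.divide(summed, counts, out=np.zeros_like(summed), where=counts > 0)


# Toy task vectors (hypothetical values, not taken from any real model).
delta_a = np.array([0.5, -0.1, 0.3, 0.0])
delta_b = np.array([0.4, 0.2, -0.6, 0.05])

# TIES alone: keep the top half of each vector by magnitude, then merge.
merged = ties_merge([delta_a, delta_b], density=0.5)

# DARE-TIES (assumed composition): sparsify each vector with DARE first, then apply TIES.
rng = np.random.default_rng(0)
sparse = [dare(d, drop_rate=0.5, rng=rng) for d in (delta_a, delta_b)]
merged_dare = ties_merge(sparse, density=1.0)
```

In both cases the merged task vector would be added back to the base weights to obtain the merged model. The difference the text refers to is only in how each task vector is sparsified before merging: random dropping with rescaling for DARE, magnitude-based trimming for TIES.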